


{"id":1510,"date":"2019-06-27T12:23:10","date_gmt":"2019-06-27T10:23:10","guid":{"rendered":"https:\/\/www-intuidoc.irisa.fr\/?p=1510"},"modified":"2019-06-27T15:47:04","modified_gmt":"2019-06-27T13:47:04","slug":"cbad-table-subset","status":"publish","type":"post","link":"https:\/\/www-intuidoc.irisa.fr\/en\/cbad-table-subset\/","title":{"rendered":"CBAD &#8211; table subset"},"content":{"rendered":"<p><\/p>\n<h3>Description<\/h3>\n<p>As part of our work on text-line localization in handwritten documents containing tables, we propose evaluating our system on a subset of the cBAD dataset (Competition on Baseline Detection, ICDAR 2017 [1], track B) that contains exclusively documents with tabular structures. The dataset of the cBAD competition is available <a href=\"https:\/\/scriptnet.iit.demokritos.gr\/competitions\/5\/1\/\">here<\/a>.<\/p>\n<p>Identifying which structures can be considered tabular in the cBAD dataset is not always obvious. We therefore apply the following rule to select the documents of the subset.<\/p>\n<p>A document contains a tabular structure if at least one of the following two properties holds:<\/p>\n<p>\u2022 the tabular structure is delimited by vertical and horizontal rulings,<\/p>\n<p>\u2022 the columns of the tabular structure are delimited by vertical rulings and those columns are named.<\/p>\n<p>The table subset comprises 315 documents (51 084 text lines).<\/p>\n<p>[1] M. Diem, F. Kleber, S. Fiel, T. Gr\u00fcning, B. Gatos (2017, November). cBAD: ICDAR2017 Competition on Baseline Detection. In 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR) (Vol. 1, pp. 1355-1360). 
IEEE.<\/p>\n<h3>Data<\/h3>\n<p><a href=\"https:\/\/www-intuidoc.irisa.fr\/files\/2019\/06\/table_test_set.txt\">Download the list of images chosen for our table subset.<\/a><\/p>","protected":false},"excerpt":{"rendered":"<h3>Description<\/h3>\n<p>As part of our work on text-line localization in handwritten documents containing tables, we propose evaluating our system on a subset of the cBAD dataset (Competition on Baseline Detection, ICDAR 2017 [1], track B) that contains exclusively documents with tabular structures. The dataset of the cBAD competition is available <a href=\"https:\/\/scriptnet.iit.demokritos.gr\/competitions\/5\/1\/\">here<\/a>.<\/p>\n<p>Identifying &hellip; <\/p>\n<p><a class=\"more-link btn\" href=\"https:\/\/www-intuidoc.irisa.fr\/en\/cbad-table-subset\/\">Continue reading<\/a><\/p>\n","protected":false},"author":1565,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[13],"tags":[],"class_list":["post-1510","post","type-post","status-publish","format-standard","hentry","category-bases-de-donnees","nodate","item-wrap"],"_links":{"self":[{"href":"https:\/\/www-intuidoc.irisa.fr\/en\/wp-json\/wp\/v2\/posts\/1510","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www-intuidoc.irisa.fr\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www-intuidoc.irisa.fr\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www-intuidoc.irisa.fr\/en\/wp-json\/wp\/v2\/users\/1565"}],"replies":[{"embeddable":true,"href":"https:\/\/www-intuidoc.irisa.fr\/en\/wp-json\/wp\/v2\/comments?post=1510"}],"version-history":[{"count":3,"href":"https:\/\/www-intuidoc.irisa.fr\/en\/wp-json\/wp\/v2\/posts\/1510\/revisions"}],"predecessor-version":[{"id":1516,"href":"https:\/\/www-intuidoc.irisa.fr\/en\/wp-json\/wp\/v2\/posts\/1510\/revisions\/1516"}],"wp:attachment":[{"href":"https:\/\/www-intuidoc.irisa.fr\/e
n\/wp-json\/wp\/v2\/media?parent=1510"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www-intuidoc.irisa.fr\/en\/wp-json\/wp\/v2\/categories?post=1510"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www-intuidoc.irisa.fr\/en\/wp-json\/wp\/v2\/tags?post=1510"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}