Segmentation Methodology of Table-Form Documents

This article presents a method for the automatic extraction of the contents of passive and/or active cells in forms. The approach is based on the analysis and recognition of the types of intersection of the lines that make up such cells. Very little a priori knowledge of the form is required. The p...

Full description

Bibliographic Details
Main Authors: Luiz Antonio Pereira Neves, Jacques Facon
Format: Article
Language:English
Published: Centro Latinoamericano de Estudios en Informática 2001-12-01
Series:CLEI Electronic Journal
Online Access:http://clei.org/cleiej-beta/index.php/cleiej/article/view/364
id doaj-5c6b656d443042b9bc40fadb7b020bd4
record_format Article
spelling doaj-5c6b656d443042b9bc40fadb7b020bd42020-11-24T21:01:23ZengCentro Latinoamericano de Estudios en InformáticaCLEI Electronic Journal0717-50002001-12-014210.19153/cleiej.4.2.1Segmentation Methodology of Table-Form DocumentsLuiz Antonio Pereira Neves0Jacques Facon1PUCPR-Curitiba, PR, BrazilPUCPR-Curitiba, PR, Brazil This article presents a method for the automatic extraction of the contents of passive and/or active cells in forms. The approach is based on the analysis and recognition of the types of intersection of the lines that make up such cells. Very little a priori knowledge of the form is required. The performance of this approach depends on the correction module mechanisms for detection and correction of errors generated during the intersection identification phase. The potentialities and advantages of this approach are described and illustrated with tests carried out on different form bases.  http://clei.org/cleiej-beta/index.php/cleiej/article/view/364
collection DOAJ
language English
format Article
sources DOAJ
author Luiz Antonio Pereira Neves
Jacques Facon
spellingShingle Luiz Antonio Pereira Neves
Jacques Facon
Segmentation Methodology of Table-Form Documents
CLEI Electronic Journal
author_facet Luiz Antonio Pereira Neves
Jacques Facon
author_sort Luiz Antonio Pereira Neves
title Segmentation Methodology of Table-Form Documents
title_short Segmentation Methodology of Table-Form Documents
title_full Segmentation Methodology of Table-Form Documents
title_fullStr Segmentation Methodology of Table-Form Documents
title_full_unstemmed Segmentation Methodology of Table-Form Documents
title_sort segmentation methodology of table-form documents
publisher Centro Latinoamericano de Estudios en Informática
series CLEI Electronic Journal
issn 0717-5000
publishDate 2001-12-01
description This article presents a method for the automatic extraction of the contents of passive and/or active cells in forms. The approach is based on the analysis and recognition of the types of intersection of the lines that make up such cells. Very little a priori knowledge of the form is required. The performance of this approach depends on the correction module mechanisms for detection and correction of errors generated during the intersection identification phase. The potentialities and advantages of this approach are described and illustrated with tests carried out on different form bases. 
url http://clei.org/cleiej-beta/index.php/cleiej/article/view/364
work_keys_str_mv AT luizantoniopereiraneves segmentationmethodologyoftableformdocuments
AT jacquesfacon segmentationmethodologyoftableformdocuments
_version_ 1716778180604329984