Segmentation of heterogeneous document images : an approach based on machine learning, connected components analysis, and texture analysis
Document page segmentation is one of the most crucial steps in document image analysis. It ideally aims to explain the full structure of any document page, distinguishing text zones, graphics, photographs, halftones, figures, tables, etc. Although to date, there have been made several attempts of ac...
Main Author: | |
---|---|
Language: | English |
Published: |
Université Paris-Est
2012
|
Subjects: | |
Online Access: | http://tel.archives-ouvertes.fr/tel-00912566 http://tel.archives-ouvertes.fr/docs/00/91/25/66/PDF/TH2012PEST1063_complete.pdf |