(High Accuracy Text Detection and Recognition using ResNet as Feature Extraction

碩士 === 大同大學 === 資訊工程學系(所) === 107 === In passed decades, a complete procedure in Optical Character Recognition (OCR) has established. The procedure is mainly split into three parts: text detection, text recognition, and post-processing. In fact, traditional OCR system cannot handle scene text. Text...

Full description

Bibliographic Details
Main Authors:	Chi-Hsin Yang, 楊啟昕
Other Authors:	Chen-Chiung Hsieh
Format:	Others
Language:	en_US
Published:	2019
Online Access:	http://ndltd.ncl.edu.tw/handle/ry38j8

id	ndltd-TW-107TTU05392018
record_format	oai_dc
spelling	ndltd-TW-107TTU053920182019-11-05T03:37:54Z http://ndltd.ncl.edu.tw/handle/ry38j8 (High Accuracy Text Detection and Recognition using ResNet as Feature Extraction 使用ResNet之高精度文字偵測與辨識 Chi-Hsin Yang 楊啟昕碩士大同大學資訊工程學系(所) 107 In passed decades, a complete procedure in Optical Character Recognition (OCR) has established. The procedure is mainly split into three parts: text detection, text recognition, and post-processing. In fact, traditional OCR system cannot handle scene text. Text detection in natural scene images is much more difficult than the recognition of text in scanned document images because of its complicating background. Moreover, Chinese character detection and recognition are difficult topic in OCR due to its numerous characters. Popular existing methods for characters segmentation are CTPN, EAST and PixelLink. However, they are not very capable dealing with the small and densely character in large image, and connected characters. To cope with these problems, we adopted above popular character segmentation networks CTPN or EAST as the main structure and proposed using ResNet as the feature extraction network due to its excellent sensitivity of tiny features among those existing methods. The experimental result shows that the feature extraction network can affect the precision of locating text significantly. In the experiment with ICDAR dataset, the effect of deeper depth and larger width of ResNet on EAST is notable. The performance of text segmentation on ICDAR2015 is 83.4% accuracy which is 7% higher than original PVANET-EAST. The text detection network also have great ability of generalization that can detect untrained scanned document in 83.9% accuracy and Chinese calligraphy in 86.3% accuracy. Chen-Chiung Hsieh 謝禎冏 2019 學位論文 ; thesis 60 en_US
collection	NDLTD
language	en_US
format	Others
sources	NDLTD
description	碩士 === 大同大學 === 資訊工程學系(所) === 107 === In passed decades, a complete procedure in Optical Character Recognition (OCR) has established. The procedure is mainly split into three parts: text detection, text recognition, and post-processing. In fact, traditional OCR system cannot handle scene text. Text detection in natural scene images is much more difficult than the recognition of text in scanned document images because of its complicating background. Moreover, Chinese character detection and recognition are difficult topic in OCR due to its numerous characters. Popular existing methods for characters segmentation are CTPN, EAST and PixelLink. However, they are not very capable dealing with the small and densely character in large image, and connected characters. To cope with these problems, we adopted above popular character segmentation networks CTPN or EAST as the main structure and proposed using ResNet as the feature extraction network due to its excellent sensitivity of tiny features among those existing methods. The experimental result shows that the feature extraction network can affect the precision of locating text significantly. In the experiment with ICDAR dataset, the effect of deeper depth and larger width of ResNet on EAST is notable. The performance of text segmentation on ICDAR2015 is 83.4% accuracy which is 7% higher than original PVANET-EAST. The text detection network also have great ability of generalization that can detect untrained scanned document in 83.9% accuracy and Chinese calligraphy in 86.3% accuracy.
author2	Chen-Chiung Hsieh
author_facet	Chen-Chiung Hsieh Chi-Hsin Yang 楊啟昕
author	Chi-Hsin Yang 楊啟昕
spellingShingle	Chi-Hsin Yang 楊啟昕 (High Accuracy Text Detection and Recognition using ResNet as Feature Extraction
author_sort	Chi-Hsin Yang
title	(High Accuracy Text Detection and Recognition using ResNet as Feature Extraction
title_short	(High Accuracy Text Detection and Recognition using ResNet as Feature Extraction
title_full	(High Accuracy Text Detection and Recognition using ResNet as Feature Extraction
title_fullStr	(High Accuracy Text Detection and Recognition using ResNet as Feature Extraction
title_full_unstemmed	(High Accuracy Text Detection and Recognition using ResNet as Feature Extraction
title_sort	(high accuracy text detection and recognition using resnet as feature extraction
publishDate	2019
url	http://ndltd.ncl.edu.tw/handle/ry38j8
work_keys_str_mv	AT chihsinyang highaccuracytextdetectionandrecognitionusingresnetasfeatureextraction AT yángqǐxīn highaccuracytextdetectionandrecognitionusingresnetasfeatureextraction AT chihsinyang shǐyòngresnetzhīgāojīngdùwénzìzhēncèyǔbiànshí AT yángqǐxīn shǐyòngresnetzhīgāojīngdùwénzìzhēncèyǔbiànshí
_version_	1719287194057703424

(High Accuracy Text Detection and Recognition using ResNet as Feature Extraction

Similar Items