Fully Convolutional Networks for Text Understanding in Scene Images

Text understanding in scene images has gained plenty of attention in the computer vision community and it is an important task in many applications as text carries semantically  rich  information  about  scene  content  and  context.   For  instance, reading text in a scene can be applied to autonom...

Full description

Bibliographic Details
Main Author: Dena Bazazian
Format: Article
Language:English
Published: Computer Vision Center Press 2020-02-01
Series:ELCVIA Electronic Letters on Computer Vision and Image Analysis
Subjects:
Online Access:https://elcvia.cvc.uab.es/article/view/1187
id doaj-258e383b226e4aee84b66eb0f2746d86
record_format Article
spelling doaj-258e383b226e4aee84b66eb0f2746d862021-09-18T12:38:12ZengComputer Vision Center PressELCVIA Electronic Letters on Computer Vision and Image Analysis1577-50972020-02-0118210.5565/rev/elcvia.1187338Fully Convolutional Networks for Text Understanding in Scene ImagesDena BazazianText understanding in scene images has gained plenty of attention in the computer vision community and it is an important task in many applications as text carries semantically  rich  information  about  scene  content  and  context.   For  instance, reading text in a scene can be applied to autonomous driving, scene understanding or assisting visually impaired people. The general aim of scene text understanding is to localize and recognize text in scene images. Text regions are first localized in the original image by a trained detector model and afterwards fed into a recognition module. The tasks of localization and recognition are highly correlated since an inaccurate localization can affect the recognition task. The main purpose of this thesis is to devise efficient methods for scene text understanding. We investigate how the latest results on deep learning can advance text understanding pipelines. Recently, Fully Convolutional Networks (FCNs) and derived methods have achieved a significant performance on semantic segmentation and pixel level classification tasks. Therefore, we took benefit of the strengths of FCN approaches in order to detect and recognize text in natural scenes images.https://elcvia.cvc.uab.es/article/view/1187Text UnderstandingText DetectionWord SpottingFully Convolutional Network (FCN)Scene Images.
collection DOAJ
language English
format Article
sources DOAJ
author Dena Bazazian
spellingShingle Dena Bazazian
Fully Convolutional Networks for Text Understanding in Scene Images
ELCVIA Electronic Letters on Computer Vision and Image Analysis
Text Understanding
Text Detection
Word Spotting
Fully Convolutional Network (FCN)
Scene Images.
author_facet Dena Bazazian
author_sort Dena Bazazian
title Fully Convolutional Networks for Text Understanding in Scene Images
title_short Fully Convolutional Networks for Text Understanding in Scene Images
title_full Fully Convolutional Networks for Text Understanding in Scene Images
title_fullStr Fully Convolutional Networks for Text Understanding in Scene Images
title_full_unstemmed Fully Convolutional Networks for Text Understanding in Scene Images
title_sort fully convolutional networks for text understanding in scene images
publisher Computer Vision Center Press
series ELCVIA Electronic Letters on Computer Vision and Image Analysis
issn 1577-5097
publishDate 2020-02-01
description Text understanding in scene images has gained plenty of attention in the computer vision community and it is an important task in many applications as text carries semantically  rich  information  about  scene  content  and  context.   For  instance, reading text in a scene can be applied to autonomous driving, scene understanding or assisting visually impaired people. The general aim of scene text understanding is to localize and recognize text in scene images. Text regions are first localized in the original image by a trained detector model and afterwards fed into a recognition module. The tasks of localization and recognition are highly correlated since an inaccurate localization can affect the recognition task. The main purpose of this thesis is to devise efficient methods for scene text understanding. We investigate how the latest results on deep learning can advance text understanding pipelines. Recently, Fully Convolutional Networks (FCNs) and derived methods have achieved a significant performance on semantic segmentation and pixel level classification tasks. Therefore, we took benefit of the strengths of FCN approaches in order to detect and recognize text in natural scenes images.
topic Text Understanding
Text Detection
Word Spotting
Fully Convolutional Network (FCN)
Scene Images.
url https://elcvia.cvc.uab.es/article/view/1187
work_keys_str_mv AT denabazazian fullyconvolutionalnetworksfortextunderstandinginsceneimages
_version_ 1717376961978826752