Image-Matching Based Identification of Store Signage Using Web-Crawled Information

We address automatic matching of street images with relevant web resources to enable the identification of store signage in street images. Identification methods for signage usually involve image matching, which attempts to match query images to other similar viewings using pre-labeled copies from a...

Full description

Bibliographic Details
Main Authors: Chenyi Liao, Weimin Wang, Ken Sakurada, Nobuo Kawaguchi
Format: Article
Language:English
Published: IEEE 2018-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8439069/
id doaj-bc0268f23b7540e0b8a0b9a3dbdd3f95
record_format Article
spelling doaj-bc0268f23b7540e0b8a0b9a3dbdd3f952021-03-29T21:19:39ZengIEEEIEEE Access2169-35362018-01-016455904560510.1109/ACCESS.2018.28654908439069Image-Matching Based Identification of Store Signage Using Web-Crawled InformationChenyi Liao0https://orcid.org/0000-0002-0737-3577Weimin Wang1Ken Sakurada2Nobuo Kawaguchi3Graduate School of Engineering, Nagoya University, Nagoya, JapanNational Institute of Advanced Industrial Science and Technology, Tokyo, JapanNational Institute of Advanced Industrial Science and Technology, Tokyo, JapanGraduate School of Engineering, Nagoya University, Nagoya, JapanWe address automatic matching of street images with relevant web resources to enable the identification of store signage in street images. Identification methods for signage usually involve image matching, which attempts to match query images to other similar viewings using pre-labeled copies from a target data set. Manual target data set, such as a fingerprinting database can ensure high-quality data but collected data must be fed manually, which significantly adds costs. Utilizing web-crawled information is a way for automatic data set generation at lower cost, however, imbalanced and noisy data can adversely affect identification accuracy. Our work aims to resolve these issues. We propose a signage identifier in Web-crawled information - SIWI. The SIWI includes a web image data set construction method, which can self-generate high-quality data sets through automated web-mining, including data filtering and pruning strategies, which effectively reduce the identification error caused by noise, imbalance, and insufficient data. Furthermore, by applying a Hybrid Image Matching method that combines the deep learning approach with the feature point matching to signage identification without Optical Character Recognition, it can handle arbitrary signage designs. Because there is no specialized training involved, the same process should also work for any other locations without manual adjustment. An experimental result achieves 91% accuracy in a real-life application, which confirms its effectiveness.https://ieeexplore.ieee.org/document/8439069/Web miningdata set generationimage matchingstore signage identification
collection DOAJ
language English
format Article
sources DOAJ
author Chenyi Liao
Weimin Wang
Ken Sakurada
Nobuo Kawaguchi
spellingShingle Chenyi Liao
Weimin Wang
Ken Sakurada
Nobuo Kawaguchi
Image-Matching Based Identification of Store Signage Using Web-Crawled Information
IEEE Access
Web mining
data set generation
image matching
store signage identification
author_facet Chenyi Liao
Weimin Wang
Ken Sakurada
Nobuo Kawaguchi
author_sort Chenyi Liao
title Image-Matching Based Identification of Store Signage Using Web-Crawled Information
title_short Image-Matching Based Identification of Store Signage Using Web-Crawled Information
title_full Image-Matching Based Identification of Store Signage Using Web-Crawled Information
title_fullStr Image-Matching Based Identification of Store Signage Using Web-Crawled Information
title_full_unstemmed Image-Matching Based Identification of Store Signage Using Web-Crawled Information
title_sort image-matching based identification of store signage using web-crawled information
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2018-01-01
description We address automatic matching of street images with relevant web resources to enable the identification of store signage in street images. Identification methods for signage usually involve image matching, which attempts to match query images to other similar viewings using pre-labeled copies from a target data set. Manual target data set, such as a fingerprinting database can ensure high-quality data but collected data must be fed manually, which significantly adds costs. Utilizing web-crawled information is a way for automatic data set generation at lower cost, however, imbalanced and noisy data can adversely affect identification accuracy. Our work aims to resolve these issues. We propose a signage identifier in Web-crawled information - SIWI. The SIWI includes a web image data set construction method, which can self-generate high-quality data sets through automated web-mining, including data filtering and pruning strategies, which effectively reduce the identification error caused by noise, imbalance, and insufficient data. Furthermore, by applying a Hybrid Image Matching method that combines the deep learning approach with the feature point matching to signage identification without Optical Character Recognition, it can handle arbitrary signage designs. Because there is no specialized training involved, the same process should also work for any other locations without manual adjustment. An experimental result achieves 91% accuracy in a real-life application, which confirms its effectiveness.
topic Web mining
data set generation
image matching
store signage identification
url https://ieeexplore.ieee.org/document/8439069/
work_keys_str_mv AT chenyiliao imagematchingbasedidentificationofstoresignageusingwebcrawledinformation
AT weiminwang imagematchingbasedidentificationofstoresignageusingwebcrawledinformation
AT kensakurada imagematchingbasedidentificationofstoresignageusingwebcrawledinformation
AT nobuokawaguchi imagematchingbasedidentificationofstoresignageusingwebcrawledinformation
_version_ 1724193124347019264