Classification of Occluded Images for Large-Scale Datasets With Numerous Occlusion Patterns
Large-scale image datasets with numerous occlusion patterns prevail in real applications. The classification scheme based on subspace decomposition-based estimation with squared l<sub>2</sub> -norm regularization (SDBE_L2) has shown promising performance for the classification of partial...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2020-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9199833/ |
id |
doaj-4f68ba7a13fe4aafb554308e0bb2f3ab |
---|---|
record_format |
Article |
spelling |
doaj-4f68ba7a13fe4aafb554308e0bb2f3ab2021-03-30T03:57:55ZengIEEEIEEE Access2169-35362020-01-01817088317089710.1109/ACCESS.2020.30250359199833Classification of Occluded Images for Large-Scale Datasets With Numerous Occlusion PatternsFeng Cen0https://orcid.org/0000-0002-0825-385XXiaoyu Zhao1Wuzhuang Li2Fanglai Zhu3Department of Control Science and Engineering, Tongji University, Shanghai, ChinaDepartment of Control Science and Engineering, Tongji University, Shanghai, ChinaDepartment of Control Science and Engineering, Tongji University, Shanghai, ChinaDepartment of Control Science and Engineering, Tongji University, Shanghai, ChinaLarge-scale image datasets with numerous occlusion patterns prevail in real applications. The classification scheme based on subspace decomposition-based estimation with squared l<sub>2</sub> -norm regularization (SDBE_L2) has shown promising performance for the classification of partially occluded images. For the large-scale image datasets with numerous occlusion patterns, it however suffers from a high labor intensity in acquiring extra image pairs and a large consumption of computational resources in the training stage. To reduce the labor intensity, this paper enumerates several useful types of extra image pairs to guide the collection of extra images and introduces an intra-class random pairing method to semi-automatically form the extra image pairs. To alleviate the consumption of computational resources, this paper proposes two dictionary compression approaches: 1) uncentered PCA-based single partition compression (UPSPC), which compresses the dictionary to a size not larger than twice the column vector length without affecting the classification accuracy, and 2) uncentered PCA-based intra-class partition compression (UPIPC), which can further shrink the occlusion error dictionary (or class dictionary) when it has a small number of occlusion classes (or image classes). The proposed approaches are based on the property of SDBE_L2 being invariant to the uncentered PCA of sub-dictionaries. The extensive experiments on the Caltech-101 dataset and Oxford-102 flower dataset demonstrate the enumerated examples and the intra-class random pairing method facilitate acquiring the extra images and forming the extra image pairs only with a small loss in the classification accuracy. The experimental results on a large-scale occluded image dataset synthesized from the ILSVRC 2012 classification dataset with numerous occlusion patterns show that the proposed dictionary compression approaches reduce the dictionary size by over 11 times and shorten the training time by more than 39 times without loss in the classification accuracy.https://ieeexplore.ieee.org/document/9199833/Convolutional neural networksSDBEocclusionimage classificationprincipal component analysis |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Feng Cen Xiaoyu Zhao Wuzhuang Li Fanglai Zhu |
spellingShingle |
Feng Cen Xiaoyu Zhao Wuzhuang Li Fanglai Zhu Classification of Occluded Images for Large-Scale Datasets With Numerous Occlusion Patterns IEEE Access Convolutional neural networks SDBE occlusion image classification principal component analysis |
author_facet |
Feng Cen Xiaoyu Zhao Wuzhuang Li Fanglai Zhu |
author_sort |
Feng Cen |
title |
Classification of Occluded Images for Large-Scale Datasets With Numerous Occlusion Patterns |
title_short |
Classification of Occluded Images for Large-Scale Datasets With Numerous Occlusion Patterns |
title_full |
Classification of Occluded Images for Large-Scale Datasets With Numerous Occlusion Patterns |
title_fullStr |
Classification of Occluded Images for Large-Scale Datasets With Numerous Occlusion Patterns |
title_full_unstemmed |
Classification of Occluded Images for Large-Scale Datasets With Numerous Occlusion Patterns |
title_sort |
classification of occluded images for large-scale datasets with numerous occlusion patterns |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2020-01-01 |
description |
Large-scale image datasets with numerous occlusion patterns prevail in real applications. The classification scheme based on subspace decomposition-based estimation with squared l<sub>2</sub> -norm regularization (SDBE_L2) has shown promising performance for the classification of partially occluded images. For the large-scale image datasets with numerous occlusion patterns, it however suffers from a high labor intensity in acquiring extra image pairs and a large consumption of computational resources in the training stage. To reduce the labor intensity, this paper enumerates several useful types of extra image pairs to guide the collection of extra images and introduces an intra-class random pairing method to semi-automatically form the extra image pairs. To alleviate the consumption of computational resources, this paper proposes two dictionary compression approaches: 1) uncentered PCA-based single partition compression (UPSPC), which compresses the dictionary to a size not larger than twice the column vector length without affecting the classification accuracy, and 2) uncentered PCA-based intra-class partition compression (UPIPC), which can further shrink the occlusion error dictionary (or class dictionary) when it has a small number of occlusion classes (or image classes). The proposed approaches are based on the property of SDBE_L2 being invariant to the uncentered PCA of sub-dictionaries. The extensive experiments on the Caltech-101 dataset and Oxford-102 flower dataset demonstrate the enumerated examples and the intra-class random pairing method facilitate acquiring the extra images and forming the extra image pairs only with a small loss in the classification accuracy. The experimental results on a large-scale occluded image dataset synthesized from the ILSVRC 2012 classification dataset with numerous occlusion patterns show that the proposed dictionary compression approaches reduce the dictionary size by over 11 times and shorten the training time by more than 39 times without loss in the classification accuracy. |
topic |
Convolutional neural networks SDBE occlusion image classification principal component analysis |
url |
https://ieeexplore.ieee.org/document/9199833/ |
work_keys_str_mv |
AT fengcen classificationofoccludedimagesforlargescaledatasetswithnumerousocclusionpatterns AT xiaoyuzhao classificationofoccludedimagesforlargescaledatasetswithnumerousocclusionpatterns AT wuzhuangli classificationofoccludedimagesforlargescaledatasetswithnumerousocclusionpatterns AT fanglaizhu classificationofoccludedimagesforlargescaledatasetswithnumerousocclusionpatterns |
_version_ |
1724182507344101376 |