Large-Scale Fine-Grained Bird Recognition Based on a Triplet Network and Bilinear Model

The main purpose of fine-grained classification is to distinguish among many subcategories of a single basic category, such as birds or flowers. We propose a model based on a triple network and bilinear methods for fine-grained bird identification. Our proposed model can be trained in an end-to-end...

Full description

Bibliographic Details
Main Authors: Zhicheng Zhao, Ze Luo, Jian Li, Kaihua Wang, Bingying Shi
Format: Article
Language:English
Published: MDPI AG 2018-10-01
Series:Applied Sciences
Subjects:
Online Access:http://www.mdpi.com/2076-3417/8/10/1906
id doaj-8a38f8d8e9e941c4adc08b140ae7944c
record_format Article
spelling doaj-8a38f8d8e9e941c4adc08b140ae7944c2020-11-24T23:08:34ZengMDPI AGApplied Sciences2076-34172018-10-01810190610.3390/app8101906app8101906Large-Scale Fine-Grained Bird Recognition Based on a Triplet Network and Bilinear ModelZhicheng Zhao0Ze Luo1Jian Li2Kaihua Wang3Bingying Shi4University of Chinese Academy of Sciences, Beijing 100049, ChinaComputer Network Information Center, Chinese Academy of Sciences, Beijing 100190, ChinaComputer Network Information Center, Chinese Academy of Sciences, Beijing 100190, ChinaUniversity of Chinese Academy of Sciences, Beijing 100049, ChinaUniversity of Chinese Academy of Sciences, Beijing 100049, ChinaThe main purpose of fine-grained classification is to distinguish among many subcategories of a single basic category, such as birds or flowers. We propose a model based on a triple network and bilinear methods for fine-grained bird identification. Our proposed model can be trained in an end-to-end manner, which effectively increases the inter-class distance of the network extraction features and improves the accuracy of bird recognition. When experimentally tested on 1096 birds in a custom-built dataset and on Caltech-UCSD (a public bird dataset), the model achieved an accuracy of 88.91% and 85.58%, respectively. The experimental results confirm the high generalization ability of our model in fine-grained image classification. Moreover, our model requires no additional manual annotation information such as object-labeling frames and part-labeling points, which guarantees good versatility and robustness in fine-grained bird recognition.http://www.mdpi.com/2076-3417/8/10/1906bird recognitionfine-grainedtriplet networkbilinear modelXception
collection DOAJ
language English
format Article
sources DOAJ
author Zhicheng Zhao
Ze Luo
Jian Li
Kaihua Wang
Bingying Shi
spellingShingle Zhicheng Zhao
Ze Luo
Jian Li
Kaihua Wang
Bingying Shi
Large-Scale Fine-Grained Bird Recognition Based on a Triplet Network and Bilinear Model
Applied Sciences
bird recognition
fine-grained
triplet network
bilinear model
Xception
author_facet Zhicheng Zhao
Ze Luo
Jian Li
Kaihua Wang
Bingying Shi
author_sort Zhicheng Zhao
title Large-Scale Fine-Grained Bird Recognition Based on a Triplet Network and Bilinear Model
title_short Large-Scale Fine-Grained Bird Recognition Based on a Triplet Network and Bilinear Model
title_full Large-Scale Fine-Grained Bird Recognition Based on a Triplet Network and Bilinear Model
title_fullStr Large-Scale Fine-Grained Bird Recognition Based on a Triplet Network and Bilinear Model
title_full_unstemmed Large-Scale Fine-Grained Bird Recognition Based on a Triplet Network and Bilinear Model
title_sort large-scale fine-grained bird recognition based on a triplet network and bilinear model
publisher MDPI AG
series Applied Sciences
issn 2076-3417
publishDate 2018-10-01
description The main purpose of fine-grained classification is to distinguish among many subcategories of a single basic category, such as birds or flowers. We propose a model based on a triple network and bilinear methods for fine-grained bird identification. Our proposed model can be trained in an end-to-end manner, which effectively increases the inter-class distance of the network extraction features and improves the accuracy of bird recognition. When experimentally tested on 1096 birds in a custom-built dataset and on Caltech-UCSD (a public bird dataset), the model achieved an accuracy of 88.91% and 85.58%, respectively. The experimental results confirm the high generalization ability of our model in fine-grained image classification. Moreover, our model requires no additional manual annotation information such as object-labeling frames and part-labeling points, which guarantees good versatility and robustness in fine-grained bird recognition.
topic bird recognition
fine-grained
triplet network
bilinear model
Xception
url http://www.mdpi.com/2076-3417/8/10/1906
work_keys_str_mv AT zhichengzhao largescalefinegrainedbirdrecognitionbasedonatripletnetworkandbilinearmodel
AT zeluo largescalefinegrainedbirdrecognitionbasedonatripletnetworkandbilinearmodel
AT jianli largescalefinegrainedbirdrecognitionbasedonatripletnetworkandbilinearmodel
AT kaihuawang largescalefinegrainedbirdrecognitionbasedonatripletnetworkandbilinearmodel
AT bingyingshi largescalefinegrainedbirdrecognitionbasedonatripletnetworkandbilinearmodel
_version_ 1725613594683375616