Large-Scale Fine-Grained Bird Recognition Based on a Triplet Network and Bilinear Model
The main purpose of fine-grained classification is to distinguish among many subcategories of a single basic category, such as birds or flowers. We propose a model based on a triple network and bilinear methods for fine-grained bird identification. Our proposed model can be trained in an end-to-end...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2018-10-01
|
Series: | Applied Sciences |
Subjects: | |
Online Access: | http://www.mdpi.com/2076-3417/8/10/1906 |
id |
doaj-8a38f8d8e9e941c4adc08b140ae7944c |
---|---|
record_format |
Article |
spelling |
doaj-8a38f8d8e9e941c4adc08b140ae7944c2020-11-24T23:08:34ZengMDPI AGApplied Sciences2076-34172018-10-01810190610.3390/app8101906app8101906Large-Scale Fine-Grained Bird Recognition Based on a Triplet Network and Bilinear ModelZhicheng Zhao0Ze Luo1Jian Li2Kaihua Wang3Bingying Shi4University of Chinese Academy of Sciences, Beijing 100049, ChinaComputer Network Information Center, Chinese Academy of Sciences, Beijing 100190, ChinaComputer Network Information Center, Chinese Academy of Sciences, Beijing 100190, ChinaUniversity of Chinese Academy of Sciences, Beijing 100049, ChinaUniversity of Chinese Academy of Sciences, Beijing 100049, ChinaThe main purpose of fine-grained classification is to distinguish among many subcategories of a single basic category, such as birds or flowers. We propose a model based on a triple network and bilinear methods for fine-grained bird identification. Our proposed model can be trained in an end-to-end manner, which effectively increases the inter-class distance of the network extraction features and improves the accuracy of bird recognition. When experimentally tested on 1096 birds in a custom-built dataset and on Caltech-UCSD (a public bird dataset), the model achieved an accuracy of 88.91% and 85.58%, respectively. The experimental results confirm the high generalization ability of our model in fine-grained image classification. Moreover, our model requires no additional manual annotation information such as object-labeling frames and part-labeling points, which guarantees good versatility and robustness in fine-grained bird recognition.http://www.mdpi.com/2076-3417/8/10/1906bird recognitionfine-grainedtriplet networkbilinear modelXception |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Zhicheng Zhao Ze Luo Jian Li Kaihua Wang Bingying Shi |
spellingShingle |
Zhicheng Zhao Ze Luo Jian Li Kaihua Wang Bingying Shi Large-Scale Fine-Grained Bird Recognition Based on a Triplet Network and Bilinear Model Applied Sciences bird recognition fine-grained triplet network bilinear model Xception |
author_facet |
Zhicheng Zhao Ze Luo Jian Li Kaihua Wang Bingying Shi |
author_sort |
Zhicheng Zhao |
title |
Large-Scale Fine-Grained Bird Recognition Based on a Triplet Network and Bilinear Model |
title_short |
Large-Scale Fine-Grained Bird Recognition Based on a Triplet Network and Bilinear Model |
title_full |
Large-Scale Fine-Grained Bird Recognition Based on a Triplet Network and Bilinear Model |
title_fullStr |
Large-Scale Fine-Grained Bird Recognition Based on a Triplet Network and Bilinear Model |
title_full_unstemmed |
Large-Scale Fine-Grained Bird Recognition Based on a Triplet Network and Bilinear Model |
title_sort |
large-scale fine-grained bird recognition based on a triplet network and bilinear model |
publisher |
MDPI AG |
series |
Applied Sciences |
issn |
2076-3417 |
publishDate |
2018-10-01 |
description |
The main purpose of fine-grained classification is to distinguish among many subcategories of a single basic category, such as birds or flowers. We propose a model based on a triple network and bilinear methods for fine-grained bird identification. Our proposed model can be trained in an end-to-end manner, which effectively increases the inter-class distance of the network extraction features and improves the accuracy of bird recognition. When experimentally tested on 1096 birds in a custom-built dataset and on Caltech-UCSD (a public bird dataset), the model achieved an accuracy of 88.91% and 85.58%, respectively. The experimental results confirm the high generalization ability of our model in fine-grained image classification. Moreover, our model requires no additional manual annotation information such as object-labeling frames and part-labeling points, which guarantees good versatility and robustness in fine-grained bird recognition. |
topic |
bird recognition fine-grained triplet network bilinear model Xception |
url |
http://www.mdpi.com/2076-3417/8/10/1906 |
work_keys_str_mv |
AT zhichengzhao largescalefinegrainedbirdrecognitionbasedonatripletnetworkandbilinearmodel AT zeluo largescalefinegrainedbirdrecognitionbasedonatripletnetworkandbilinearmodel AT jianli largescalefinegrainedbirdrecognitionbasedonatripletnetworkandbilinearmodel AT kaihuawang largescalefinegrainedbirdrecognitionbasedonatripletnetworkandbilinearmodel AT bingyingshi largescalefinegrainedbirdrecognitionbasedonatripletnetworkandbilinearmodel |
_version_ |
1725613594683375616 |