Pose-Guided Spatial Alignment and Key Frame Selection for One-Shot Video-Based Person Re-Identification
One-shot video-based person re-identification exploits the unlabeled data by using a single-labeled sample for each individual to train a model and to reduce the need for laborious labeling. Although recent works focusing on this task have made some achievements, most state-of-the-art models are vul...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2019-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/8736229/ |
id |
doaj-0f14440a2ea944daa47396992f34250d |
---|---|
record_format |
Article |
spelling |
doaj-0f14440a2ea944daa47396992f34250d2021-03-29T23:31:06ZengIEEEIEEE Access2169-35362019-01-017789917900410.1109/ACCESS.2019.29226798736229Pose-Guided Spatial Alignment and Key Frame Selection for One-Shot Video-Based Person Re-IdentificationYuzhong Chen0https://orcid.org/0000-0001-7408-2684Tengda Huang1Yuzhen Niu2https://orcid.org/0000-0002-9874-9719Xiao Ke3https://orcid.org/0000-0001-9059-5391Yangyang Lin4Fujian Key Laboratory of Network Computing and Intelligent Information Processing, College of Mathematics and Computer Science, Fuzhou University, Fuzhou, ChinaFujian Key Laboratory of Network Computing and Intelligent Information Processing, College of Mathematics and Computer Science, Fuzhou University, Fuzhou, ChinaFujian Key Laboratory of Network Computing and Intelligent Information Processing, College of Mathematics and Computer Science, Fuzhou University, Fuzhou, ChinaFujian Key Laboratory of Network Computing and Intelligent Information Processing, College of Mathematics and Computer Science, Fuzhou University, Fuzhou, ChinaFujian Key Laboratory of Network Computing and Intelligent Information Processing, College of Mathematics and Computer Science, Fuzhou University, Fuzhou, ChinaOne-shot video-based person re-identification exploits the unlabeled data by using a single-labeled sample for each individual to train a model and to reduce the need for laborious labeling. Although recent works focusing on this task have made some achievements, most state-of-the-art models are vulnerable to misalignment, pose variation and corrupted frames. To address these challenges, we propose a one-shot video-based person re-identification model based on pose-guided spatial alignment and KFS. First, a spatial transformer sub-network trained using pose-guided regression is employed to perform the spatial alignment. Second, we propose a novel training strategy based on KFS. Key frames with abruptly changing poses are deliberately identified and selected to make the network adaptive to pose variation. Finally, we propose a frame feature pooling method by incorporating long short-term memory with an attention mechanism to reduce the influence of corrupted frames. Comprehensive experiments are presented based on the MARS and DukeMTMC-VideoReID datasets. The mAP values for these datasets reach 46.5% and 68.4%, respectively, demonstrating that the proposed model achieves significant improvements over state-of-the-art one-shot person re-identification methods.https://ieeexplore.ieee.org/document/8736229/Person re-identificationone-shot learningspatial alignmentkey frame selectionframe feature pooling |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Yuzhong Chen Tengda Huang Yuzhen Niu Xiao Ke Yangyang Lin |
spellingShingle |
Yuzhong Chen Tengda Huang Yuzhen Niu Xiao Ke Yangyang Lin Pose-Guided Spatial Alignment and Key Frame Selection for One-Shot Video-Based Person Re-Identification IEEE Access Person re-identification one-shot learning spatial alignment key frame selection frame feature pooling |
author_facet |
Yuzhong Chen Tengda Huang Yuzhen Niu Xiao Ke Yangyang Lin |
author_sort |
Yuzhong Chen |
title |
Pose-Guided Spatial Alignment and Key Frame Selection for One-Shot Video-Based Person Re-Identification |
title_short |
Pose-Guided Spatial Alignment and Key Frame Selection for One-Shot Video-Based Person Re-Identification |
title_full |
Pose-Guided Spatial Alignment and Key Frame Selection for One-Shot Video-Based Person Re-Identification |
title_fullStr |
Pose-Guided Spatial Alignment and Key Frame Selection for One-Shot Video-Based Person Re-Identification |
title_full_unstemmed |
Pose-Guided Spatial Alignment and Key Frame Selection for One-Shot Video-Based Person Re-Identification |
title_sort |
pose-guided spatial alignment and key frame selection for one-shot video-based person re-identification |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2019-01-01 |
description |
One-shot video-based person re-identification exploits the unlabeled data by using a single-labeled sample for each individual to train a model and to reduce the need for laborious labeling. Although recent works focusing on this task have made some achievements, most state-of-the-art models are vulnerable to misalignment, pose variation and corrupted frames. To address these challenges, we propose a one-shot video-based person re-identification model based on pose-guided spatial alignment and KFS. First, a spatial transformer sub-network trained using pose-guided regression is employed to perform the spatial alignment. Second, we propose a novel training strategy based on KFS. Key frames with abruptly changing poses are deliberately identified and selected to make the network adaptive to pose variation. Finally, we propose a frame feature pooling method by incorporating long short-term memory with an attention mechanism to reduce the influence of corrupted frames. Comprehensive experiments are presented based on the MARS and DukeMTMC-VideoReID datasets. The mAP values for these datasets reach 46.5% and 68.4%, respectively, demonstrating that the proposed model achieves significant improvements over state-of-the-art one-shot person re-identification methods. |
topic |
Person re-identification one-shot learning spatial alignment key frame selection frame feature pooling |
url |
https://ieeexplore.ieee.org/document/8736229/ |
work_keys_str_mv |
AT yuzhongchen poseguidedspatialalignmentandkeyframeselectionforoneshotvideobasedpersonreidentification AT tengdahuang poseguidedspatialalignmentandkeyframeselectionforoneshotvideobasedpersonreidentification AT yuzhenniu poseguidedspatialalignmentandkeyframeselectionforoneshotvideobasedpersonreidentification AT xiaoke poseguidedspatialalignmentandkeyframeselectionforoneshotvideobasedpersonreidentification AT yangyanglin poseguidedspatialalignmentandkeyframeselectionforoneshotvideobasedpersonreidentification |
_version_ |
1724189269455536128 |