Pose-Guided Spatial Alignment and Key Frame Selection for One-Shot Video-Based Person Re-Identification

One-shot video-based person re-identification exploits the unlabeled data by using a single-labeled sample for each individual to train a model and to reduce the need for laborious labeling. Although recent works focusing on this task have made some achievements, most state-of-the-art models are vul...

Full description

Bibliographic Details
Main Authors: Yuzhong Chen, Tengda Huang, Yuzhen Niu, Xiao Ke, Yangyang Lin
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8736229/
id doaj-0f14440a2ea944daa47396992f34250d
record_format Article
spelling doaj-0f14440a2ea944daa47396992f34250d2021-03-29T23:31:06ZengIEEEIEEE Access2169-35362019-01-017789917900410.1109/ACCESS.2019.29226798736229Pose-Guided Spatial Alignment and Key Frame Selection for One-Shot Video-Based Person Re-IdentificationYuzhong Chen0https://orcid.org/0000-0001-7408-2684Tengda Huang1Yuzhen Niu2https://orcid.org/0000-0002-9874-9719Xiao Ke3https://orcid.org/0000-0001-9059-5391Yangyang Lin4Fujian Key Laboratory of Network Computing and Intelligent Information Processing, College of Mathematics and Computer Science, Fuzhou University, Fuzhou, ChinaFujian Key Laboratory of Network Computing and Intelligent Information Processing, College of Mathematics and Computer Science, Fuzhou University, Fuzhou, ChinaFujian Key Laboratory of Network Computing and Intelligent Information Processing, College of Mathematics and Computer Science, Fuzhou University, Fuzhou, ChinaFujian Key Laboratory of Network Computing and Intelligent Information Processing, College of Mathematics and Computer Science, Fuzhou University, Fuzhou, ChinaFujian Key Laboratory of Network Computing and Intelligent Information Processing, College of Mathematics and Computer Science, Fuzhou University, Fuzhou, ChinaOne-shot video-based person re-identification exploits the unlabeled data by using a single-labeled sample for each individual to train a model and to reduce the need for laborious labeling. Although recent works focusing on this task have made some achievements, most state-of-the-art models are vulnerable to misalignment, pose variation and corrupted frames. To address these challenges, we propose a one-shot video-based person re-identification model based on pose-guided spatial alignment and KFS. First, a spatial transformer sub-network trained using pose-guided regression is employed to perform the spatial alignment. Second, we propose a novel training strategy based on KFS. Key frames with abruptly changing poses are deliberately identified and selected to make the network adaptive to pose variation. Finally, we propose a frame feature pooling method by incorporating long short-term memory with an attention mechanism to reduce the influence of corrupted frames. Comprehensive experiments are presented based on the MARS and DukeMTMC-VideoReID datasets. The mAP values for these datasets reach 46.5% and 68.4%, respectively, demonstrating that the proposed model achieves significant improvements over state-of-the-art one-shot person re-identification methods.https://ieeexplore.ieee.org/document/8736229/Person re-identificationone-shot learningspatial alignmentkey frame selectionframe feature pooling
collection DOAJ
language English
format Article
sources DOAJ
author Yuzhong Chen
Tengda Huang
Yuzhen Niu
Xiao Ke
Yangyang Lin
spellingShingle Yuzhong Chen
Tengda Huang
Yuzhen Niu
Xiao Ke
Yangyang Lin
Pose-Guided Spatial Alignment and Key Frame Selection for One-Shot Video-Based Person Re-Identification
IEEE Access
Person re-identification
one-shot learning
spatial alignment
key frame selection
frame feature pooling
author_facet Yuzhong Chen
Tengda Huang
Yuzhen Niu
Xiao Ke
Yangyang Lin
author_sort Yuzhong Chen
title Pose-Guided Spatial Alignment and Key Frame Selection for One-Shot Video-Based Person Re-Identification
title_short Pose-Guided Spatial Alignment and Key Frame Selection for One-Shot Video-Based Person Re-Identification
title_full Pose-Guided Spatial Alignment and Key Frame Selection for One-Shot Video-Based Person Re-Identification
title_fullStr Pose-Guided Spatial Alignment and Key Frame Selection for One-Shot Video-Based Person Re-Identification
title_full_unstemmed Pose-Guided Spatial Alignment and Key Frame Selection for One-Shot Video-Based Person Re-Identification
title_sort pose-guided spatial alignment and key frame selection for one-shot video-based person re-identification
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2019-01-01
description One-shot video-based person re-identification exploits the unlabeled data by using a single-labeled sample for each individual to train a model and to reduce the need for laborious labeling. Although recent works focusing on this task have made some achievements, most state-of-the-art models are vulnerable to misalignment, pose variation and corrupted frames. To address these challenges, we propose a one-shot video-based person re-identification model based on pose-guided spatial alignment and KFS. First, a spatial transformer sub-network trained using pose-guided regression is employed to perform the spatial alignment. Second, we propose a novel training strategy based on KFS. Key frames with abruptly changing poses are deliberately identified and selected to make the network adaptive to pose variation. Finally, we propose a frame feature pooling method by incorporating long short-term memory with an attention mechanism to reduce the influence of corrupted frames. Comprehensive experiments are presented based on the MARS and DukeMTMC-VideoReID datasets. The mAP values for these datasets reach 46.5% and 68.4%, respectively, demonstrating that the proposed model achieves significant improvements over state-of-the-art one-shot person re-identification methods.
topic Person re-identification
one-shot learning
spatial alignment
key frame selection
frame feature pooling
url https://ieeexplore.ieee.org/document/8736229/
work_keys_str_mv AT yuzhongchen poseguidedspatialalignmentandkeyframeselectionforoneshotvideobasedpersonreidentification
AT tengdahuang poseguidedspatialalignmentandkeyframeselectionforoneshotvideobasedpersonreidentification
AT yuzhenniu poseguidedspatialalignmentandkeyframeselectionforoneshotvideobasedpersonreidentification
AT xiaoke poseguidedspatialalignmentandkeyframeselectionforoneshotvideobasedpersonreidentification
AT yangyanglin poseguidedspatialalignmentandkeyframeselectionforoneshotvideobasedpersonreidentification
_version_ 1724189269455536128