Pose-Guided Spatial Alignment and Key Frame Selection for One-Shot Video-Based Person Re-Identification

One-shot video-based person re-identification exploits the unlabeled data by using a single-labeled sample for each individual to train a model and to reduce the need for laborious labeling. Although recent works focusing on this task have made some achievements, most state-of-the-art models are vul...

Full description

Bibliographic Details
Main Authors:	Yuzhong Chen, Tengda Huang, Yuzhen Niu, Xiao Ke, Yangyang Lin
Format:	Article
Language:	English
Published:	IEEE 2019-01-01
Series:	IEEE Access
Subjects:	Person re-identification one-shot learning spatial alignment key frame selection frame feature pooling
Online Access:	https://ieeexplore.ieee.org/document/8736229/

id	doaj-0f14440a2ea944daa47396992f34250d
record_format	Article
spelling	doaj-0f14440a2ea944daa47396992f34250d2021-03-29T23:31:06ZengIEEEIEEE Access2169-35362019-01-017789917900410.1109/ACCESS.2019.29226798736229Pose-Guided Spatial Alignment and Key Frame Selection for One-Shot Video-Based Person Re-IdentificationYuzhong Chen0https://orcid.org/0000-0001-7408-2684Tengda Huang1Yuzhen Niu2https://orcid.org/0000-0002-9874-9719Xiao Ke3https://orcid.org/0000-0001-9059-5391Yangyang Lin4Fujian Key Laboratory of Network Computing and Intelligent Information Processing, College of Mathematics and Computer Science, Fuzhou University, Fuzhou, ChinaFujian Key Laboratory of Network Computing and Intelligent Information Processing, College of Mathematics and Computer Science, Fuzhou University, Fuzhou, ChinaFujian Key Laboratory of Network Computing and Intelligent Information Processing, College of Mathematics and Computer Science, Fuzhou University, Fuzhou, ChinaFujian Key Laboratory of Network Computing and Intelligent Information Processing, College of Mathematics and Computer Science, Fuzhou University, Fuzhou, ChinaFujian Key Laboratory of Network Computing and Intelligent Information Processing, College of Mathematics and Computer Science, Fuzhou University, Fuzhou, ChinaOne-shot video-based person re-identification exploits the unlabeled data by using a single-labeled sample for each individual to train a model and to reduce the need for laborious labeling. Although recent works focusing on this task have made some achievements, most state-of-the-art models are vulnerable to misalignment, pose variation and corrupted frames. To address these challenges, we propose a one-shot video-based person re-identification model based on pose-guided spatial alignment and KFS. First, a spatial transformer sub-network trained using pose-guided regression is employed to perform the spatial alignment. Second, we propose a novel training strategy based on KFS. Key frames with abruptly changing poses are deliberately identified and selected to make the network adaptive to pose variation. Finally, we propose a frame feature pooling method by incorporating long short-term memory with an attention mechanism to reduce the influence of corrupted frames. Comprehensive experiments are presented based on the MARS and DukeMTMC-VideoReID datasets. The mAP values for these datasets reach 46.5% and 68.4%, respectively, demonstrating that the proposed model achieves significant improvements over state-of-the-art one-shot person re-identification methods.https://ieeexplore.ieee.org/document/8736229/Person re-identificationone-shot learningspatial alignmentkey frame selectionframe feature pooling
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Yuzhong Chen Tengda Huang Yuzhen Niu Xiao Ke Yangyang Lin
spellingShingle	Yuzhong Chen Tengda Huang Yuzhen Niu Xiao Ke Yangyang Lin Pose-Guided Spatial Alignment and Key Frame Selection for One-Shot Video-Based Person Re-Identification IEEE Access Person re-identification one-shot learning spatial alignment key frame selection frame feature pooling
author_facet	Yuzhong Chen Tengda Huang Yuzhen Niu Xiao Ke Yangyang Lin
author_sort	Yuzhong Chen
title	Pose-Guided Spatial Alignment and Key Frame Selection for One-Shot Video-Based Person Re-Identification
title_short	Pose-Guided Spatial Alignment and Key Frame Selection for One-Shot Video-Based Person Re-Identification
title_full	Pose-Guided Spatial Alignment and Key Frame Selection for One-Shot Video-Based Person Re-Identification
title_fullStr	Pose-Guided Spatial Alignment and Key Frame Selection for One-Shot Video-Based Person Re-Identification
title_full_unstemmed	Pose-Guided Spatial Alignment and Key Frame Selection for One-Shot Video-Based Person Re-Identification
title_sort	pose-guided spatial alignment and key frame selection for one-shot video-based person re-identification
publisher	IEEE
series	IEEE Access
issn	2169-3536
publishDate	2019-01-01
description	One-shot video-based person re-identification exploits the unlabeled data by using a single-labeled sample for each individual to train a model and to reduce the need for laborious labeling. Although recent works focusing on this task have made some achievements, most state-of-the-art models are vulnerable to misalignment, pose variation and corrupted frames. To address these challenges, we propose a one-shot video-based person re-identification model based on pose-guided spatial alignment and KFS. First, a spatial transformer sub-network trained using pose-guided regression is employed to perform the spatial alignment. Second, we propose a novel training strategy based on KFS. Key frames with abruptly changing poses are deliberately identified and selected to make the network adaptive to pose variation. Finally, we propose a frame feature pooling method by incorporating long short-term memory with an attention mechanism to reduce the influence of corrupted frames. Comprehensive experiments are presented based on the MARS and DukeMTMC-VideoReID datasets. The mAP values for these datasets reach 46.5% and 68.4%, respectively, demonstrating that the proposed model achieves significant improvements over state-of-the-art one-shot person re-identification methods.
topic	Person re-identification one-shot learning spatial alignment key frame selection frame feature pooling
url	https://ieeexplore.ieee.org/document/8736229/
work_keys_str_mv	AT yuzhongchen poseguidedspatialalignmentandkeyframeselectionforoneshotvideobasedpersonreidentification AT tengdahuang poseguidedspatialalignmentandkeyframeselectionforoneshotvideobasedpersonreidentification AT yuzhenniu poseguidedspatialalignmentandkeyframeselectionforoneshotvideobasedpersonreidentification AT xiaoke poseguidedspatialalignmentandkeyframeselectionforoneshotvideobasedpersonreidentification AT yangyanglin poseguidedspatialalignmentandkeyframeselectionforoneshotvideobasedpersonreidentification
_version_	1724189269455536128

Pose-Guided Spatial Alignment and Key Frame Selection for One-Shot Video-Based Person Re-Identification

Similar Items