Learning Local Structured Correlation Filters for Visual Tracking via Spatial Joint Regularization

Robust visual tracking is a fundamental problem in the field of computer vision and has a wide range of practical applications. Recent progress in developing robust tracking methods are mainly made upon discriminative correlation filters (DCF). However, most DCF-based methods develop their trackers...

Full description

Bibliographic Details
Main Authors: Chenggang Guo, Dongyi Chen, Zhiqi Huang
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8672069/
id doaj-e1e7b4437b224044a3b3d6e42f8fc4f6
record_format Article
spelling doaj-e1e7b4437b224044a3b3d6e42f8fc4f62021-04-05T17:01:10ZengIEEEIEEE Access2169-35362019-01-017391583917110.1109/ACCESS.2019.29065088672069Learning Local Structured Correlation Filters for Visual Tracking via Spatial Joint RegularizationChenggang Guo0https://orcid.org/0000-0003-4504-8692Dongyi Chen1Zhiqi Huang2School of Automation Engineering, University of Electronic Science and Technology of China, Chengdu, ChinaSchool of Automation Engineering, University of Electronic Science and Technology of China, Chengdu, ChinaSchool of Automation Engineering, University of Electronic Science and Technology of China, Chengdu, ChinaRobust visual tracking is a fundamental problem in the field of computer vision and has a wide range of practical applications. Recent progress in developing robust tracking methods are mainly made upon discriminative correlation filters (DCF). However, most DCF-based methods develop their trackers under the assumption of a holistic appearance model, ignoring the underlying spatial local structural information. In this paper, we introduce the tree-structured group sparsity regularization into the DCF-based formula. The correlation filter to be learned is divided into hierarchical local groups. The relationship between the response and the circularly shifted target appearance is regularized by applying the l<sub>1</sub>-norm across the l<sub>2</sub>norm of the hierarchical local filter groups. Moreover, a local response consistency term is incorporated together with the structured sparsity to make each local filter group contributes equally to the final response. The accelerated proximal gradient method is employed to optimize this non-smooth composite regularization problem. Benefiting from the properties of circulant matrices, several key steps in the optimization process can be efficiently solved in the frequency domain. The experiments are conducted on four publicly available visual tracking benchmarks. Both quantitative and qualitative evaluations demonstrate that the proposed tracking method performs favorably against a number of state-of-the-art tracking methods.https://ieeexplore.ieee.org/document/8672069/Correlation filterspatial regularizationstructure sparsityvisual tracking
collection DOAJ
language English
format Article
sources DOAJ
author Chenggang Guo
Dongyi Chen
Zhiqi Huang
spellingShingle Chenggang Guo
Dongyi Chen
Zhiqi Huang
Learning Local Structured Correlation Filters for Visual Tracking via Spatial Joint Regularization
IEEE Access
Correlation filter
spatial regularization
structure sparsity
visual tracking
author_facet Chenggang Guo
Dongyi Chen
Zhiqi Huang
author_sort Chenggang Guo
title Learning Local Structured Correlation Filters for Visual Tracking via Spatial Joint Regularization
title_short Learning Local Structured Correlation Filters for Visual Tracking via Spatial Joint Regularization
title_full Learning Local Structured Correlation Filters for Visual Tracking via Spatial Joint Regularization
title_fullStr Learning Local Structured Correlation Filters for Visual Tracking via Spatial Joint Regularization
title_full_unstemmed Learning Local Structured Correlation Filters for Visual Tracking via Spatial Joint Regularization
title_sort learning local structured correlation filters for visual tracking via spatial joint regularization
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2019-01-01
description Robust visual tracking is a fundamental problem in the field of computer vision and has a wide range of practical applications. Recent progress in developing robust tracking methods are mainly made upon discriminative correlation filters (DCF). However, most DCF-based methods develop their trackers under the assumption of a holistic appearance model, ignoring the underlying spatial local structural information. In this paper, we introduce the tree-structured group sparsity regularization into the DCF-based formula. The correlation filter to be learned is divided into hierarchical local groups. The relationship between the response and the circularly shifted target appearance is regularized by applying the l<sub>1</sub>-norm across the l<sub>2</sub>norm of the hierarchical local filter groups. Moreover, a local response consistency term is incorporated together with the structured sparsity to make each local filter group contributes equally to the final response. The accelerated proximal gradient method is employed to optimize this non-smooth composite regularization problem. Benefiting from the properties of circulant matrices, several key steps in the optimization process can be efficiently solved in the frequency domain. The experiments are conducted on four publicly available visual tracking benchmarks. Both quantitative and qualitative evaluations demonstrate that the proposed tracking method performs favorably against a number of state-of-the-art tracking methods.
topic Correlation filter
spatial regularization
structure sparsity
visual tracking
url https://ieeexplore.ieee.org/document/8672069/
work_keys_str_mv AT chenggangguo learninglocalstructuredcorrelationfiltersforvisualtrackingviaspatialjointregularization
AT dongyichen learninglocalstructuredcorrelationfiltersforvisualtrackingviaspatialjointregularization
AT zhiqihuang learninglocalstructuredcorrelationfiltersforvisualtrackingviaspatialjointregularization
_version_ 1721540533558444032