Human Action Recognition from Gradient Boundary Histograms
This thesis presents a framework for automatic recognition of human actions in un- controlled, realistic video data with fixed cameras, such as surveillance videos. In this thesis, we divide human action recognition into three steps: description, representation, and classification of local spatio-te...
Main Author: | |
---|---|
Other Authors: | |
Language: | en |
Published: |
Université d'Ottawa / University of Ottawa
2017
|
Subjects: | |
Online Access: | http://hdl.handle.net/10393/35931 http://dx.doi.org/10.20381/ruor-20212 |
id |
ndltd-uottawa.ca-oai-ruor.uottawa.ca-10393-35931 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-uottawa.ca-oai-ruor.uottawa.ca-10393-359312018-01-05T19:03:01Z Human Action Recognition from Gradient Boundary Histograms Wang, Xuelu Laganière, Robert real-time recognition This thesis presents a framework for automatic recognition of human actions in un- controlled, realistic video data with fixed cameras, such as surveillance videos. In this thesis, we divide human action recognition into three steps: description, representation, and classification of local spatio-temporal features. The bag-of-features model was used to build the classifier. Fisher Vectors were also studied. We focus on the potential of the methods, with the joint optimization of two constraints: the classification precision and its efficiency. On the performance side, a new local descriptor, called Gradient Boundary Histograms (GBH), is adopted. It is built on simple spatio-temporal gradients, which can be computed quickly. We demonstrate that GBH can better represent local structure and motion than other gradient-based descriptors, and significantly outperforms them on large datasets. Our evaluation shows that compared to HOG descriptors, which are based solely on spatial gradient, GBH descriptor preserves the recognition precision even in difficult situation. Since surveillance video captured with fixed cameras is the emphasis of our study, removing the background before action recognition is helpful for improving efficiency. We first preprocess the video data by applying HOG to detect humans. GBH descriptor is then used at reduced spatial resolutions, which yields both high efficiency and low memory usage; in addition, we apply PCA to reduce the feature dimensions, which results in fast matching and an accelerated classification process. Experiments our methods achieved good performance in recognizing precision, while simultaneously highlighting effectiveness and efficiency. 2017-03-31T16:28:52Z 2017-03-31T16:28:52Z 2017 Thesis http://hdl.handle.net/10393/35931 http://dx.doi.org/10.20381/ruor-20212 en Université d'Ottawa / University of Ottawa |
collection |
NDLTD |
language |
en |
sources |
NDLTD |
topic |
real-time recognition |
spellingShingle |
real-time recognition Wang, Xuelu Human Action Recognition from Gradient Boundary Histograms |
description |
This thesis presents a framework for automatic recognition of human actions in un- controlled, realistic video data with fixed cameras, such as surveillance videos. In this thesis, we divide human action recognition into three steps: description, representation, and classification of local spatio-temporal features. The bag-of-features model was used to build the classifier. Fisher Vectors were also studied. We focus on the potential of the methods, with the joint optimization of two constraints: the classification precision and its efficiency.
On the performance side, a new local descriptor, called Gradient Boundary Histograms (GBH), is adopted. It is built on simple spatio-temporal gradients, which can be computed quickly. We demonstrate that GBH can better represent local structure and motion than other gradient-based descriptors, and significantly outperforms them on large datasets. Our evaluation shows that compared to HOG descriptors, which are based solely on spatial gradient, GBH descriptor preserves the recognition precision even in difficult situation.
Since surveillance video captured with fixed cameras is the emphasis of our study, removing the background before action recognition is helpful for improving efficiency. We first preprocess the video data by applying HOG to detect humans. GBH descriptor is then used at reduced spatial resolutions, which yields both high efficiency and low memory usage; in addition, we apply PCA to reduce the feature dimensions, which results in fast matching and an accelerated classification process.
Experiments our methods achieved good performance in recognizing precision, while simultaneously highlighting effectiveness and efficiency. |
author2 |
Laganière, Robert |
author_facet |
Laganière, Robert Wang, Xuelu |
author |
Wang, Xuelu |
author_sort |
Wang, Xuelu |
title |
Human Action Recognition from Gradient Boundary Histograms |
title_short |
Human Action Recognition from Gradient Boundary Histograms |
title_full |
Human Action Recognition from Gradient Boundary Histograms |
title_fullStr |
Human Action Recognition from Gradient Boundary Histograms |
title_full_unstemmed |
Human Action Recognition from Gradient Boundary Histograms |
title_sort |
human action recognition from gradient boundary histograms |
publisher |
Université d'Ottawa / University of Ottawa |
publishDate |
2017 |
url |
http://hdl.handle.net/10393/35931 http://dx.doi.org/10.20381/ruor-20212 |
work_keys_str_mv |
AT wangxuelu humanactionrecognitionfromgradientboundaryhistograms |
_version_ |
1718598809182797824 |