Human Action Recognition from Gradient Boundary Histograms

This thesis presents a framework for automatic recognition of human actions in uncontrolled, realistic video data with fixed cameras, such as surveillance videos. In this thesis, we divide human action recognition into three steps: description, representation, and classification of local spatio-te...

Bibliographic Details
Main Author:	Wang, Xuelu
Other Authors:	Laganière, Robert
Language:	en
Published:	Université d'Ottawa / University of Ottawa 2017
Subjects:	real-time recognition
Online Access:	http://hdl.handle.net/10393/35931 http://dx.doi.org/10.20381/ruor-20212

id	ndltd-uottawa.ca-oai-ruor.uottawa.ca-10393-35931
record_format	oai_dc
spelling	ndltd-uottawa.ca-oai-ruor.uottawa.ca-10393-35931 2018-01-05T19:03:01Z 2017-03-31T16:28:52Z 2017 Thesis
collection	NDLTD
language	en
sources	NDLTD
topic	real-time recognition
description	This thesis presents a framework for automatic recognition of human actions in uncontrolled, realistic video data captured with fixed cameras, such as surveillance videos. We divide human action recognition into three steps: description, representation, and classification of local spatio-temporal features. The bag-of-features model is used to build the classifier; Fisher Vectors are also studied. We focus on the potential of the methods under the joint optimization of two constraints: classification precision and efficiency. On the performance side, a new local descriptor, called Gradient Boundary Histograms (GBH), is adopted. It is built on simple spatio-temporal gradients, which can be computed quickly. We demonstrate that GBH represents local structure and motion better than other gradient-based descriptors and significantly outperforms them on large datasets. Our evaluation shows that, compared to HOG descriptors, which are based solely on spatial gradients, the GBH descriptor preserves recognition precision even in difficult situations. Since surveillance video captured with fixed cameras is the emphasis of our study, removing the background before action recognition helps improve efficiency. We first preprocess the video data by applying HOG to detect humans. The GBH descriptor is then used at reduced spatial resolutions, which yields both high efficiency and low memory usage; in addition, we apply PCA to reduce the feature dimensions, which results in fast matching and an accelerated classification process. Experiments show that our methods achieve good recognition precision while also demonstrating effectiveness and efficiency.
author2	Laganière, Robert
author_facet	Laganière, Robert Wang, Xuelu
author	Wang, Xuelu
author_sort	Wang, Xuelu
title	Human Action Recognition from Gradient Boundary Histograms
publisher	Université d'Ottawa / University of Ottawa
publishDate	2017
url	http://hdl.handle.net/10393/35931 http://dx.doi.org/10.20381/ruor-20212
work_keys_str_mv	AT wangxuelu humanactionrecognitionfromgradientboundaryhistograms
_version_	1718598809182797824
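
The abstract says the GBH descriptor is built on simple spatio-temporal gradients but gives no formula. The sketch below is one possible reading and should be treated as an assumption, not as the thesis's implementation: it takes the spatial gradients of two consecutive frames, subtracts them over time, and accumulates the orientation of that change into a magnitude-weighted histogram. The function names, the central-difference filter, and the choice of 8 bins are all hypothetical.

```python
import math


def spatial_gradients(frame):
    """Central-difference gradients (gx, gy) of a 2-D list of intensities.

    Border pixels are left at zero. The filter choice is an assumption.
    """
    h, w = len(frame), len(frame[0])
    gx = [[0.0] * w for _ in range(h)]
    gy = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            if 0 < x < w - 1:
                gx[y][x] = frame[y][x + 1] - frame[y][x - 1]
            if 0 < y < h - 1:
                gy[y][x] = frame[y + 1][x] - frame[y - 1][x]
    return gx, gy


def boundary_histogram(prev_frame, next_frame, bins=8):
    """Orientation histogram of the temporal change in spatial gradients.

    Assumed form of a gradient-boundary style descriptor for one frame pair;
    the bin count and the L1 normalization are illustrative choices.
    """
    gx0, gy0 = spatial_gradients(prev_frame)
    gx1, gy1 = spatial_gradients(next_frame)
    hist = [0.0] * bins
    for y in range(len(prev_frame)):
        for x in range(len(prev_frame[0])):
            # Temporal difference of the spatial gradients at this pixel.
            dx = gx1[y][x] - gx0[y][x]
            dy = gy1[y][x] - gy0[y][x]
            mag = math.hypot(dx, dy)
            if mag == 0.0:
                continue  # static pixels contribute nothing
            angle = math.atan2(dy, dx) % (2 * math.pi)
            b = min(int(angle / (2 * math.pi) * bins), bins - 1)
            hist[b] += mag
    total = sum(hist)
    return [v / total for v in hist] if total > 0 else hist


# Toy frames: a vertical edge that moves one pixel to the right.
frame_a = [[0, 0, 1, 1, 1] for _ in range(5)]
frame_b = [[0, 0, 0, 1, 1] for _ in range(5)]

static_hist = boundary_histogram(frame_a, frame_a)
moving_hist = boundary_histogram(frame_a, frame_b)
```

In this toy case a static scene produces an all-zero histogram, so background pixels drop out without a separate subtraction step, which is consistent with the abstract's emphasis on fixed-camera video. A full pipeline would still need the HOG-based human detection, PCA, and bag-of-features stages that the abstract mentions.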

Similar Items