Human Action Recognition from Gradient Boundary Histograms

This thesis presents a framework for automatic recognition of human actions in un- controlled, realistic video data with fixed cameras, such as surveillance videos. In this thesis, we divide human action recognition into three steps: description, representation, and classification of local spatio-te...

Full description

Bibliographic Details
Main Author: Wang, Xuelu
Other Authors: Laganière, Robert
Language:en
Published: Université d'Ottawa / University of Ottawa 2017
Subjects:
Online Access:http://hdl.handle.net/10393/35931
http://dx.doi.org/10.20381/ruor-20212
id ndltd-uottawa.ca-oai-ruor.uottawa.ca-10393-35931
record_format oai_dc
spelling ndltd-uottawa.ca-oai-ruor.uottawa.ca-10393-359312018-01-05T19:03:01Z Human Action Recognition from Gradient Boundary Histograms Wang, Xuelu Laganière, Robert real-time recognition This thesis presents a framework for automatic recognition of human actions in un- controlled, realistic video data with fixed cameras, such as surveillance videos. In this thesis, we divide human action recognition into three steps: description, representation, and classification of local spatio-temporal features. The bag-of-features model was used to build the classifier. Fisher Vectors were also studied. We focus on the potential of the methods, with the joint optimization of two constraints: the classification precision and its efficiency. On the performance side, a new local descriptor, called Gradient Boundary Histograms (GBH), is adopted. It is built on simple spatio-temporal gradients, which can be computed quickly. We demonstrate that GBH can better represent local structure and motion than other gradient-based descriptors, and significantly outperforms them on large datasets. Our evaluation shows that compared to HOG descriptors, which are based solely on spatial gradient, GBH descriptor preserves the recognition precision even in difficult situation. Since surveillance video captured with fixed cameras is the emphasis of our study, removing the background before action recognition is helpful for improving efficiency. We first preprocess the video data by applying HOG to detect humans. GBH descriptor is then used at reduced spatial resolutions, which yields both high efficiency and low memory usage; in addition, we apply PCA to reduce the feature dimensions, which results in fast matching and an accelerated classification process. Experiments our methods achieved good performance in recognizing precision, while simultaneously highlighting effectiveness and efficiency. 2017-03-31T16:28:52Z 2017-03-31T16:28:52Z 2017 Thesis http://hdl.handle.net/10393/35931 http://dx.doi.org/10.20381/ruor-20212 en Université d'Ottawa / University of Ottawa
collection NDLTD
language en
sources NDLTD
topic real-time
recognition
spellingShingle real-time
recognition
Wang, Xuelu
Human Action Recognition from Gradient Boundary Histograms
description This thesis presents a framework for automatic recognition of human actions in un- controlled, realistic video data with fixed cameras, such as surveillance videos. In this thesis, we divide human action recognition into three steps: description, representation, and classification of local spatio-temporal features. The bag-of-features model was used to build the classifier. Fisher Vectors were also studied. We focus on the potential of the methods, with the joint optimization of two constraints: the classification precision and its efficiency. On the performance side, a new local descriptor, called Gradient Boundary Histograms (GBH), is adopted. It is built on simple spatio-temporal gradients, which can be computed quickly. We demonstrate that GBH can better represent local structure and motion than other gradient-based descriptors, and significantly outperforms them on large datasets. Our evaluation shows that compared to HOG descriptors, which are based solely on spatial gradient, GBH descriptor preserves the recognition precision even in difficult situation. Since surveillance video captured with fixed cameras is the emphasis of our study, removing the background before action recognition is helpful for improving efficiency. We first preprocess the video data by applying HOG to detect humans. GBH descriptor is then used at reduced spatial resolutions, which yields both high efficiency and low memory usage; in addition, we apply PCA to reduce the feature dimensions, which results in fast matching and an accelerated classification process. Experiments our methods achieved good performance in recognizing precision, while simultaneously highlighting effectiveness and efficiency.
author2 Laganière, Robert
author_facet Laganière, Robert
Wang, Xuelu
author Wang, Xuelu
author_sort Wang, Xuelu
title Human Action Recognition from Gradient Boundary Histograms
title_short Human Action Recognition from Gradient Boundary Histograms
title_full Human Action Recognition from Gradient Boundary Histograms
title_fullStr Human Action Recognition from Gradient Boundary Histograms
title_full_unstemmed Human Action Recognition from Gradient Boundary Histograms
title_sort human action recognition from gradient boundary histograms
publisher Université d'Ottawa / University of Ottawa
publishDate 2017
url http://hdl.handle.net/10393/35931
http://dx.doi.org/10.20381/ruor-20212
work_keys_str_mv AT wangxuelu humanactionrecognitionfromgradientboundaryhistograms
_version_ 1718598809182797824