Fusion Mechanisms for Human Activity Recognition Using Automated Machine Learning

Human activity recognition has been a branch of interest in the field of computer vision for decades, due to its numerous applications in different domains, such as medicine, surveillance, entertainment or human-computer interaction. We propose an intuitive, effective, quickly trainable and customiz...

Full description

Bibliographic Details
Main Authors: Ana-Cosmina Popescu, Irina Mocanu, Bogdan Cramariuc
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9153764/
id doaj-0a45652df1a94b568328be12f63916a3
record_format Article
spelling doaj-0a45652df1a94b568328be12f63916a32021-03-30T04:51:43ZengIEEEIEEE Access2169-35362020-01-01814399614401410.1109/ACCESS.2020.30134069153764Fusion Mechanisms for Human Activity Recognition Using Automated Machine LearningAna-Cosmina Popescu0https://orcid.org/0000-0002-3843-6091Irina Mocanu1https://orcid.org/0000-0001-5176-9344Bogdan Cramariuc2https://orcid.org/0000-0001-7356-071XComputer Science Department, University POLITEHNICA of Bucharest, Bucharest, RomaniaComputer Science Department, University POLITEHNICA of Bucharest, Bucharest, RomaniaIT Center for Science and Technology, Bucharest, RomaniaHuman activity recognition has been a branch of interest in the field of computer vision for decades, due to its numerous applications in different domains, such as medicine, surveillance, entertainment or human-computer interaction. We propose an intuitive, effective, quickly trainable and customizable system for recognizing human activities designed with an automated machine learning method based on Neural Architecture Search. Information from all channels of a 3D video (RGB and depth data, skeleton and context objects) is merged by independently passing these data streams through 2D convolutional neural networks. The outputs of all networks are combined in a summarizing array of class scores using fusion mechanisms that are not computationally intensive but reflect the meaningful information from a video. The proposed system is tested using three public datasets and a new dataset-PRECIS HAR-that was created in our laboratory. In all our experiments, the system is proven to be highly accurate: 98.43% on MSRDailyActivity3D, 91.41% on UTD-MHAD, 90.95% on NTU RGB+D, and 94.38% on our dataset.https://ieeexplore.ieee.org/document/9153764/Automated machine learningcontextconvolutional neural networksdata fusionhuman activity recognitionRGB-D data
collection DOAJ
language English
format Article
sources DOAJ
author Ana-Cosmina Popescu
Irina Mocanu
Bogdan Cramariuc
spellingShingle Ana-Cosmina Popescu
Irina Mocanu
Bogdan Cramariuc
Fusion Mechanisms for Human Activity Recognition Using Automated Machine Learning
IEEE Access
Automated machine learning
context
convolutional neural networks
data fusion
human activity recognition
RGB-D data
author_facet Ana-Cosmina Popescu
Irina Mocanu
Bogdan Cramariuc
author_sort Ana-Cosmina Popescu
title Fusion Mechanisms for Human Activity Recognition Using Automated Machine Learning
title_short Fusion Mechanisms for Human Activity Recognition Using Automated Machine Learning
title_full Fusion Mechanisms for Human Activity Recognition Using Automated Machine Learning
title_fullStr Fusion Mechanisms for Human Activity Recognition Using Automated Machine Learning
title_full_unstemmed Fusion Mechanisms for Human Activity Recognition Using Automated Machine Learning
title_sort fusion mechanisms for human activity recognition using automated machine learning
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2020-01-01
description Human activity recognition has been a branch of interest in the field of computer vision for decades, due to its numerous applications in different domains, such as medicine, surveillance, entertainment or human-computer interaction. We propose an intuitive, effective, quickly trainable and customizable system for recognizing human activities designed with an automated machine learning method based on Neural Architecture Search. Information from all channels of a 3D video (RGB and depth data, skeleton and context objects) is merged by independently passing these data streams through 2D convolutional neural networks. The outputs of all networks are combined in a summarizing array of class scores using fusion mechanisms that are not computationally intensive but reflect the meaningful information from a video. The proposed system is tested using three public datasets and a new dataset-PRECIS HAR-that was created in our laboratory. In all our experiments, the system is proven to be highly accurate: 98.43% on MSRDailyActivity3D, 91.41% on UTD-MHAD, 90.95% on NTU RGB+D, and 94.38% on our dataset.
topic Automated machine learning
context
convolutional neural networks
data fusion
human activity recognition
RGB-D data
url https://ieeexplore.ieee.org/document/9153764/
work_keys_str_mv AT anacosminapopescu fusionmechanismsforhumanactivityrecognitionusingautomatedmachinelearning
AT irinamocanu fusionmechanismsforhumanactivityrecognitionusingautomatedmachinelearning
AT bogdancramariuc fusionmechanismsforhumanactivityrecognitionusingautomatedmachinelearning
_version_ 1724181059827924992