Fusion Mechanisms for Human Activity Recognition Using Automated Machine Learning
Human activity recognition has been a branch of interest in the field of computer vision for decades, due to its numerous applications in different domains, such as medicine, surveillance, entertainment or human-computer interaction. We propose an intuitive, effective, quickly trainable and customiz...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2020-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9153764/ |
id |
doaj-0a45652df1a94b568328be12f63916a3 |
---|---|
record_format |
Article |
spelling |
doaj-0a45652df1a94b568328be12f63916a32021-03-30T04:51:43ZengIEEEIEEE Access2169-35362020-01-01814399614401410.1109/ACCESS.2020.30134069153764Fusion Mechanisms for Human Activity Recognition Using Automated Machine LearningAna-Cosmina Popescu0https://orcid.org/0000-0002-3843-6091Irina Mocanu1https://orcid.org/0000-0001-5176-9344Bogdan Cramariuc2https://orcid.org/0000-0001-7356-071XComputer Science Department, University POLITEHNICA of Bucharest, Bucharest, RomaniaComputer Science Department, University POLITEHNICA of Bucharest, Bucharest, RomaniaIT Center for Science and Technology, Bucharest, RomaniaHuman activity recognition has been a branch of interest in the field of computer vision for decades, due to its numerous applications in different domains, such as medicine, surveillance, entertainment or human-computer interaction. We propose an intuitive, effective, quickly trainable and customizable system for recognizing human activities designed with an automated machine learning method based on Neural Architecture Search. Information from all channels of a 3D video (RGB and depth data, skeleton and context objects) is merged by independently passing these data streams through 2D convolutional neural networks. The outputs of all networks are combined in a summarizing array of class scores using fusion mechanisms that are not computationally intensive but reflect the meaningful information from a video. The proposed system is tested using three public datasets and a new dataset-PRECIS HAR-that was created in our laboratory. In all our experiments, the system is proven to be highly accurate: 98.43% on MSRDailyActivity3D, 91.41% on UTD-MHAD, 90.95% on NTU RGB+D, and 94.38% on our dataset.https://ieeexplore.ieee.org/document/9153764/Automated machine learningcontextconvolutional neural networksdata fusionhuman activity recognitionRGB-D data |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Ana-Cosmina Popescu Irina Mocanu Bogdan Cramariuc |
spellingShingle |
Ana-Cosmina Popescu Irina Mocanu Bogdan Cramariuc Fusion Mechanisms for Human Activity Recognition Using Automated Machine Learning IEEE Access Automated machine learning context convolutional neural networks data fusion human activity recognition RGB-D data |
author_facet |
Ana-Cosmina Popescu Irina Mocanu Bogdan Cramariuc |
author_sort |
Ana-Cosmina Popescu |
title |
Fusion Mechanisms for Human Activity Recognition Using Automated Machine Learning |
title_short |
Fusion Mechanisms for Human Activity Recognition Using Automated Machine Learning |
title_full |
Fusion Mechanisms for Human Activity Recognition Using Automated Machine Learning |
title_fullStr |
Fusion Mechanisms for Human Activity Recognition Using Automated Machine Learning |
title_full_unstemmed |
Fusion Mechanisms for Human Activity Recognition Using Automated Machine Learning |
title_sort |
fusion mechanisms for human activity recognition using automated machine learning |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2020-01-01 |
description |
Human activity recognition has been a branch of interest in the field of computer vision for decades, due to its numerous applications in different domains, such as medicine, surveillance, entertainment or human-computer interaction. We propose an intuitive, effective, quickly trainable and customizable system for recognizing human activities designed with an automated machine learning method based on Neural Architecture Search. Information from all channels of a 3D video (RGB and depth data, skeleton and context objects) is merged by independently passing these data streams through 2D convolutional neural networks. The outputs of all networks are combined in a summarizing array of class scores using fusion mechanisms that are not computationally intensive but reflect the meaningful information from a video. The proposed system is tested using three public datasets and a new dataset-PRECIS HAR-that was created in our laboratory. In all our experiments, the system is proven to be highly accurate: 98.43% on MSRDailyActivity3D, 91.41% on UTD-MHAD, 90.95% on NTU RGB+D, and 94.38% on our dataset. |
topic |
Automated machine learning context convolutional neural networks data fusion human activity recognition RGB-D data |
url |
https://ieeexplore.ieee.org/document/9153764/ |
work_keys_str_mv |
AT anacosminapopescu fusionmechanismsforhumanactivityrecognitionusingautomatedmachinelearning AT irinamocanu fusionmechanismsforhumanactivityrecognitionusingautomatedmachinelearning AT bogdancramariuc fusionmechanismsforhumanactivityrecognitionusingautomatedmachinelearning |
_version_ |
1724181059827924992 |