A Comprehensive Survey of RGB-Based and Skeleton-Based Human Action Recognition

With the advancement of computer vision, human action recognition (HAR) has shown its broad research worth and application prospects in a wide range of fields such as intelligent security, automatic driving and human-machine interaction. Based on the type of data captured by cameras and sensors, e.g...

Full description

Bibliographic Details
Published in:IEEE Access
Main Authors: Cailing Wang, Jingjing Yan
Format: Article
Language:English
Published: IEEE 2023-01-01
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10143178/
_version_ 1851924132372938752
author Cailing Wang
Jingjing Yan
author_facet Cailing Wang
Jingjing Yan
author_sort Cailing Wang
collection DOAJ
container_title IEEE Access
description With the advancement of computer vision, human action recognition (HAR) has shown its broad research worth and application prospects in a wide range of fields such as intelligent security, automatic driving and human-machine interaction. Based on the type of data captured by cameras and sensors, e.g., RGB, depth, skeleton, and infrared data, HAR methods can be classified into RGB-based and skeleton-based. RGB data is easy and inexpensive to obtain, but RGB-based methods need to cope with a large amount of irrelevant background information and are easily affected by factors such as lighting and shooting angle. The skeleton-based methods eliminate the impact of background variables and require little computational work due to their skeleton-focused features, but they lack the context data necessary for HAR. This paper gives a thorough survey of these two approaches, covering deep learning methods, handcrafted feature extraction methods, common datasets, challenges, and future research directions. The skeleton-based action recognition methods Section specifically presents the most well-liked 2D and 3D pose estimation algorithms. This survey aims to give researchers new to the area or engaged in a long-term study a selection of datasets and algorithms, as well as an overview of the present issues and expected future directions in the field.
format Article
id doaj-art-e31af2f2d0ea41faafd6a68f4b7329cb
institution Directory of Open Access Journals
issn 2169-3536
language English
publishDate 2023-01-01
publisher IEEE
record_format Article
spelling doaj-art-e31af2f2d0ea41faafd6a68f4b7329cb2025-08-19T21:57:09ZengIEEEIEEE Access2169-35362023-01-0111538805389810.1109/ACCESS.2023.328231110143178A Comprehensive Survey of RGB-Based and Skeleton-Based Human Action RecognitionCailing Wang0https://orcid.org/0000-0002-3455-8649Jingjing Yan1School of Computer Science, Xi’an Shiyou University, Xi’an, ChinaSchool of Computer Science, Xi’an Shiyou University, Xi’an, ChinaWith the advancement of computer vision, human action recognition (HAR) has shown its broad research worth and application prospects in a wide range of fields such as intelligent security, automatic driving and human-machine interaction. Based on the type of data captured by cameras and sensors, e.g., RGB, depth, skeleton, and infrared data, HAR methods can be classified into RGB-based and skeleton-based. RGB data is easy and inexpensive to obtain, but RGB-based methods need to cope with a large amount of irrelevant background information and are easily affected by factors such as lighting and shooting angle. The skeleton-based methods eliminate the impact of background variables and require little computational work due to their skeleton-focused features, but they lack the context data necessary for HAR. This paper gives a thorough survey of these two approaches, covering deep learning methods, handcrafted feature extraction methods, common datasets, challenges, and future research directions. The skeleton-based action recognition methods Section specifically presents the most well-liked 2D and 3D pose estimation algorithms. This survey aims to give researchers new to the area or engaged in a long-term study a selection of datasets and algorithms, as well as an overview of the present issues and expected future directions in the field.https://ieeexplore.ieee.org/document/10143178/Action datasetdeep learningpose estimationRGB-based action recognitionskeleton-based action recognitionsystematic survey
spellingShingle Cailing Wang
Jingjing Yan
A Comprehensive Survey of RGB-Based and Skeleton-Based Human Action Recognition
Action dataset
deep learning
pose estimation
RGB-based action recognition
skeleton-based action recognition
systematic survey
title A Comprehensive Survey of RGB-Based and Skeleton-Based Human Action Recognition
title_full A Comprehensive Survey of RGB-Based and Skeleton-Based Human Action Recognition
title_fullStr A Comprehensive Survey of RGB-Based and Skeleton-Based Human Action Recognition
title_full_unstemmed A Comprehensive Survey of RGB-Based and Skeleton-Based Human Action Recognition
title_short A Comprehensive Survey of RGB-Based and Skeleton-Based Human Action Recognition
title_sort comprehensive survey of rgb based and skeleton based human action recognition
topic Action dataset
deep learning
pose estimation
RGB-based action recognition
skeleton-based action recognition
systematic survey
url https://ieeexplore.ieee.org/document/10143178/
work_keys_str_mv AT cailingwang acomprehensivesurveyofrgbbasedandskeletonbasedhumanactionrecognition
AT jingjingyan acomprehensivesurveyofrgbbasedandskeletonbasedhumanactionrecognition
AT cailingwang comprehensivesurveyofrgbbasedandskeletonbasedhumanactionrecognition
AT jingjingyan comprehensivesurveyofrgbbasedandskeletonbasedhumanactionrecognition