A Comprehensive Survey of RGB-Based and Skeleton-Based Human Action Recognition
With the advancement of computer vision, human action recognition (HAR) has shown its broad research worth and application prospects in a wide range of fields such as intelligent security, automatic driving and human-machine interaction. Based on the type of data captured by cameras and sensors, e.g...
| Published in: | IEEE Access |
|---|---|
| Main Authors: | , |
| Format: | Article |
| Language: | English |
| Published: |
IEEE
2023-01-01
|
| Subjects: | |
| Online Access: | https://ieeexplore.ieee.org/document/10143178/ |
| _version_ | 1851924132372938752 |
|---|---|
| author | Cailing Wang Jingjing Yan |
| author_facet | Cailing Wang Jingjing Yan |
| author_sort | Cailing Wang |
| collection | DOAJ |
| container_title | IEEE Access |
| description | With the advancement of computer vision, human action recognition (HAR) has shown its broad research worth and application prospects in a wide range of fields such as intelligent security, automatic driving and human-machine interaction. Based on the type of data captured by cameras and sensors, e.g., RGB, depth, skeleton, and infrared data, HAR methods can be classified into RGB-based and skeleton-based. RGB data is easy and inexpensive to obtain, but RGB-based methods need to cope with a large amount of irrelevant background information and are easily affected by factors such as lighting and shooting angle. The skeleton-based methods eliminate the impact of background variables and require little computational work due to their skeleton-focused features, but they lack the context data necessary for HAR. This paper gives a thorough survey of these two approaches, covering deep learning methods, handcrafted feature extraction methods, common datasets, challenges, and future research directions. The skeleton-based action recognition methods Section specifically presents the most well-liked 2D and 3D pose estimation algorithms. This survey aims to give researchers new to the area or engaged in a long-term study a selection of datasets and algorithms, as well as an overview of the present issues and expected future directions in the field. |
| format | Article |
| id | doaj-art-e31af2f2d0ea41faafd6a68f4b7329cb |
| institution | Directory of Open Access Journals |
| issn | 2169-3536 |
| language | English |
| publishDate | 2023-01-01 |
| publisher | IEEE |
| record_format | Article |
| spelling | doaj-art-e31af2f2d0ea41faafd6a68f4b7329cb2025-08-19T21:57:09ZengIEEEIEEE Access2169-35362023-01-0111538805389810.1109/ACCESS.2023.328231110143178A Comprehensive Survey of RGB-Based and Skeleton-Based Human Action RecognitionCailing Wang0https://orcid.org/0000-0002-3455-8649Jingjing Yan1School of Computer Science, Xi’an Shiyou University, Xi’an, ChinaSchool of Computer Science, Xi’an Shiyou University, Xi’an, ChinaWith the advancement of computer vision, human action recognition (HAR) has shown its broad research worth and application prospects in a wide range of fields such as intelligent security, automatic driving and human-machine interaction. Based on the type of data captured by cameras and sensors, e.g., RGB, depth, skeleton, and infrared data, HAR methods can be classified into RGB-based and skeleton-based. RGB data is easy and inexpensive to obtain, but RGB-based methods need to cope with a large amount of irrelevant background information and are easily affected by factors such as lighting and shooting angle. The skeleton-based methods eliminate the impact of background variables and require little computational work due to their skeleton-focused features, but they lack the context data necessary for HAR. This paper gives a thorough survey of these two approaches, covering deep learning methods, handcrafted feature extraction methods, common datasets, challenges, and future research directions. The skeleton-based action recognition methods Section specifically presents the most well-liked 2D and 3D pose estimation algorithms. This survey aims to give researchers new to the area or engaged in a long-term study a selection of datasets and algorithms, as well as an overview of the present issues and expected future directions in the field.https://ieeexplore.ieee.org/document/10143178/Action datasetdeep learningpose estimationRGB-based action recognitionskeleton-based action recognitionsystematic survey |
| spellingShingle | Cailing Wang Jingjing Yan A Comprehensive Survey of RGB-Based and Skeleton-Based Human Action Recognition Action dataset deep learning pose estimation RGB-based action recognition skeleton-based action recognition systematic survey |
| title | A Comprehensive Survey of RGB-Based and Skeleton-Based Human Action Recognition |
| title_full | A Comprehensive Survey of RGB-Based and Skeleton-Based Human Action Recognition |
| title_fullStr | A Comprehensive Survey of RGB-Based and Skeleton-Based Human Action Recognition |
| title_full_unstemmed | A Comprehensive Survey of RGB-Based and Skeleton-Based Human Action Recognition |
| title_short | A Comprehensive Survey of RGB-Based and Skeleton-Based Human Action Recognition |
| title_sort | comprehensive survey of rgb based and skeleton based human action recognition |
| topic | Action dataset deep learning pose estimation RGB-based action recognition skeleton-based action recognition systematic survey |
| url | https://ieeexplore.ieee.org/document/10143178/ |
| work_keys_str_mv | AT cailingwang acomprehensivesurveyofrgbbasedandskeletonbasedhumanactionrecognition AT jingjingyan acomprehensivesurveyofrgbbasedandskeletonbasedhumanactionrecognition AT cailingwang comprehensivesurveyofrgbbasedandskeletonbasedhumanactionrecognition AT jingjingyan comprehensivesurveyofrgbbasedandskeletonbasedhumanactionrecognition |
