Dynamic Hand Gesture Recognition Using MediaPipe and Transformer

We developed a low-cost, high-performance gesture recognition system with a dynamic hand gesture recognition technique based on the Transformer model combined with MediaPipe. The technique accurately extracts hand gesture key points. The system was designed with eight primary gestures: swipe up, swi...

Full description

Bibliographic Details
Published in:Engineering Proceedings
Main Authors: Hsin-Hua Li, Chen-Chiung Hsieh
Format: Article
Language:English
Published: MDPI AG 2025-09-01
Subjects:
Online Access:https://www.mdpi.com/2673-4591/108/1/22
Description
Summary:We developed a low-cost, high-performance gesture recognition system with a dynamic hand gesture recognition technique based on the Transformer model combined with MediaPipe. The technique accurately extracts hand gesture key points. The system was designed with eight primary gestures: swipe up, swipe down, swipe left, swipe right, thumbs up, OK, click, and enlarge. These gestures serve as alternatives to mouse and keyboard operations, simplifying human–computer interaction interfaces to meet the needs of media system control and presentation switching. The experiment results demonstrated that training deep learning models using the Transformer achieved over 99% accuracy, effectively enhancing recognition performance.
ISSN:2673-4591