AttentionPose: Attention-driven end-to-end model for precise 6D pose estimation

Addressing the complex problem of 6D pose estimation from single RGB images is essential for robotics, augmented reality, and autonomous driving applications. The aim of this study is to overcome limitations in handling scenes with high object occlusion and clutter. We introduce an attention-driven...

Full description

Bibliographic Details
Published in:Journal of Intelligent Systems
Main Authors: Rasheed Mayada Abdalsalam, Farhan Rabah Nori, Jasim Wesam M.
Format: Article
Language:English
Published: De Gruyter 2023-12-01
Subjects:
Online Access:https://doi.org/10.1515/jisys-2023-0153
_version_ 1850267305186951168
author Rasheed Mayada Abdalsalam
Farhan Rabah Nori
Jasim Wesam M.
author_facet Rasheed Mayada Abdalsalam
Farhan Rabah Nori
Jasim Wesam M.
author_sort Rasheed Mayada Abdalsalam
collection DOAJ
container_title Journal of Intelligent Systems
description Addressing the complex problem of 6D pose estimation from single RGB images is essential for robotics, augmented reality, and autonomous driving applications. The aim of this study is to overcome limitations in handling scenes with high object occlusion and clutter. We introduce an attention-driven end-to-end model that builds upon existing methods employing pixel-wise unit vectors and voting for object keypoints. Integrating attention mechanisms allows the model to focus computational resources on salient features, enhancing accuracy. Experimental results using the LINEMOD benchmark dataset demonstrate an accuracy rate of 99.73%, outperforming state-of-the-art approaches. The model also exhibits strong generalization capabilities, achieving an average accuracy of 97.36% on objects not included in the dataset. This work concludes that the attention mechanism significantly elevates the performance and robustness of 6D pose estimation, particularly in challenging environments, and opens new avenues for real-world applications.
format Article
id doaj-art-e24eb0cf472547bb845d7ae324258bbf
institution Directory of Open Access Journals
issn 2191-026X
language English
publishDate 2023-12-01
publisher De Gruyter
record_format Article
spelling doaj-art-e24eb0cf472547bb845d7ae324258bbf2025-08-19T23:44:12ZengDe GruyterJournal of Intelligent Systems2191-026X2023-12-01321228810.1515/jisys-2023-0153AttentionPose: Attention-driven end-to-end model for precise 6D pose estimationRasheed Mayada Abdalsalam0Farhan Rabah Nori1Jasim Wesam M.2College of Computer Science and Information Technology, University of Anbar, Ramadi 31001, IraqControl Systems and Communications Department, Renewable Energy Research Centre, University of Anbar, Ramadi 31001, IraqCollege of Computer Science and Information Technology, University of Anbar, Ramadi 31001, IraqAddressing the complex problem of 6D pose estimation from single RGB images is essential for robotics, augmented reality, and autonomous driving applications. The aim of this study is to overcome limitations in handling scenes with high object occlusion and clutter. We introduce an attention-driven end-to-end model that builds upon existing methods employing pixel-wise unit vectors and voting for object keypoints. Integrating attention mechanisms allows the model to focus computational resources on salient features, enhancing accuracy. Experimental results using the LINEMOD benchmark dataset demonstrate an accuracy rate of 99.73%, outperforming state-of-the-art approaches. The model also exhibits strong generalization capabilities, achieving an average accuracy of 97.36% on objects not included in the dataset. This work concludes that the attention mechanism significantly elevates the performance and robustness of 6D pose estimation, particularly in challenging environments, and opens new avenues for real-world applications.https://doi.org/10.1515/jisys-2023-0153pose estimationrobotic perceptionattention mechanismdeep learningimage segmentationobject localization
spellingShingle Rasheed Mayada Abdalsalam
Farhan Rabah Nori
Jasim Wesam M.
AttentionPose: Attention-driven end-to-end model for precise 6D pose estimation
pose estimation
robotic perception
attention mechanism
deep learning
image segmentation
object localization
title AttentionPose: Attention-driven end-to-end model for precise 6D pose estimation
title_full AttentionPose: Attention-driven end-to-end model for precise 6D pose estimation
title_fullStr AttentionPose: Attention-driven end-to-end model for precise 6D pose estimation
title_full_unstemmed AttentionPose: Attention-driven end-to-end model for precise 6D pose estimation
title_short AttentionPose: Attention-driven end-to-end model for precise 6D pose estimation
title_sort attentionpose attention driven end to end model for precise 6d pose estimation
topic pose estimation
robotic perception
attention mechanism
deep learning
image segmentation
object localization
url https://doi.org/10.1515/jisys-2023-0153
work_keys_str_mv AT rasheedmayadaabdalsalam attentionposeattentiondrivenendtoendmodelforprecise6dposeestimation
AT farhanrabahnori attentionposeattentiondrivenendtoendmodelforprecise6dposeestimation
AT jasimwesamm attentionposeattentiondrivenendtoendmodelforprecise6dposeestimation