AttentionPose: Attention-driven end-to-end model for precise 6D pose estimation
Addressing the complex problem of 6D pose estimation from single RGB images is essential for robotics, augmented reality, and autonomous driving applications. The aim of this study is to overcome limitations in handling scenes with high object occlusion and clutter. We introduce an attention-driven...
| Published in: | Journal of Intelligent Systems |
|---|---|
| Main Authors: | , , |
| Format: | Article |
| Language: | English |
| Published: |
De Gruyter
2023-12-01
|
| Subjects: | |
| Online Access: | https://doi.org/10.1515/jisys-2023-0153 |
| _version_ | 1850267305186951168 |
|---|---|
| author | Rasheed Mayada Abdalsalam Farhan Rabah Nori Jasim Wesam M. |
| author_facet | Rasheed Mayada Abdalsalam Farhan Rabah Nori Jasim Wesam M. |
| author_sort | Rasheed Mayada Abdalsalam |
| collection | DOAJ |
| container_title | Journal of Intelligent Systems |
| description | Addressing the complex problem of 6D pose estimation from single RGB images is essential for robotics, augmented reality, and autonomous driving applications. The aim of this study is to overcome limitations in handling scenes with high object occlusion and clutter. We introduce an attention-driven end-to-end model that builds upon existing methods employing pixel-wise unit vectors and voting for object keypoints. Integrating attention mechanisms allows the model to focus computational resources on salient features, enhancing accuracy. Experimental results using the LINEMOD benchmark dataset demonstrate an accuracy rate of 99.73%, outperforming state-of-the-art approaches. The model also exhibits strong generalization capabilities, achieving an average accuracy of 97.36% on objects not included in the dataset. This work concludes that the attention mechanism significantly elevates the performance and robustness of 6D pose estimation, particularly in challenging environments, and opens new avenues for real-world applications. |
| format | Article |
| id | doaj-art-e24eb0cf472547bb845d7ae324258bbf |
| institution | Directory of Open Access Journals |
| issn | 2191-026X |
| language | English |
| publishDate | 2023-12-01 |
| publisher | De Gruyter |
| record_format | Article |
| spelling | doaj-art-e24eb0cf472547bb845d7ae324258bbf2025-08-19T23:44:12ZengDe GruyterJournal of Intelligent Systems2191-026X2023-12-01321228810.1515/jisys-2023-0153AttentionPose: Attention-driven end-to-end model for precise 6D pose estimationRasheed Mayada Abdalsalam0Farhan Rabah Nori1Jasim Wesam M.2College of Computer Science and Information Technology, University of Anbar, Ramadi 31001, IraqControl Systems and Communications Department, Renewable Energy Research Centre, University of Anbar, Ramadi 31001, IraqCollege of Computer Science and Information Technology, University of Anbar, Ramadi 31001, IraqAddressing the complex problem of 6D pose estimation from single RGB images is essential for robotics, augmented reality, and autonomous driving applications. The aim of this study is to overcome limitations in handling scenes with high object occlusion and clutter. We introduce an attention-driven end-to-end model that builds upon existing methods employing pixel-wise unit vectors and voting for object keypoints. Integrating attention mechanisms allows the model to focus computational resources on salient features, enhancing accuracy. Experimental results using the LINEMOD benchmark dataset demonstrate an accuracy rate of 99.73%, outperforming state-of-the-art approaches. The model also exhibits strong generalization capabilities, achieving an average accuracy of 97.36% on objects not included in the dataset. This work concludes that the attention mechanism significantly elevates the performance and robustness of 6D pose estimation, particularly in challenging environments, and opens new avenues for real-world applications.https://doi.org/10.1515/jisys-2023-0153pose estimationrobotic perceptionattention mechanismdeep learningimage segmentationobject localization |
| spellingShingle | Rasheed Mayada Abdalsalam Farhan Rabah Nori Jasim Wesam M. AttentionPose: Attention-driven end-to-end model for precise 6D pose estimation pose estimation robotic perception attention mechanism deep learning image segmentation object localization |
| title | AttentionPose: Attention-driven end-to-end model for precise 6D pose estimation |
| title_full | AttentionPose: Attention-driven end-to-end model for precise 6D pose estimation |
| title_fullStr | AttentionPose: Attention-driven end-to-end model for precise 6D pose estimation |
| title_full_unstemmed | AttentionPose: Attention-driven end-to-end model for precise 6D pose estimation |
| title_short | AttentionPose: Attention-driven end-to-end model for precise 6D pose estimation |
| title_sort | attentionpose attention driven end to end model for precise 6d pose estimation |
| topic | pose estimation robotic perception attention mechanism deep learning image segmentation object localization |
| url | https://doi.org/10.1515/jisys-2023-0153 |
| work_keys_str_mv | AT rasheedmayadaabdalsalam attentionposeattentiondrivenendtoendmodelforprecise6dposeestimation AT farhanrabahnori attentionposeattentiondrivenendtoendmodelforprecise6dposeestimation AT jasimwesamm attentionposeattentiondrivenendtoendmodelforprecise6dposeestimation |
