FAST GDRNPP: Improving the Speed of State-of-the-Art 6D Object Pose Estimation

Pöllabauer, Thomas; Pramod, Ashwin; Knauthe, Volker; Wahl, Michael

FAST GDRNPP: Improving the Speed of State-of-the-Art 6D Object Pose Estimation

dc.contributor.author	Pöllabauer, Thomas	en_US
dc.contributor.author	Pramod, Ashwin	en_US
dc.contributor.author	Knauthe, Volker	en_US
dc.contributor.author	Wahl, Michael	en_US
dc.contributor.editor	Caputo, Ariel	en_US
dc.contributor.editor	Garro, Valeria	en_US
dc.contributor.editor	Giachetti, Andrea	en_US
dc.contributor.editor	Castellani, Umberto	en_US
dc.contributor.editor	Dulecha, Tinsae Gebrechristos	en_US
dc.date.accessioned	2024-11-11T12:48:04Z
dc.date.available	2024-11-11T12:48:04Z
dc.date.issued	2024
dc.description.abstract	6D object pose estimation involves determining the three-dimensional translation and rotation of an object within a scene and relative to a chosen coordinate system. This problem is of particular interest for many practical applications in industrial tasks such as quality control, bin picking, and robotic manipulation, where both speed and accuracy are critical for real-world deployment. Current models, both classical and deep-learning-based, often struggle with the trade-off between accuracy and latency. Our research focuses on enhancing the speed of a prominent state-of-the-art deep learning model, GDRNPP, while keeping its high accuracy. We employ several techniques to reduce the model size and improve inference time. These techniques include using smaller and quicker backbones, pruning unnecessary parameters, and distillation to transfer knowledge from a large, high-performing model to a smaller, more efficient student model. Our findings demonstrate that the proposed configuration maintains accuracy comparable to the state-of-the-art while significantly improving inference time. This advancement could lead to more efficient and practical applications in various industrial scenarios, thereby enhancing the overall applicability of 6D Object Pose Estimation models in real-world settings.	en_US
dc.description.sectionheaders	Computer Vision
dc.description.seriesinformation	Smart Tools and Applications in Graphics - Eurographics Italian Chapter Conference
dc.identifier.doi	10.2312/stag.20241335
dc.identifier.isbn	978-3-03868-265-3
dc.identifier.issn	2617-4855
dc.identifier.pages	10 pages
dc.identifier.uri	https://doi.org/10.2312/stag.20241335
dc.identifier.uri	https://diglib.eg.org/handle/10.2312/stag20241335
dc.publisher	The Eurographics Association	en_US
dc.rights	Attribution 4.0 International License
dc.rights.uri	https://creativecommons.org/licenses/by/4.0/
dc.subject	CCS Concepts: Computing methodologies → Scene understanding; Object detection; Neural networks
dc.subject	Computing methodologies → Scene understanding
dc.subject	Object detection
dc.subject	Neural networks
dc.title	FAST GDRNPP: Improving the Speed of State-of-the-Art 6D Object Pose Estimation	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: stag20241335.pdf
Size:: 954.65 KB
Format:: Adobe Portable Document Format

Download

Collections

Italian Chapter Conference 2024 - Smart Tools and Apps in Graphics