Jiehong Lin, Lihua Liu, Dekun Lu et al. (4 total)
2023-11-27
2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
10.1109/cvpr52733.2024.02636
137 citations
摘要
Zero-shot 6D object pose estimation involves the detection of novel objects with their 6D poses in cluttered scenes, presenting significant challenges for model generalizability. Fortunately, the recent Segment Anything Model (SAM) has showcased remarkable zero-shot transfer performance, which provides a promising solution to tackle this task. Motivated by this, we introduce SAM-6D, a novel framework designed to realize the task through two steps, including instance segmentation and pose estimat...
The research addresses the challenge of zero-shot 6D object pose estimation, which involves detecting novel objects and estimating their 6D poses in cluttered scenes. This is difficult due to the need for model generalizability to unseen objects.
The study introduces SAM-6D, a framework using the Segment Anything Model (SAM) for instance segmentation and a Pose Estimation Model (PEM) for pose estimation. It uses object matching scores and a two-stage point matching process.
SAM-6D outperforms existing methods on the seven core datasets of the BOP Benchmark for both instance segmentation and pose estimation of novel objects, demonstrating robust generalization capabilities without bells and whistles.