Geometry-Guided Perception for Rapid Prototyping Systems

May 2026

Geometry-Guided Perception for Rapid Prototyping Systems

Authors:

Kevin Tang

Abstract:

Rapid prototyping systems demand both adaptability to new components and precision in physical interaction. In this thesis, we explore how geometry can be used in two complementary ways: as a prompt for detecting novel industrial objects, and as a leverage for fine-grained perception in precision tasks.

CAD-Prompted SAM3 enables instance segmentation directly from geometric specification, supporting detection of unseen objects without object-specific training. In CAD-Prompted Manipulation, we extend it to a zero-shot robotic manipulation pipeline by integrating it with pose estimation and grasp generation. Eye-in-Finger incorporates geometric reasoning with tool-integrated sensing, enabling industrial-level perception accuracy using inexpensive commercial hardware.

Together, these works demonstrate how geometry-guided perception provides a scalable approach for rapid prototyping, enabling systems to both adapt to newly introduced components and achieve the precision required for reliable assembly.

Notes:

@mastersthesis{Tang-2026-88287,
author = {Kevin Tang},
title = {Geometry-Guided Perception for Rapid Prototyping Systems},
year = {2026},
month = {May},
school = {Carnegie Mellon University},
address = {Pittsburgh, PA},
number = {CMU-RI-TR-26-29},
keywords = {Computer Vision, SAM, Segmentation},
}
Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.