Abstract:
Rapid prototyping systems demand both adaptability to new components and precision in physical interaction. In this thesis, we explore how geometry can be used in two complementary ways: as a prompt for detecting novel industrial objects, and as a leverage for fine-grained perception in precision tasks.
CAD-Prompted SAM3 enables instance segmentation directly from geometric specification, supporting detection of unseen objects without object-specific training. In CAD-Prompted Manipulation, we extend it to a zero-shot robotic manipulation pipeline by integrating it with pose estimation and grasp generation. Eye-in-Finger incorporates geometric reasoning with tool-integrated sensing, enabling industrial-level perception accuracy using inexpensive commercial hardware.
Together, these works demonstrate how geometry-guided perception provides a scalable approach for rapid prototyping, enabling systems to both adapt to newly introduced components and achieve the precision required for reliable assembly.
CAD-Prompted SAM3 enables instance segmentation directly from geometric specification, supporting detection of unseen objects without object-specific training. In CAD-Prompted Manipulation, we extend it to a zero-shot robotic manipulation pipeline by integrating it with pose estimation and grasp generation. Eye-in-Finger incorporates geometric reasoning with tool-integrated sensing, enabling industrial-level perception accuracy using inexpensive commercial hardware.
Together, these works demonstrate how geometry-guided perception provides a scalable approach for rapid prototyping, enabling systems to both adapt to newly introduced components and achieve the precision required for reliable assembly.
Notes:
copied = false, 2000);
">
@mastersthesis{Tang-2026-88287,
author = {Kevin Tang},
title = {Geometry-Guided Perception for Rapid Prototyping Systems},
year = {2026},
month = {May},
school = {Carnegie Mellon University},
address = {Pittsburgh, PA},
number = {CMU-RI-TR-26-29},
keywords = {Computer Vision, SAM, Segmentation},
}
author = {Kevin Tang},
title = {Geometry-Guided Perception for Rapid Prototyping Systems},
year = {2026},
month = {May},
school = {Carnegie Mellon University},
address = {Pittsburgh, PA},
number = {CMU-RI-TR-26-29},
keywords = {Computer Vision, SAM, Segmentation},
}