Leveraging Tactile Sensing to Resolve Uncertainty in Contact-Rich Manipulation

May 2026

Leveraging Tactile Sensing to Resolve Uncertainty in Contact-Rich Manipulation

Authors:

Moonyoung Lee

Abstract:

Manipulation in agricultural and unstructured environments requires acting on objects
that are occluded, variably shaped, and in clutter. Such tasks are contact-rich
and long-horizon: the robot must push aside foliage to reveal targets and reason
about latent or hidden state during interaction. Most deployed systems treat contact
as a hazard and often rely on vision alone, which limits deployment from real-world
field settings. This thesis adopts a different perspective: contact is a strategy for obtaining
information where vision cannot provide. Touch can reveal additional spatial,
temporal, and semantic cues, along with other task relevant latent states.
Sensing alone, however, is insufficient. Under partial observability or POMDP
settings, identical current observation can arise from different hidden states, and endto-
end imitation policies often blend conflicting demonstrations or lose track of earlier
interactions as the horizon grows. This thesis argues that augmenting vision with
contact sensing, and conditioning learned policies on a structured representation of
task-relevant hidden states (inferred from sequential contacts), enables reliable manipulation
where vision alone cannot disambiguate hidden states. In POMDP settings,
we show that structured representations of task-relevant hidden states outperform
end-to-end approaches conditioned on raw observation history.
The dissertation develops this argument in three movements. It first establishes
the limits of vision-only manipulation through a precise contact-rich agricultural task,
where occlusion makes sub-centimeter alignment unreliable without physical interaction.
It then shows that vibrotactile sensing via arrays of contact microphones recovers
both the spatial location and the semantic identity of contacts, complementing
visual priors in occluded canopies. Finally, it demonstrates that conditioning learned
policies on structured hidden-state representations, whether a Bayesian belief over
continuous variables or a discrete stage variable resolving temporal ambiguity, yields
history-aware behavior on contact-rich, long-horizon tasks that end-to-end policies
cannot reliably solve.

Notes:

@phdthesis{Lee-2026-88295,
author = {Moonyoung Lee},
title = {Leveraging Tactile Sensing to Resolve Uncertainty in Contact-Rich Manipulation},
year = {2026},
month = {May},
school = {Carnegie Mellon University},
address = {Pittsburgh, PA},
number = {CMU-RI-TR-26-48},
keywords = {contact-rich manipulation, tactile sensing, multisensory learning},
}
Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.