Enhancing Robot Perception and Interaction Through Structured Domain Knowledge

June 2024

Authors:

Sarthak Bhagat

Abstract:

Despite advances in deep learning driven by increased computational power and large datasets, significant challenges remain. These include difficulty handling novel entities, limited mechanisms for human experts to update a model's knowledge, and a lack of interpretability. Overcoming these limitations is crucial for human-centric applications such as assistive robotics. To address these issues, we propose leveraging structured information sources, such as knowledge graphs, to supply additional domain knowledge and thereby enhance the robustness and reliability of deep learning models. Integrating these knowledge sources through neurosymbolic architectures, which combine neural networks with symbolic reasoning, can improve model interpretability, generalization, and flexibility. This approach enables AI systems to better understand complex scenes and human actions, ultimately leading to more reliable and transparent performance in real-world scenarios. Our work highlights the potential of augmenting neural networks with additional domain knowledge. In particular, we demonstrate the benefit of this approach on two tasks: learning novel objects in a sample-efficient manner, and anticipating actions from short video contexts in a human-robot collaborative setting.

Notes:

@mastersthesis{Bhagat-2024-141327,
author = {Sarthak Bhagat},
title = {Enhancing Robot Perception and Interaction Through Structured Domain Knowledge},
year = {2024},
month = {June},
school = {Carnegie Mellon University},
address = {Pittsburgh, PA},
number = {CMU-RI-TR-24-28},
keywords = {domain knowledge, concept learning, few-shot learning, action anticipation, video understanding, robot learning, knowledge graphs},
}