Robot Synesthesia:
In-Hand Manipulation with Visuotactile Sensing

1UC San Diego,  2Tsinghua University,  3University of Illinois Urbana-Champaign,
4UC Berkeley,  5Dongguk University

*Equal Contribution

We propose Robot Synesthesia, a novel visuotactile approach to in-hand object rotation that fuses the visual and tactile modalities. We train our policy in simulation to rotate single or multiple objects around a specified axis, then transfer it to the real robot hand without any real-world data.



Visualization of Point Clouds during Testing

We visualize the real-world point cloud observations used during testing, including camera point clouds and augmented point clouds.

Abstract

Executing contact-rich manipulation tasks necessitates the fusion of tactile and visual feedback. However, the distinct nature of these modalities poses significant challenges. In this paper, we introduce a system that leverages visual and tactile sensory inputs to enable dexterous in-hand manipulation. Specifically, we propose Robot Synesthesia, a novel point cloud-based tactile representation inspired by human tactile-visual synesthesia. This approach allows for the simultaneous and seamless integration of both sensory inputs, offering richer spatial information and facilitating better reasoning about robot actions. The method, trained in a simulated environment and then deployed to a real robot, is applicable to various in-hand object rotation tasks. We also perform comprehensive ablations on how the integration of vision and touch improves reinforcement learning and Sim2Real performance.
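
To make the representation concrete, the following is a minimal sketch of how binary contact signals might be rendered as a tactile point cloud and fused with camera points. It assumes fingertip contact sensors whose 3D positions are known (e.g., from forward kinematics); the function names, the per-contact scatter radius, and the point-origin channel are illustrative assumptions, not the released implementation.

```python
import numpy as np

def tactile_point_cloud(contact_signals, sensor_positions,
                        points_per_contact=8, radius=0.005):
    """Render binary contact signals as small clusters of 3D points at the
    activated sensor locations, so touch shares the visual representation.

    contact_signals:  (N,) array of 0/1 contact readings, one per sensor.
    sensor_positions: (N, 3) sensor positions in the camera/world frame.
    """
    clusters = []
    for active, pos in zip(contact_signals, sensor_positions):
        if active:
            # Scatter a few points around the contact location so the
            # point cloud encoder perceives touch as local geometry.
            offsets = np.random.uniform(-radius, radius,
                                        size=(points_per_contact, 3))
            clusters.append(pos + offsets)
    return np.concatenate(clusters, axis=0) if clusters else np.empty((0, 3))

def fused_observation(camera_cloud, contact_signals, sensor_positions):
    """Stack camera and tactile points into one cloud; a fourth channel
    marks each point's origin (0 = vision, 1 = touch)."""
    touch_cloud = tactile_point_cloud(contact_signals, sensor_positions)
    cam = np.hstack([camera_cloud, np.zeros((len(camera_cloud), 1))])
    tac = np.hstack([touch_cloud, np.ones((len(touch_cloud), 1))])
    return np.vstack([cam, tac])
```

Because touch is expressed as geometry in the same frame as vision, a single point cloud encoder can consume both modalities without a separate fusion module.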

Video

Visualization in the Real World

Tests on Wheel-Wrench Rotation



Tests on Double-Ball Rotation

Our double-ball rotation policy generalizes to real-world balls of various shapes, sizes, and densities.

Two Golf Balls

Two Tomatoes

Two Potatoes


Tomato-Potato

Tomato-Golf Ball

Potato-Golf Ball



Tests on Three-Axis Rotation

Our three-axis rotation policies generalize to various real-world objects.

x-axis

y-axis

z-axis

x-axis

y-axis

z-axis



Comparison with Baselines

We compare our implementation with four baselines. Non-Visual RL is a policy that observes only robot proprioception and contact signals, without any vision. In the labels below, Touch denotes binary contact signals, Cam camera point clouds, Aug augmented point clouds, and Syn the proposed tactile (synesthesia) point clouds. Our visuotactile synesthesia approach generally offers a benefit across these configurations.
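
As a reference for how these configurations differ, here is a hypothetical sketch of assembling each ablation's observation; the stream names and the sensors interface are assumptions for illustration, not the paper's code.

```python
# Observation streams enabled for each configuration (labels as below).
ABLATIONS = {
    "Non-Visual RL":     ("proprio", "touch"),
    "Touch+Cam+Aug":     ("proprio", "touch", "cam", "aug"),
    "Touch+Cam+Aug+Syn": ("proprio", "touch", "cam", "aug", "syn"),  # ours
}

def build_observation(sensors, variant):
    """Gather only the sensor streams enabled for the chosen variant.

    sensors: dict mapping stream name -> zero-argument callable that
             returns the latest reading for that stream.
    """
    return {name: sensors[name]() for name in ABLATIONS[variant]}
```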

Touch+Cam+Aug+Syn (Ours)

Non-Visual RL

Touch+Cam+Aug



Visualization in Simulation