ILeSiA: Interactive Learning of Situational Awareness from Camera Input
Submitted to ICRA 25

Video Materials Citation 🔗 Paper at ArXiv 📰 PDF Source code

Petr Vanc (1), Giovanni Franzese (2), and Jan Kristof Behrens (1), Cosimo Della Santina (2),
Karla Stepanova (1), and Jens Kober (2)

(1) Czech Technical University in Prague, Czech Institute of Informatics, Robotics, and Cybernetics
(2) Cognitive Robotics, Delft University of Technology, The Netherlands
petr.vanc@cvut.cz, G.Franzese@tudelft.nl, jan.kristof.behrens@cvut.cz, C.DellaSantina@tudelft.nl, karla.stepanova@cvut.cz, J.Kober@tudelft.nl


Abstract

Learning from demonstration is a promising way of teaching robots new skills. However, a central problem when executing acquired skills is to recognize risks and failures. This is essential since the demonstrations usually cover only a few mostly successful cases. Inevitable errors during exe- cution require specific reactions that were not apparent in the demonstrations. In this paper, we focus on teaching the robot situational awareness from an initial skill demonstration via kinesthetic teaching and sparse labeling of autonomous skill executions as safe or risky. At runtime, our system, called ILeSiA, detects risks based on the perceived camera images by encoding the images into a low-dimensional latent space representation and training a classifier based on the encoding and the provided labels. In this way, ILeSiA boosts the confidence and safety with which robotic skills can be executed. Our experiments demonstrate that classifiers, trained with only a small amount of user-provided data, can successfully detect numerous risks. The system is flexible because the risk cases are defined by labeling data. This also means that labels can be added as soon as risks are identified by a human supervisor.

Video

Video showing the usage of the proposed system.

Citation

Materials

Uncut video of Open a door skill: 1. Recording demonstration, 2. Playing recorded demonstration with different Robothon box position, 3. Labeling during execution, 4. Skill trial: Detection of door closed at the start and door opened at the end. It repeats when wrong door state is detected.
For more details, see: github.com/platonics-delft/ILeSiA?tab=readme-ov-file#demo-day-with-robothon-box