I. Introduction
Complex, multistage robotic manipulation tasks often arise in robotic manipulation applications of practical interest: from home robots that need to prepare a meal to industrial robots that need to assemble a complex device. Many of the tasks we might want to automate in these settings consist of complex low-level behaviors and also require these behaviors to be sequenced appropriately to perform the overall task. This presents a major challenge: when a robot naïvely executes a sequence of primitive behaviors to perform a complex task, the probability of failing at the task grows multiplicatively with each stage. Advances in perception, control, and robotic learning can enable each stage of a task to be more performant, but as long as sequencing stages naïvely leads to such difficulties, elaborate multistage tasks that consist of a sequence of individually difficult primitives will remain out of reach. In this article, we examine how hierarchical imitation learning (IL) with levels of learned primitives and high-level sequencing can address this issue, in the context of a difficult multistage cable routing task. Our focus is on providing for robustness at both levels of the hierarchy, not by trying to construct perfect robotic skills that never fail, but by endowing both levels of the hierarchy with the ability to correct and recover from mistakes. We study the problem of multistage cable routing, where a robot routes a cable through a series of clips (see Fig. 1). This task is representative of commonly occurring scenarios in manufacturing and maintenance, where a robot might need to route cables or hoses, and provides an interesting challenge for robotic manipulation: each stage of the cable routing task requires handling the deformable cable, reasoning about complex contact patterns between the cable and the clip, accounting for deformations, and also observing that the cable has been clipped into each clip successfully. At the same time, the higher level sequencing of primitives might require retrying a clipping motion, securing the cable by pulling it taut, and dynamically deciding when to advance to the next clip. This task is, therefore, both practically relevant for industrial applications, and captures many of the essential characteristics of multistage manipulation discussed in the preceding paragraph, with challenges and ambiguity at both the lower and upper level necessitating intelligent controllers that combine perception and control, and recovery from failure.