Humans use a lot of tools in daily life to complete different tasks (e.g open wine bottle with a wine-opener, cutting wood with a saw). Without various tools, we fumble at many tasks and even completely fail at some of them. Two key problems when learning to leverage tools are: 1) With multiple tools available, which one should be selected to best solve the task at hand? 2) What actions should be
taken at each instant given the tool-task combination (how should the tool be used)? We argue that the ability to choose and deploy tools is crucial for successful use of tools both for humans and robots. Specifically, in object rearrangement and cleaning tasks, it is important to have the ability to utilize different tools and correctly switch between and deploy suitable tools. Thus, we propose an end-to-end learning framework that jointly learns to choose different tools and deploy tool-conditioned policies with a limited amount of human demonstrations. We evaluate our method on parallel gripper and suction cup picking and placing, brush sweeping, and household rearrangement tasks, generalizing to different configurations, novel objects, and cluttered scenes in the real world.
|