Efficiently Combining Human Demonstrations and Interventions for Safe Training of Autonomous Systems in Real-Time

abstract

This paper investigates how to use different forms of human interaction to safely train autonomous systems in real time by learning from both human demonstrations and interventions. We implement two components of the Cycle-of-Learning for Autonomous Systems, our framework for combining multiple modalities of human interaction. The current effort employs human demonstrations to teach a desired behavior via imitation learning, then leverages intervention data to correct undesired behaviors produced by the imitation learner. Together, these teach novel tasks to an autonomous agent safely after only minutes of training. We demonstrate this method in an autonomous perching task using a quadrotor with continuous roll, pitch, yaw, and throttle commands and imagery captured from a downward-facing camera in a high-fidelity simulated environment. Our method improves task completion performance for the same amount of human interaction when compared to learning from demonstrations alone, while also requiring on average 32% less data to achieve that performance. This provides evidence that combining multiple modes of human interaction can increase both the training speed and overall performance of policies for autonomous systems.
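The two-stage idea in the abstract (imitate demonstrations first, then add the data recorded while a human intervenes) can be illustrated with a toy sketch. This is not the authors' implementation: the paper works with a quadrotor and camera imagery, whereas the example below assumes a one-dimensional state, a linear policy fitted by least squares, and a scripted stand-in for the human. Every name here (`expert_action`, `fit_policy`, `collect_demonstrations`, `run_with_interventions`, the `limit` threshold) is hypothetical and chosen only for illustration.

```python
import numpy as np

def expert_action(state):
    # Hypothetical stand-in for the human: steer the state toward zero.
    return -0.5 * state

def fit_policy(states, actions):
    # Least-squares fit of a linear policy a = w * s
    # (a toy substitute for the learned policy in the paper).
    s = np.asarray(states, dtype=float)
    a = np.asarray(actions, dtype=float)
    return float(s @ a / (s @ s))

def collect_demonstrations(rng, n_steps):
    # Stage 1: the human controls the system for the whole episode.
    states, actions = [], []
    s = rng.uniform(-1.0, 1.0)
    for _ in range(n_steps):
        a = expert_action(s)
        states.append(s)
        actions.append(a)
        s = s + a + rng.normal(scale=0.01)
    return states, actions

def run_with_interventions(rng, w, n_steps, limit=0.8):
    # Stage 2: the learned policy acts; the human takes over only when
    # the state leaves the assumed safe range, and only those
    # intervention samples are recorded.
    states, actions = [], []
    s = rng.uniform(-1.0, 1.0)
    for _ in range(n_steps):
        if abs(s) > limit:
            a = expert_action(s)
            states.append(s)
            actions.append(a)
        else:
            a = w * s
        s = s + a + rng.normal(scale=0.05)
    return states, actions

rng = np.random.default_rng(0)
demo_s, demo_a = collect_demonstrations(rng, 20)
w = fit_policy(demo_s, demo_a)                   # imitation from demonstrations
int_s, int_a = run_with_interventions(rng, w, 200)
w = fit_policy(demo_s + int_s, demo_a + int_a)   # refit on the combined data
```

Because the scripted human here is noise-free and linear, both fits recover the same gain. The sketch shows only how the two data sources are aggregated, not the performance and data-efficiency gains reported in the abstract.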

authors

Valasek, John

published proceedings

THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE

author list (cited authors)

Goecks, V. G., Gremillion, G. M., Lawhern, V. J., Valasek, J., & Waytowich, N. R.

citation count

18

complete list of authors

Goecks, Vinicius G||Gremillion, Gregory M||Lawhern, Vernon J||Valasek, John||Waytowich, Nicholas R

publication date

July 2019

publisher

Association for the Advancement of Artificial Intelligence (AAAI)

published in

Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence Journal

keywords

Behavioral And Social Science
Clinical Research

Digital Object Identifier (DOI)

10.1609/aaai.v33i01.33012462

start page

2462

end page

2470

volume

33

issue

01

URL

http://dx.doi.org/10.1609/aaai.v33i01.33012462

Efficiently Combining Human Demonstrations and Interventions for Safe Training of Autonomous Systems in Real-Time Conference Paper
