Optimal action sequence generation for assistive agents in fixed horizon tasks

K. Baraka, F.S. Melo, M. Couto, M. Veloso

Research output: Contribution to JournalArticleAcademicpeer-review

Abstract

© 2020, Springer Science+Business Media, LLC, part of Springer Nature.Agents providing assistance to humans are faced with the challenge of automatically adjusting the level of assistance to ensure optimal performance. In this work, we argue that identifying the right level of assistance consists in balancing positive assistance outcomes and some (domain-dependent) measure of cost associated with assistive actions. Towards this goal, we contribute a general mathematical framework for structured tasks where an agent playing the role of a ‘provider’—e.g., therapist, teacher—assists a human ‘receiver’—e.g., patient, student. We specifically consider tasks where the provider agent needs to plan a sequence of actions over a fixed time horizon, where actions are organized along a hierarchy with increasing success probabilities, and some associated costs. The goal of the provider is to achieve a success with the lowest expected cost possible. We present OAssistMe, an algorithm that generates cost-optimal action sequences given the action parameters, and investigate several extensions of it, motivated by different potential application domains. We provide an analysis of the algorithms, including proofs for a number of properties of optimal solutions that, we show, align with typical human provider strategies. Finally, we instantiate our theoretical framework in the context of robot-assisted therapy tasks for children with Autism Spectrum Disorder (ASD). In this context, we present methods for determining action parameters based on a survey of domain experts and real child-robot interaction data. Our contributions unlock increased levels of flexibility for agents introduced in a variety of assistive contexts.
Original languageEnglish
Article number33
JournalAutonomous Agents and Multi-Agent Systems
Volume34
Issue number2
DOIs
Publication statusPublished - 1 Oct 2020
Externally publishedYes

Funding

We would like to thank Ana Paiva for her input on the child-robot interaction study. We also thank the reviewers for their valuable suggestions. This research was partially supported by the CMUPERI/HCI/0051/2013 grant, associated with the CMU/Portugal INSIDE project ( http://www.project-inside.pt/ ), as well as national funds through Fundação para a Ciência e a Tecnologia (FCT) with references UID/CEC/50021/2020 and SFRH/BD/128359/2017. The views and conclusions contained in this document are those of the authors only.

FundersFunder number
Fuel Cell Technologies Program
Fundação para a Ciência e a TecnologiaUID/CEC/50021/2020, SFRH/BD/128359/2017

    Fingerprint

    Dive into the research topics of 'Optimal action sequence generation for assistive agents in fixed horizon tasks'. Together they form a unique fingerprint.

    Cite this