TY - JOUR
T1 - Human–robot dialogue annotation for multi-modal common ground
AU - Bonial, Claire
AU - Lukin, Stephanie M.
AU - Abrams, Mitchell
AU - Baker, Anthony
AU - Donatelli, Lucia
AU - Foots, Ashley
AU - Hayes, Cory J.
AU - Henry, Cassidy
AU - Hudson, Taylor
AU - Marge, Matthew
AU - Pollard, Kimberly A.
AU - Artstein, Ron
AU - Traum, David
AU - Voss, Clare R.
N1 - Publisher Copyright:
© This is a U.S. Government work and not under copyright protection in the US; foreign copyright protection may apply 2024.
PY - 2025/6
Y1 - 2025/6
N2 - In this paper, we describe the development of symbolic representations annotated on human–robot dialogue data to make dimensions of meaning accessible to autonomous systems participating in collaborative, natural language dialogue, and to enable common ground with human partners. A particular challenge for establishing common ground arises in remote dialogue (occurring in disaster relief or search-and-rescue tasks), where a human and robot are engaged in a joint navigation and exploration task of an unfamiliar environment, but where the robot cannot immediately share high quality visual information due to limited communication constraints. Engaging in a dialogue provides an effective way to communicate, while on-demand or lower-quality visual information can be supplemented for establishing common ground. Within this paradigm, we capture propositional semantics and the illocutionary force of a single utterance within the dialogue through our Dialogue-AMR annotation, an augmentation of Abstract Meaning Representation. We then capture patterns in how different utterances within and across speaker floors relate to one another in our development of a multi-floor Dialogue Structure annotation schema. Finally, we begin to annotate and analyze the ways in which the visual modalities provide contextual information to the dialogue for overcoming disparities in the collaborators’ understanding of the environment. We conclude by discussing the use-cases, architectures, and systems we have implemented from our annotations that enable physical robots to autonomously engage with humans in bi-directional dialogue and navigation.
AB - In this paper, we describe the development of symbolic representations annotated on human–robot dialogue data to make dimensions of meaning accessible to autonomous systems participating in collaborative, natural language dialogue, and to enable common ground with human partners. A particular challenge for establishing common ground arises in remote dialogue (occurring in disaster relief or search-and-rescue tasks), where a human and robot are engaged in a joint navigation and exploration task of an unfamiliar environment, but where the robot cannot immediately share high quality visual information due to limited communication constraints. Engaging in a dialogue provides an effective way to communicate, while on-demand or lower-quality visual information can be supplemented for establishing common ground. Within this paradigm, we capture propositional semantics and the illocutionary force of a single utterance within the dialogue through our Dialogue-AMR annotation, an augmentation of Abstract Meaning Representation. We then capture patterns in how different utterances within and across speaker floors relate to one another in our development of a multi-floor Dialogue Structure annotation schema. Finally, we begin to annotate and analyze the ways in which the visual modalities provide contextual information to the dialogue for overcoming disparities in the collaborators’ understanding of the environment. We conclude by discussing the use-cases, architectures, and systems we have implemented from our annotations that enable physical robots to autonomously engage with humans in bi-directional dialogue and navigation.
KW - Multi-floor dialogue
KW - Multi-modal dialogue
KW - Semantics
KW - Situated dialogue
UR - http://www.scopus.com/inward/record.url?scp=85209134492&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85209134492&partnerID=8YFLogxK
U2 - 10.1007/s10579-024-09784-2
DO - 10.1007/s10579-024-09784-2
M3 - Article
AN - SCOPUS:85209134492
SN - 1574-020X
VL - 59
SP - 1525
EP - 1575
JO - Language Resources and Evaluation
JF - Language Resources and Evaluation
IS - 2
ER -