Abstract
Learning from Demonstrations (LfD) transfers skills from human teachers to robots. However, data imbalance in demonstrations can bias policies towards majority situations. Previous work attempted to solve this problem after data collection, but few efforts were made to maintain a balanced distribution from the phase of data acquisition. Our method accounts for the influence of robots on human teachers and enables robots to actively guide interaction to approximate demonstration distributions to target distributions. Simulated and real-world experiments validated the method's efficacy in shaping demonstration distribution into various target distributions and robustness to various levels of uncertainties. Also, our method significantly improved the generalization ability of robot learning when LfD policies were trained with data collected by our method compared to natural data collection.
Original language | English |
---|---|
Title of host publication | 2023 32nd IEEE International Conference on Robot and Human Interactive Communication (RO-MAN) |
Subtitle of host publication | [Proceedings] |
Publisher | IEEE Computer Society |
Pages | 1737-1744 |
Number of pages | 8 |
ISBN (Electronic) | 9798350336702 |
ISBN (Print) | 9798350336719 |
DOIs | |
Publication status | Published - 2023 |
Event | 32nd IEEE International Conference on Robot and Human Interactive Communication, RO-MAN 2023 - Busan, Korea, Republic of Duration: 28 Aug 2023 → 31 Aug 2023 |
Publication series
Name | IEEE International Workshop on Robot and Human Communication, RO-MAN |
---|---|
ISSN (Print) | 1944-9445 |
ISSN (Electronic) | 1944-9437 |
Conference
Conference | 32nd IEEE International Conference on Robot and Human Interactive Communication, RO-MAN 2023 |
---|---|
Country/Territory | Korea, Republic of |
City | Busan |
Period | 28/08/23 → 31/08/23 |
Bibliographical note
Publisher Copyright:© 2023 IEEE.
Keywords
- data collection
- data imbalance
- human-robot interaction
- learning from demonstration