TY - GEN
T1 - Improving Confidence in the Estimation of Values and Norms
AU - Siebert, Luciano Cavalcante
AU - Mercuur, Rijk
AU - Dignum, Virginia
AU - van den Hoven, Jeroen
AU - Jonker, Catholijn
PY - 2021
Y1 - 2021
N2 - Autonomous agents (AA) will increasingly be interacting with us in our daily lives. While we want the benefits attached to AAs, it is essential that their behavior is aligned with our values and norms. Hence, an AA will need to estimate the values and norms of the humans it interacts with, which is not a straightforward task when solely observing an agent’s behavior. This paper analyses to what extent an AA is able to estimate the values and norms of a simulated human agent (SHA) based on its actions in the ultimatum game. We present two methods to reduce ambiguity in profiling the SHAs: one based on search space exploration and another based on counterfactual analysis. We found that both methods are able to increase the confidence in estimating human values and norms, but differ in their applicability, the latter being more efficient when the number of interactions with the agent is to be minimized. These insights are useful to improve the alignment of AAs with human values and norms.
AB - Autonomous agents (AA) will increasingly be interacting with us in our daily lives. While we want the benefits attached to AAs, it is essential that their behavior is aligned with our values and norms. Hence, an AA will need to estimate the values and norms of the humans it interacts with, which is not a straightforward task when solely observing an agent’s behavior. This paper analyses to what extent an AA is able to estimate the values and norms of a simulated human agent (SHA) based on its actions in the ultimatum game. We present two methods to reduce ambiguity in profiling the SHAs: one based on search space exploration and another based on counterfactual analysis. We found that both methods are able to increase the confidence in estimating human values and norms, but differ in their applicability, the latter being more efficient when the number of interactions with the agent is to be minimized. These insights are useful to improve the alignment of AAs with human values and norms.
UR - https://www.scopus.com/pages/publications/85107492756
UR - https://www.scopus.com/pages/publications/85107492756#tab=citedBy
U2 - 10.1007/978-3-030-72376-7_6
DO - 10.1007/978-3-030-72376-7_6
M3 - Conference contribution
SN - 9783030723750
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 98
EP - 113
BT - Coordination, Organizations, Institutions, Norms, and Ethics for Governance of Multi-Agent Systems XIII
A2 - Aler Tubella, Andrea
A2 - Cranefield, Stephen
A2 - Frantz, Christopher
A2 - Meneguzzi, Felipe
A2 - Vasconcelos, Wamberto
PB - Springer Science and Business Media Deutschland GmbH
T2 - International Workshop on Coordination, Organizations, Institutions, and Norms for Governance of Multi-Agent Systems, COIN 2017, co-located with AAMAS 2017 and the International Workshop on Coordination, Organizations, Institutions, Norms and Ethics for Governance of Multi-Agent Systems, COINE 2020, co-located with AAMAS 2020
Y2 - 9 May 2020 through 9 May 2020
ER -