Quantifying exploration in reward-based motor learning

Nina M. Van Mastrigt*, Jeroen B.J. Smeets, Katinka Van Der Kooij

*Corresponding author for this work

Research output: Contribution to JournalArticleAcademicpeer-review

Abstract

Exploration in reward-based motor learning is observable in experimental data as increased variability. In order to quantify exploration, we compare three methods for estimating other sources of variability: Sensorimotor noise. We use a task in which participants could receive stochastic binary reward feedback following a target-directed weight shift. Participants first performed six baseline blocks without feedback, and next twenty blocks alternating with and without feedback. Variability was assessed based on trial-to-trial changes in movement endpoint. We estimated sensorimotor noise by the median squared trial-to-trial change in movement endpoint for trials in which no exploration is expected. We identified three types of such trials: Trials in baseline blocks, trials in the blocks without feedback, and rewarded trials in the blocks with feedback. We estimated exploration by the median squared trial-to-trial change following non-rewarded trials minus sensorimotor noise. As expected, variability was larger following non-rewarded trials than following rewarded trials. This indicates that our reward-based weight-shifting task successfully induced exploration. Most importantly, our three estimates of sensorimotor noise differed: The estimate based on rewarded trials was significantly lower than the estimates based on the two types of trials without feedback. Consequently, the estimates of exploration also differed. We conclude that the quantification of exploration depends critically on the type of trials used to estimate sensorimotor noise. We recommend the use of variability following rewarded trials.

Original languageEnglish
Article numbere0226789
JournalPLoS ONE
Volume15
Issue number4
DOIs
Publication statusPublished - 1 Jan 2020

Fingerprint

Dive into the research topics of 'Quantifying exploration in reward-based motor learning'. Together they form a unique fingerprint.

Cite this