TY - JOUR
T1 - A learning rule that explains how rewards teach attention
AU - Rombouts, J.O.
AU - Bohte, S.M.
AU - Martinez-Trujillo, J.
AU - Roelfsema, P.R.
PY - 2015
Y1 - 2015
N2 - Many theories propose that top-down attentional signals control processing in sensory cortices by modulating neural activity. But who controls the controller? Here we investigate how a biologically plausible neural reinforcement learning scheme can create higher order representations and top-down attentional signals. The learning scheme trains neural networks using two factors that gate Hebbian plasticity: (1) an attentional feedback signal from the response-selection stage to earlier processing levels; and (2) a globally available neuromodulator that encodes the reward prediction error. We demonstrate how the neural network learns to direct attention to one of two coloured stimuli that are arranged in a rank-order. Like monkeys trained on this task, the network develops units that are tuned to the rank-order of the colours and it generalizes this newly learned rule to previously unseen colour combinations. These results provide new insight into how individuals can learn to control attention as a function of reward contingency.
AB - Many theories propose that top-down attentional signals control processing in sensory cortices by modulating neural activity. But who controls the controller? Here we investigate how a biologically plausible neural reinforcement learning scheme can create higher order representations and top-down attentional signals. The learning scheme trains neural networks using two factors that gate Hebbian plasticity: (1) an attentional feedback signal from the response-selection stage to earlier processing levels; and (2) a globally available neuromodulator that encodes the reward prediction error. We demonstrate how the neural network learns to direct attention to one of two coloured stimuli that are arranged in a rank-order. Like monkeys trained on this task, the network develops units that are tuned to the rank-order of the colours and it generalizes this newly learned rule to previously unseen colour combinations. These results provide new insight into how individuals can learn to control attention as a function of reward contingency.
UR - https://www.scopus.com/pages/publications/84928938586
UR - https://www.scopus.com/inward/citedby.url?scp=84928938586&partnerID=8YFLogxK
U2 - 10.1080/13506285.2015.1010462
DO - 10.1080/13506285.2015.1010462
M3 - Article
SN - 1350-6285
VL - 23
SP - 179
EP - 205
JO - Visual Cognition
JF - Visual Cognition
IS - 1-2
ER -