Learning reward frequency over reward probability: A tale of two learning rules.

abstract

Learning about the expected value of choice alternatives associated with reward is critical for adaptive behavior. Although human choice preferences are affected by the presentation frequency of reward-related alternatives, this may not be captured by some dominant models of value learning, such as the delta rule. In this study, we examined whether reward learning is driven more by learning the probability of reward provided by each option or by how frequently each option has been rewarded, and assessed how well models based on average reward (e.g., the Delta model) and models based on cumulative reward (e.g., the Decay model) can account for choice preferences. In a binary-outcome choice task, participants selected between pairs of options that had reward probabilities of 0.65 (A) versus 0.35 (B) or 0.75 (C) versus 0.25 (D). Crucially, during training there were twice as many AB trials as CD trials, such that option A was associated with higher cumulative reward, while option C gave higher average reward. Participants then decided between novel combinations of options (e.g., AC). Most participants preferred option A over C, a result predicted by the Decay model but not by the Delta model. We also compared the Delta and Decay models to both simpler and more complex models that assumed additional mechanisms, such as representation of uncertainty. Overall, models that assume learning about cumulative reward provided the best account of the data.
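The contrast between the two learning rules can be illustrated with a small simulation. The sketch below is not the authors' code. It assumes textbook forms of the two rules (Delta: move the chosen option's value toward the outcome by a learning rate; Decay: shrink every option's value each trial, then add the outcome to the chosen option). Only the reward probabilities and the 2:1 ratio of AB to CD trials come from the abstract. The trial counts, the parameters `alpha` and `decay`, the starting values, the random choice policy, and all function names are illustrative assumptions.

```python
import random

# Reward probabilities taken from the abstract.
REWARD_PROB = {"A": 0.65, "B": 0.35, "C": 0.75, "D": 0.25}


def simulate_training(n_ab=100, n_cd=50, alpha=0.1, decay=0.9, rng=None):
    """Run one training phase with random choices (all parameters assumed).

    Returns two dicts of learned values: one from a delta rule (tracks
    average reward) and one from a decay rule (tracks cumulative reward).
    """
    if rng is None:
        rng = random.Random()
    # Twice as many AB trials as CD trials, randomly interleaved.
    trials = [("A", "B")] * n_ab + [("C", "D")] * n_cd
    rng.shuffle(trials)

    delta_values = dict.fromkeys(REWARD_PROB, 0.5)  # assumed starting value
    decay_values = dict.fromkeys(REWARD_PROB, 0.0)  # assumed starting value

    for pair in trials:
        chosen = rng.choice(pair)  # assumed policy: choose at random
        reward = 1.0 if rng.random() < REWARD_PROB[chosen] else 0.0

        # Delta rule: update only the chosen option, using the prediction error.
        delta_values[chosen] += alpha * (reward - delta_values[chosen])

        # Decay rule: every option decays, then the chosen one gains the reward.
        for option in decay_values:
            decay_values[option] *= decay
        decay_values[chosen] += reward

    return delta_values, decay_values


def average_values(n_runs=2000, seed=0):
    """Average the end-of-training values over many simulated runs."""
    rng = random.Random(seed)
    delta_sum = dict.fromkeys(REWARD_PROB, 0.0)
    decay_sum = dict.fromkeys(REWARD_PROB, 0.0)
    for _ in range(n_runs):
        delta_values, decay_values = simulate_training(rng=rng)
        for option in REWARD_PROB:
            delta_sum[option] += delta_values[option]
            decay_sum[option] += decay_values[option]
    delta_mean = {k: v / n_runs for k, v in delta_sum.items()}
    decay_mean = {k: v / n_runs for k, v in decay_sum.items()}
    return delta_mean, decay_mean


if __name__ == "__main__":
    delta_mean, decay_mean = average_values()
    print("Delta rule (average reward):   ", delta_mean)
    print("Decay rule (cumulative reward):", decay_mean)
```

Under these assumptions, the delta-rule values should on average rank C above A, because C has the higher reward probability, while the decay-rule values should rank A above C, because A is encountered and rewarded more often. That is the qualitative dissociation the AC test trials are designed to probe.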

authors

Worthy, Darrell

published proceedings

Cognition

altmetric score

9.05

author list (cited authors)

Don, H. J., Otto, A. R., Cornwall, A. C., Davis, T., & Worthy, D. A.

citation count

7

complete list of authors

Don, Hilary J||Otto, A Ross||Cornwall, Astin C||Davis, Tyler||Worthy, Darrell A

publication date

January 2019

publisher

Elsevier Publisher

published in

Cognition Journal

keywords

Adult
Choice Behavior
Decay Rule
Delta Rule
Female
Humans
Male
Models, Psychological
Prediction Error
Probability Learning
Reinforcement Learning
Reinforcement, Psychology
Reward
Reward Frequency
Young Adult

Digital Object Identifier (DOI)

10.1016/j.cognition.2019.104042

start page

104042

end page

104042

volume

193

URL

http://dx.doi.org/10.1016/j.cognition.2019.104042
