Ventral Striatum and Orbitofrontal Cortex Are Both Required for Model-Based, But Not Model-Free, Reinforcement Learning

Author

Michael McDannald, Federica Lucantonio, Kathryn Burke, Yael Niv, Geoffrey Schoenbaum

Publication Year

2011

Type

Journal Article

Abstract

In many cases, learning is thought to be driven by differences between the value of rewards we expect and rewards we actually receive. Yet learning can also occur when the identity of the reward we receive is not as expected, even if its value remains unchanged. Learning from changes in reward identity implies access to an internal model of the environment, from which information about the identity of the expected reward can be derived. As a result, such learning is not easily accounted for by model-free reinforcement learning theories such as temporal difference reinforcement learning (TDRL), which predicate learning on changes in reward value, but not identity. Here, we used unblocking procedures to assess learning driven by value- versus identity-based prediction errors. Rats were trained to associate distinct visual cues with different food quantities and identities. These cues were subsequently presented in compound with novel auditory cues and the reward quantity or identity was selectively changed. Unblocking was assessed by presenting the auditory cues alone in a probe test. Consistent with neural implementations of TDRL models, we found that the ventral striatum was necessary for learning in response to changes in reward value. However, this area, along with orbitofrontal cortex, was also required for learning driven by changes in reward identity. This observation requires that existing models of TDRL in the ventral striatum be modified to include information about the specific features of expected outcomes derived from model-based representations, and that the role of orbitofrontal cortex in these models be clearly delineated.

Keywords

Animals, Rats, Male, Cues, Basal Ganglia, Analysis of Variance, Acoustic Stimulation, Physiology, Reinforcement (Psychology), Long-Evans, Statistics, Nonparametric, injuries/physiology, Prefrontal Cortex, methods, Associati

Journal

Journal of Neuroscience

Volume

Issue

Pages

2700–2705

Date Published

02/2011

ISSN Number

0270-6474

ISBN

1529-2401 (Electronic); 0270-6474 (Linking)

DOI

10.1523/JNEUROSCI.5499-10.2011