Value Learning through Reinforcement: The Basics of Dopamine and Reinforcement Learning

Nathaniel D. Daw, Philipe N. Tobler

Research output: Chapter in Book/Report/Conference proceedingChapter

Abstract

In order to choose advantageously in many circumstances, the values of choice alternatives have to be learned from experience. We provide an introduction to theoretical and experimental work on reinforcement learning, that is, trial-and-error learning to obtain rewards or avoid punishments. We introduce one version, the temporal-difference learning model, and review evidence that its predictions relate to the firing properties of midbrain dopamine neurons and to activity recorded with functional neuroimaging in humans. We also present evidence that this computational and neurophysiological mechanism affects human and animal behavior in decision and conditioning tasks.

Original languageEnglish (US)
Title of host publicationNeuroeconomics: Decision Making and the Brain: Second Edition
PublisherElsevier Inc.
Pages283-298
Number of pages16
ISBN (Print)9780124160088
DOIs
StatePublished - Sep 2013

Fingerprint

Dopamine
Learning
Animal Behavior
Functional Neuroimaging
Punishment
Dopaminergic Neurons
Mesencephalon
Reward
Reinforcement (Psychology)
Conditioning (Psychology)

Keywords

  • Dopamine
  • Reinforcement learning

ASJC Scopus subject areas

  • Neuroscience(all)

Cite this

Daw, N. D., & Tobler, P. N. (2013). Value Learning through Reinforcement: The Basics of Dopamine and Reinforcement Learning. In Neuroeconomics: Decision Making and the Brain: Second Edition (pp. 283-298). Elsevier Inc.. https://doi.org/10.1016/B978-0-12-416008-8.00015-2

Value Learning through Reinforcement : The Basics of Dopamine and Reinforcement Learning. / Daw, Nathaniel D.; Tobler, Philipe N.

Neuroeconomics: Decision Making and the Brain: Second Edition. Elsevier Inc., 2013. p. 283-298.

Research output: Chapter in Book/Report/Conference proceedingChapter

Daw, ND & Tobler, PN 2013, Value Learning through Reinforcement: The Basics of Dopamine and Reinforcement Learning. in Neuroeconomics: Decision Making and the Brain: Second Edition. Elsevier Inc., pp. 283-298. https://doi.org/10.1016/B978-0-12-416008-8.00015-2
Daw ND, Tobler PN. Value Learning through Reinforcement: The Basics of Dopamine and Reinforcement Learning. In Neuroeconomics: Decision Making and the Brain: Second Edition. Elsevier Inc. 2013. p. 283-298 https://doi.org/10.1016/B978-0-12-416008-8.00015-2
Daw, Nathaniel D. ; Tobler, Philipe N. / Value Learning through Reinforcement : The Basics of Dopamine and Reinforcement Learning. Neuroeconomics: Decision Making and the Brain: Second Edition. Elsevier Inc., 2013. pp. 283-298
@inbook{85674a77fe604d7fabc3b915dd32a2be,
title = "Value Learning through Reinforcement: The Basics of Dopamine and Reinforcement Learning",
abstract = "In order to choose advantageously in many circumstances, the values of choice alternatives have to be learned from experience. We provide an introduction to theoretical and experimental work on reinforcement learning, that is, trial-and-error learning to obtain rewards or avoid punishments. We introduce one version, the temporal-difference learning model, and review evidence that its predictions relate to the firing properties of midbrain dopamine neurons and to activity recorded with functional neuroimaging in humans. We also present evidence that this computational and neurophysiological mechanism affects human and animal behavior in decision and conditioning tasks.",
keywords = "Dopamine, Reinforcement learning",
author = "Daw, {Nathaniel D.} and Tobler, {Philipe N.}",
year = "2013",
month = "9",
doi = "10.1016/B978-0-12-416008-8.00015-2",
language = "English (US)",
isbn = "9780124160088",
pages = "283--298",
booktitle = "Neuroeconomics: Decision Making and the Brain: Second Edition",
publisher = "Elsevier Inc.",

}

TY - CHAP

T1 - Value Learning through Reinforcement

T2 - The Basics of Dopamine and Reinforcement Learning

AU - Daw, Nathaniel D.

AU - Tobler, Philipe N.

PY - 2013/9

Y1 - 2013/9

N2 - In order to choose advantageously in many circumstances, the values of choice alternatives have to be learned from experience. We provide an introduction to theoretical and experimental work on reinforcement learning, that is, trial-and-error learning to obtain rewards or avoid punishments. We introduce one version, the temporal-difference learning model, and review evidence that its predictions relate to the firing properties of midbrain dopamine neurons and to activity recorded with functional neuroimaging in humans. We also present evidence that this computational and neurophysiological mechanism affects human and animal behavior in decision and conditioning tasks.

AB - In order to choose advantageously in many circumstances, the values of choice alternatives have to be learned from experience. We provide an introduction to theoretical and experimental work on reinforcement learning, that is, trial-and-error learning to obtain rewards or avoid punishments. We introduce one version, the temporal-difference learning model, and review evidence that its predictions relate to the firing properties of midbrain dopamine neurons and to activity recorded with functional neuroimaging in humans. We also present evidence that this computational and neurophysiological mechanism affects human and animal behavior in decision and conditioning tasks.

KW - Dopamine

KW - Reinforcement learning

UR - http://www.scopus.com/inward/record.url?scp=84897401223&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84897401223&partnerID=8YFLogxK

U2 - 10.1016/B978-0-12-416008-8.00015-2

DO - 10.1016/B978-0-12-416008-8.00015-2

M3 - Chapter

SN - 9780124160088

SP - 283

EP - 298

BT - Neuroeconomics: Decision Making and the Brain: Second Edition

PB - Elsevier Inc.

ER -