Peer-reviewed article
Abstract: To adapt flexibly to the demands of their environment, animals constantly face the conflict of having to choose between predictably rewarding familiar options (exploitation) and risky novel options, whose value consists essentially in the new information they provide about the space of possible rewards (exploration). Despite extensive research, the mechanisms by which animals solve this exploration-exploitation dilemma remain poorly understood. Here, we investigate human decision-making in a gambling task in which the informational value and the reward potential of each trial were manipulated separately. To better characterize the mechanisms underlying the observed choices, we introduce a computational model that augments the standard reward-based reinforcement learning formulation by assigning a value to information. We find that both reward and information gained during learning influence the balance between exploitation and exploration, and that this influence depends on the reward context. Our results shed light on the mechanisms that underpin decision-making under uncertainty and suggest new approaches for investigating the exploration-exploitation dilemma throughout the animal kingdom.
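The general idea of augmenting reward-based reinforcement learning with a value on information can be illustrated with a minimal sketch. This is a generic, hypothetical implementation, not the authors' actual model: it uses a standard delta-rule value update, a count-based novelty bonus standing in for the informational value of an option, and a softmax choice rule; the bonus weight `omega` and inverse temperature `beta` are illustrative free parameters.

```python
import math
import random

def information_bonus(counts, i):
    # Hypothetical information term: options sampled less often carry a
    # larger bonus, because choosing them reduces more uncertainty.
    total = sum(counts)
    return math.sqrt(math.log(total + 1) / (counts[i] + 1))

def choose(q, counts, beta=3.0, omega=0.5):
    # Combined value = learned reward value + weighted information bonus,
    # passed through a softmax to trade off exploitation vs. exploration.
    v = [q[i] + omega * information_bonus(counts, i) for i in range(len(q))]
    exps = [math.exp(beta * x) for x in v]
    z = sum(exps)
    r, acc = random.random(), 0.0
    for i, e in enumerate(exps):
        acc += e / z
        if r < acc:
            return i
    return len(q) - 1

def update(q, counts, i, reward, alpha=0.1):
    # Standard reward-based reinforcement learning (delta-rule) update.
    q[i] += alpha * (reward - q[i])
    counts[i] += 1
```

Early on, the information bonus dominates and drives sampling of unfamiliar options; as counts grow, the bonus shrinks and choices are increasingly governed by learned reward values, so exploitation takes over.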