INFORL | Characterizing information integration in reinforcement learning: a neuro-computational investigation

Summary
Reinforcement learning (RL) characterizes how we adaptively learn, by trial and errors, to select actions that maximize the occurrence of rewards, and minimize the occurrence of punishments. While the behavioural, computational and neurobiological features of learning from singular experienced outcomes have been extensively studied, the mechanisms by which RL could leverage multiple, concurrent information samples – including abstract information about prospective outcomes – have been largely overlooked.
As a consequence, little is known about how we prioritize, filter or value outcome information in RL, while these processes likely critically contribute to shaping learning behaviour.
This project proposes to address this gap, and hypothesizes that humans can learn from multiple concurrent information samples, but that computational limitations and affective biases curb information integration.
First, using a new experimental and computational framework, I will evidence and quantify these cognitive features. Using eye-tracking and complementary functional neuroimaging modalities (EEG, fMRI), I will build a neuro-computational model of information integration, by deciphering the interactions between attentional parieto-frontal network and the affective ventro-limbic networks during reinforcement learning.
Then, I propose to investigate the strategic modulation of information integration, by investigating the effects of information quantity and quality on learning strategies and on the neural correlates of learning variables.
Finally, I will assess several behavioural interventions to ameliorate information integration and improve learning performance.
By investigating an overlooked aspect of reinforcement learning –the integration of available information–, this project could not only help refine computational and neurobiological models of the learning process, but also shed new lights on maladaptive traits of human behaviour in social and clinical contexts.
Unfold all
/
Fold all
More information & hyperlinks
Web resources: https://cordis.europa.eu/project/id/948671
Start date: 01-09-2021
End date: 31-08-2026
Total budget - Public funding: 1 500 000,00 Euro - 1 500 000,00 Euro
Cordis data

Original description

Reinforcement learning (RL) characterizes how we adaptively learn, by trial and errors, to select actions that maximize the occurrence of rewards, and minimize the occurrence of punishments. While the behavioural, computational and neurobiological features of learning from singular experienced outcomes have been extensively studied, the mechanisms by which RL could leverage multiple, concurrent information samples – including abstract information about prospective outcomes – have been largely overlooked.
As a consequence, little is known about how we prioritize, filter or value outcome information in RL, while these processes likely critically contribute to shaping learning behaviour.
This project proposes to address this gap, and hypothesizes that humans can learn from multiple concurrent information samples, but that computational limitations and affective biases curb information integration.
First, using a new experimental and computational framework, I will evidence and quantify these cognitive features. Using eye-tracking and complementary functional neuroimaging modalities (EEG, fMRI), I will build a neuro-computational model of information integration, by deciphering the interactions between attentional parieto-frontal network and the affective ventro-limbic networks during reinforcement learning.
Then, I propose to investigate the strategic modulation of information integration, by investigating the effects of information quantity and quality on learning strategies and on the neural correlates of learning variables.
Finally, I will assess several behavioural interventions to ameliorate information integration and improve learning performance.
By investigating an overlooked aspect of reinforcement learning –the integration of available information–, this project could not only help refine computational and neurobiological models of the learning process, but also shed new lights on maladaptive traits of human behaviour in social and clinical contexts.

Status

SIGNED

Call topic

ERC-2020-STG

Update Date

27-04-2024
Images
No images available.
Geographical location(s)
Structured mapping
Unfold all
/
Fold all
Horizon 2020
H2020-EU.1. EXCELLENT SCIENCE
H2020-EU.1.1. EXCELLENT SCIENCE - European Research Council (ERC)
ERC-2020
ERC-2020-STG