Goal-directed learning of the statistical structure of the environment

Project description

Reward in statistics-based human decisions

Humans and other animals need to learn to compress their sensory input efficiently - guided by both the statistical regularities in, and the reward available from, the environment. Recent psychological and neuroscience studies show that reward signals duly modulate the representations learned. However, these representations determine how we can recognise objects and concepts, and reward's indirect influence over these remains obscure. Hypothesising that humans use limited computational resources to steer learned representations towards their utility for gaining reward, RELEARN proposes a mathematical model that integrates reinforcement learning and probabilistic representational learning. We will test behavioural and neural predictions in experiments where humans learn to gain reward based on the feature statistics of a simulated environment.

Objective

Learning the statistical buildup of the environment serves the purpose of making good decisions, thus what regularities humans learn and what ones they neglect depends on the relevance towards maximizing reward. Recent studies characterise reward-based modulation of feature representations built by humans and animals both on the behavioural and neural level, but the effect of reward on learning higher-order environmental statistics is unknown. Our hypothesis is that humans do not learn to represent feature co-occurrence statistics if it does not help to predict reward due to resource constraints on computation and storage. We propose a mathematical framework based on Bayesian hierarchical modelling and reinforcement learning to predict the modulatory effect of reward on learned representations. We will test the predictions of the model in a series of experiments where humans need to learn to associate precisely controlled statistical aspects of a naturalistic simulated environment to reward both in the lab and online, in reactive and planning-based tasks. Additional to behaviour, the model will predict the structure of neural representations and their changes over the course of the experiment as well. We will test those predictions using magnetoencephalography during the learning phase of the experiments and decoding analysis to compare model variables to neural responses. The results will contribute to the understanding of representational learning in humans, with potential implications in psychiatry and economics as well as supply the community with novel analytical tools and data. The unique mentoring at the host institution together with the extensive training program including international visits to world-leading collaborators will establish my independent research program in computational neuroscience.

Fields of science

Keywords

Programme(s)

Coordinator

MAX-PLANCK-GESELLSCHAFT ZUR FORDERUNG DER WISSENSCHAFTEN EV

Net EU contribution

€ 174 806,40

Address

HOFGARTENSTRASSE 8
80539 Munchen
Germany

Region

Bayern Oberbayern München, Kreisfreie Stadt

Activity type

Research Organisations

Links

Contact the organisation Website

Participation in EU R&I programmes

HORIZON collaboration network

Total cost

€ 174 806,40

Project description

Reward in statistics-based human decisions

Objective

Fields of science

Keywords

Programme(s)

Topic(s)

Call for proposal

Funding Scheme

Coordinator

Share this page

Download