Better decisions through reinforcement learning

A study on development of reinforcement learning in human adolescence has provided further insights into adolescent reinforcement-based decision-making.

Climate Change and Environment

Decision-making relies on the interaction of several processes, including representation of value, response selection and learning. In adolescence, decisions can often be characterised as being impulsive and high risk with serious consequences. With reinforcement learning, the processes are integrated. It deals with learning to improve one's future choices in order to maximise the occurrence of pleasant events (rewards) and minimise the occurrence of unpleasant events (punishments). Understanding the process that underlies decision-making in adolescence is important, yet the development of reinforcement learning in human adolescence has been studied only recently. DEVELOP-LEARNING (Developmental trajectories for model-free and model-based reinforcement learning: Computational and neural bases) was an EU-funded project devoted to changing this by conducting behaviour experiments. The first experiment involved administrating a novel instrumental learning task involving basic reward and punishment learning, as well as learning from counterfactual (fictive) information to adults and adolescents. It was found that adolescents' performance was not enhanced by counterfactual information; and, compared to adults, adolescents learned preferentially from reward compared to punishment, whereas adults learned equally from both. Results showed that adults and adolescents did not use the same algorithm to solve the learning task. Unlike in adults, adolescents' performance did not take into account counterfactual information. Additionally, adolescents learned to seek rewards rather than to avoid punishments. Adults, on the other hand, learned to seek and avoid both equally. Increasing understanding of computational changes in reinforcement learning during adolescence may provide insights into adolescent value-based decision-making. Results can also have a potential impact on education, since they suggest that adolescents benefit more from positive feedback than from negative feedback in learning tasks.

Keywords

Discover other articles in the same domain of application

New algorithms enable the use of adaptive autonomous underwater vehicles

9 February 2024

Project Information

DEVELOP‐LEARNING

Grant agreement ID: 328822

Project closed

Start date 1 January 2014

End date 31 December 2015

Funded under

Specific programme "People" implementing the Seventh Framework Programme of the European Community for research, technological development and demonstration activities (2007 to 2013)

Total cost

€ 231 283,20

EU contribution

€ 231 283,20

231 283,20

Coordinated by

UNIVERSITY COLLEGE LONDON

Better decisions through reinforcement learning

Keywords

Discover other articles in the same domain of application

Share this page Share this page on social networks

Download Download the content of the page