Periodic Reporting for period 1 - VINCI (The Value of Information and Choice to Improve Control.)
Période du rapport: 2020-01-01 au 2021-12-31
Improving control over ER intake builds on a main strategy: choosing the most valuable option to maximize gains. It is therefore unclear whether IRs can be easily dissociated behaviorally from ERs. If they can be, understanding how these different reward types interact would be revealing about how extrinsic (triggered by ER) and intrinsic (triggered by IR) motivations are combined to guide real-world actions. Evidence suggests that IRs may be encoded by the dopamine (DA) system in a way similar to how DA neurons encode ERs. Midbrain DA neurons send massive projections to the prefrontal cortex (PFC) and the striatum and are central in coding and learning of ERs. These neurons encode a large range of rewarding experiences and generate “reward prediction errors” to signal unpredicted outcomes and changes from conditioned expected value. Under the neural common currency theory, the DA system may compare expected outcomes on a common value scale in order to evaluate which option is preferable. We hypothesize that IRs are encoded in the DA system in a manner consistent with neural common currency theory; that is DA activity should consistently reflect subjects’ preferences when IRs are pitted against different ERs. There is suggestive evidence that basal ganglia nuclei (receiving dense DA inputs), such as the striatum, are involved in encoding preference for choice and IRs.
However none of the previous studies systematically dissociated choice availability from ER intake or tested the dedicated neural networks involved in representing these different forms of reward and motivation. We developed a behavioral task that permits dissociating the value of choice opportunities and ER intake. We rigorously explored these different forms of rewards in primates (humans and monkeys, two species with a comparable DA system that differs from rodents). The goal was to test whether IRs can trigger a distortion of reward intake like a ‘DA bonus’ (i.e. a DA effect not explained by ER expectation), and how this recruits similar or different neural networks.
Our results show a clear preference for free over forced trials in humans since subjects selected in about 70% of the case free trials during the first-stage of the task independently from the ER accumulated. Modeling investigation using reinforcement learning models shows that it was necessary to incorporate overvaluation of extrinsic rewards obtained from free actions to account for choice-seeking behavior. Behavioral results in monkey, performing a similar task, suggested the same pattern.
To test whether DA system can be involved in the encoding of IRs we first asked how deficits of this neural system can affect preference for IR. We first investigated performance of Parkinson’s disease (PD) patients (who suffer from a loss of DA neurons) with (ON) and without (OFF) their treatment using the same task. These patients were subjects using DA medication to compensate their deficits or PD patients who underwent deep brain stimulation surgery of the subthalamic nucleus.
Data indicates that during the first-stage overall preference of the patients for free trials increase after reestablishment of their treatments (i.e. ON treatment > OFF treatment). This set of results postulate a key role of the DA system in intrinsic reward assessment and intrinsic motivation.
Finally we have developed an experimental set-up that will ultimately allow us, in non-human primates, to investigate specifically the calcium-activity of the midbrain DA neurons along with simultaneous electrophysiological recordings of neurons associated to the DA system. The goal is to gain insights about the correlation between the reward encoding by DA neurons and the associated modulation of downstream neurons involved in motivational and learning mechanisms.
Investigating mechanisms of intrinsic reward and intrinsic motivation in primates is a crucial, but under-explored, question in the field of dopamine and reward systems and this multidisciplinary approach allows us to address it. Importantly these systems gather a very large scientific community (e.g. decision-making or learning mechanisms) and are also severely impaired in many diseases. It will provide new insights in primate neurophysiology but also for neurological patients, for whom we show impaired valuation of intrinsic rewards. This could allow for example deep brain stimulation neurosurgeons to improve electrode implantation for patients with several types of deficits such as motor but also motivational dysfunctions.