Achieving the goals of this project was divided into three major milestones, of which milestones 1 and 3 have in practice been realized. The second milestone, focusing on noisy environments, has been subsumed into milestone 3 (integration with experiment) because the scenario of milestone 3 is inherently noisy. The completion of the full project is on-track with the proposed timeframe, and will continue in collaboration with the University of Copenhagen despite early termination. Funding from the EU will continue to be acknowledged in subsequent dissemination of the results.
For these milestones:
1) The first major milestone revolved around the implementation of a reinforcement learning (RL) agent for quantum control. Here, the focus was on implementing an RL agent with a novel set of actions. As a problem to develop this method on, I started the ‘quantum cartpole’ project (in a collaboration with the original MSCA supervisor, Prof. Mark Rudner), in which the RL agent must learn to stabilize a quantum system by pushing it left and right. The benchmark problem of having the agent select pre-determined pushing-strengths has been completed, and we now test if agent can decide whether it wants to increase/decrease the push-strength. These results show that the newer action-set is more difficult to train, but produces more stable outcomes. This has not yet been published.
2&3) The direct integration of an AI optimizer with experiments at Copenhagen University meant an integration with existing controllers. Rather than having a non-tested RL agent optimize an experiment, we started with an evolutionary strategy (CMA-ES). We have successfully demonstrated that CMA-ES can optimize a quantum point contact experiment in-situ, and with the next set of experiments will obtain final data for a publication. The algorithm is robust to noise, and does not require the evaluation of gradients. This setting is similar to an RL agent, turning the original question into: “Can the RL agent outperform CMA-ES by using fewer queries to the experiment?”.
Overview of the results and their exploitation and dissemination:
* The integration of an optimization routine (CMA-ES) directly with the experiment has been completed. A publication on this is being written but has not yet been submitted. Exploitation will follow in the form of a public open-source code library.
* A reinforcement learning agent can indeed learn to control a quantum memory system. A scientific publication will follow, but results are being disseminated at the public website www.quantumcartpole.com.
* A physics-inspired neural network (for future use as a submodule in a reinforcement learning agent) was developed, and published (DOI: 10.1103/physrevresearch.4.l022032)
* Insights into neural networks as classifiers was also gained during this project, and was pubilshed (DOI: 10.21468/scipostphys.11.3.073)
* A novel type of reinforcement learning agent neural network was developed and trained on the problem of quantum error correction (DOI: 10.21468/scipostphys.11.1.005)
In addition, the following other events took place:
• Workshop:
https://indico.fysik.su.se/event/7771/(odnośnik otworzy się w nowym oknie)• Exhibition: Quantum Games at the CultureNight event in Copenhagen
• Training: UCPH ERC Writing Seminar (October 2021)
• Website: www.quantumcartpole.com
• Conferences: 1) Summer school Toulouse:
https://mlqmb.sciencesconf.org/(odnośnik otworzy się w nowym oknie) and 2) CRC Meeting Berlin:
https://www.crc183.uni-koeln.de/crc-183-berlin-conference-2022/(odnośnik otworzy się w nowym oknie)