European Commission logo
English English
CORDIS - EU research results

Deep Bayesian Reinforcement Learning -- Unifying Perception, Planning, and Control


For robots to assist humanity in homes, or hospitals, the capability to manipulate diverse objects is imperative. So far, however, robotic manipulation technology has struggled in managing the uncertainty and unstructuredness that characterize human environments.
Machine learning is a natural approach -- the robot can adapt to a given scenario, even if it was not programmed to handle it beforehand. Indeed, Deep Reinforcement Learning (deep RL), which has recently led to AI breakthroughs in computer games, has been publicized as the learning-based approach to robotics. To date, however, deep RL studies focused on known and observable systems, where uncertainty was resolved by lengthy trial and error. Quickly learning to act in novel environments, as required for robotics, is not yet within our reach.
The crux of the matter is the tight coupling between perception and control under high uncertainty -- the robot must actively reduce uncertainty while also trying to solve the task; for complex and high-dimensional systems, we do not have a suitable algorithmic framework for this.

In this proposal, our overarching goal is to:
Develop the algorithmic framework of using deep learning in problems that tightly couple perception, planning, and control, for advancing robotic AI to reliably manipulate general objects in unstructured environments.

Towards this end, we shall develop neural network representations of uncertainty, and algorithms that estimate uncertainty from data. We will develop theory and algorithms for decision making under uncertainty, bringing in a fresh perspective to the problem based on Bayesian reinforcement learning (Bayes-RL). These advances will allow us to study safety certificates for deep RL, and develop a general and practical methodology for learning-based robotic manipulation under uncertainty, validated on real robot experiments. Aside from robotic manipulation, we expect impact on various fields where decision making plays an important role.

Host institution

Net EU contribution
€ 1 500 000,00
32000 Haifa

See on map

Activity type
Higher or Secondary Education Establishments
Total cost
€ 1 500 000,00

Beneficiaries (1)