Learning in dynamic environments

An EU-funded project established a new paradigm for learning in large-scale, dynamic environments associated with elements of uncertainty.

Industrial Technologies

The overall goal of the project 'Plural reinforcement learning' (PLURELEARN) was to develop algorithms, theory and applications that use a large number of learning approaches and models in a synergetic way. To realise this goal, the project team identified three objectives: developing a learning approach combining learning from a teacher and learning by trial and error; devising a structure discovery methodology for reasoning about uncertainty in high-dimensional Markov processes; and developing approaches for algorithm selection and mini-strategies. The team made progress in meeting these objectives. Research on the first objective resulted in papers on how to use a tutor or expert advice in reinforcement learning paradigms. The work showed new algorithms for the problem of learning from multiple sources, as well as how the algorithms work in medium-scale applications. The problem of structure discovery (objective 2) proved to be quite complex. After developing theoretical and applied aspects of model selection and structure discovery showing the difficulty of detecting dynamic structure, the team developed two approaches for mitigating risks. The first is based on policy gradients and geared toward problems where a simulator is available. The second is based on a robust optimisation approach, where the focus is on a couple of uncertainties between states. For the third objective, researchers designed two strategies that may lead to improved performance. The first was a way to modify options and then generate new, improved options. The second was a way to make use of 'randomly generated' options to expedite planning and learning. The project was successful in developing a new framework for planning and learning in data-driven, variable environments. The research has the potential to open up opportunities for large-scale optimisation of dynamic systems that could have a significant impact on the scale of problems that can be solved.

Keywords

Discover other articles in the same domain of application

Inspiration from the animal kingdom helps robots get back on their feet

25 October 2021

Synthetic skin gives industrial robots a feel for human co-workers

22 November 2021

Helping robots get to grips with the real world

17 May 2021

People-first approach helps build trust in manufacturing AI

1 August 2023

Project Information

PLURELEARN

Grant agreement ID: 249254

Project closed

Start date 1 November 2009

End date 31 October 2013

Funded under

Specific programme "People" implementing the Seventh Framework Programme of the European Community for research, technological development and demonstration activities (2007 to 2013)

Total cost

€ 100 000,00

EU contribution

€ 100 000,00

100 000,00

Coordinated by

TECHNION - ISRAEL INSTITUTE OF TECHNOLOGY
Israel

Keywords

Discover other articles in the same domain of application

Download Download the content of the page