Coevolutionary Policy Search

Project Information

CoPS

Grant agreement ID: 637713

DOI

10.3030/637713

Project closed

EC signature date 2 June 2015

Start date 1 October 2015

End date 30 September 2021

Funded under

EXCELLENT SCIENCE - European Research Council (ERC)

Total cost

€ 1 480 632,00

EU contribution

€ 1 480 632,00

Coordinated by

THE CHANCELLOR, MASTERS AND SCHOLARS OF THE UNIVERSITY OF OXFORD
United Kingdom

Publications

Alternating Optimisation and Quadrature for Robust Control

Author(s): Paul, Supratik; Chatzilygeroudis, Konstantinos; Ciosek, Kamil; Mouret, Jean-Baptiste; Osborne, Michael A.; Whiteson, Shimon
Published in: AAAI 2018 - The Thirty-Second AAAI Conference on Artificial Intelligence, Issue 1, 2018
Publisher: AAAI

Growing Action Spaces

Author(s): Farquhar, Gregory; Gustafson, Laura; Lin, Zeming; Whiteson, Shimon; Usunier, Nicolas; Synnaeve, Gabriel
Published in: Issue 1, 2020
Publisher: ICML

TreeQN and ATreeC: Differentiable Tree-Structured Models for Deep Reinforcement Learning

Author(s): Farquhar, Gregory; Rocktäschel, Tim; Igl, Maximilian; Whiteson, Shimon
Published in: Issue 1, 2018
Publisher: ICLR

DAC: The Double Actor-Critic Architecture for Learning Options

Author(s): Zhang, Shangtong; Whiteson, Shimon
Published in: Issue 1, 2019
Publisher: NeurIPS

VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning

Author(s): Zintgraf, Luisa; Shiarlis, Kyriacos; Igl, Maximilian; Schulze, Sebastian; Gal, Yarin; Hofmann, Katja; Whiteson, Shimon
Published in: Issue 1, 2020
Publisher: ICLR

Fast Context Adaptation via Meta-Learning

Author(s): Zintgraf, Luisa M; Shiarlis, Kyriacos; Kurin, Vitaly; Hofmann, Katja; Whiteson, Shimon
Published in: Issue 1, 2019
Publisher: ICML

Fingerprint Policy Optimisation for Robust Reinforcement Learning

Author(s): Paul, Supratik; Osborne, Michael A.; Whiteson, Shimon
Published in: Issue 1, 2019
Publisher: ICML

MAVEN: Multi-Agent Variational Exploration

Author(s): Mahajan, Anuj; Rashid, Tabish; Samvelyan, Mikayel; Whiteson, Shimon
Published in: Issue 1, 2019
Publisher: NeurIPS

Learning Retrospective Knowledge with Reverse Reinforcement Learning

Author(s): Zhang, Shangtong; Veeriah, Vivek; Whiteson, Shimon
Published in: Issue 1, 2020
Publisher: NeurIPS

Fast Efficient Hyperparameter Tuning for Policy Gradients

Author(s): Paul, Supratik; Kurin, Vitaly; Whiteson, Shimon
Published in: Issue 1, 2019
Publisher: NeurIPS

A Survey of Reinforcement Learning Informed by Natural Language

Author(s): Luketina, Jelena; Nardelli, Nantas; Farquhar, Gregory; Foerster, Jakob; Andreas, Jacob; Grefenstette, Edward; Whiteson, Shimon; Rocktäschel, Tim
Published in: Issue 1, 2019
Publisher: IJCAI

Mean-Variance Policy Iteration for Risk-Averse Reinforcement Learning

Author(s): Zhang, Shangtong; Liu, Bo; Whiteson, Shimon
Published in: Issue 1, 2021
Publisher: AAAI

Breaking the Deadly Triad with a Target Network

Author(s): Zhang, Shangtong; Yao, Hengshuai; Whiteson, Shimon
Published in: Issue 1, 2021
Publisher: ICML

GradientDICE: Rethinking Generalized Offline Estimation of Stationary Values

Author(s): Zhang, Shangtong; Liu, Bo; Whiteson, Shimon
Published in: Issue 1, 2020
Publisher: ICML

UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning

Author(s): Gupta, Tarun; Mahajan, Anuj; Peng, Bei; Böhmer, Wendelin; Whiteson, Shimon
Published in: Issue 1, 2021
Publisher: ICML

Maximizing Information Gain in Partially Observable Environments via Prediction Reward

Author(s): Satsangi, Yash; Lim, Sungsu; Whiteson, Shimon; Oliehoek, Frans; White, Martha
Published in: Issue 1, 2020
Publisher: AAMAS

VIREL: A Variational Inference Framework for Reinforcement Learning

Author(s): Fellows, Matthew; Mahajan, Anuj; Rudner, Tim G. J.; Whiteson, Shimon
Published in: Issue 1, 2019
Publisher: NeurIPS

Loaded DiCE: Trading off Bias and Variance in Any-Order Score Function Estimators for Reinforcement Learning

Author(s): Farquhar, Gregory; Whiteson, Shimon; Foerster, Jakob
Published in: Issue 1, 2019
Publisher: NeurIPS

Average-Reward Off-Policy Policy Evaluation with Function Approximation

Author(s): Zhang, Shangtong; Wan, Yi; Sutton, Richard S.; Whiteson, Shimon
Published in: Issue 1, 2021
Publisher: ICML

Expected Policy Gradients

Author(s): Kamil Ciosek and Shimon Whiteson
Published in: 2018
Publisher: AAAI

Learning with Opponent-Learning Awareness

Author(s): Jakob Foerster, Richard Chen, Maruan Al-Shedivat, Shimon Whiteson, Pieter Abbeel and Igor Mordatch
Published in: 2018
Publisher: AAMAS

QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning

Author(s): Rashid, Tabish; Samvelyan, Mikayel; de Witt, Christian Schroeder; Farquhar, Gregory; Foerster, Jakob; Whiteson, Shimon
Published in: Issue 2, 2018
Publisher: ICML

Expected Policy Gradients for Reinforcement Learning

Author(s): Ciosek, Kamil; Whiteson, Shimon
Published in: Issue 2, 2018
Publisher: AAAI

TACO: Learning Task Decomposition via Temporal Alignment for Control

Author(s): Kyriacos Shiarlis, Markus Wulfmeier, Sasha Salter, Shimon Whiteson and Ingmar Posner
Published in: 2018
Publisher: ICML

DiCE: The Infinitely Differentiable Monte-Carlo Estimator

Author(s): Foerster, Jakob; Farquhar, Gregory; Al-Shedivat, Maruan; Rocktäschel, Tim; Xing, Eric P.; Whiteson, Shimon
Published in: Issue 2, 2018
Publisher: ICML

Deep Variational Reinforcement Learning for POMDPs

Author(s): Igl, Maximilian; Zintgraf, Luisa; Le, Tuan Anh; Wood, Frank; Whiteson, Shimon
Published in: Issue 1, 2018
Publisher: ICML

Fourier Policy Gradients

Author(s): Matthew Fellows, Kamil Ciosek and Shimon Whiteson
Published in: 2018
Publisher: ICML

OFFER: Off-Environment Reinforcement Learning

Author(s): Kamil Ciosek and Shimon Whiteson
Published in: 2017
Publisher: AAAI

Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning

Author(s): Jakob Foerster, Nantas Nardelli, Greg Farquhar, Phil Torr, Pushmeet Kohli and Shimon Whiteson
Published in: 2017
Publisher: ICML

Learning to Communicate with Deep Multi-Agent Reinforcement Learning

Author(s): Jakob Foerster, Yannis Assael, Nando de Freitas and Shimon Whiteson
Published in: 2016
Publisher: NIPS

Counterfactual Multi-Agent Policy Gradients

Author(s): Jakob Foerster, Gregory Farquhar, Triantafyllos Afouras, Nantas Nardelli, Shimon Whiteson
Published in: 2018
Publisher: AAAI

Multi-Agent Common Knowledge Reinforcement Learning

Author(s): de Witt, Christian A. Schroeder; Foerster, Jakob N.; Farquhar, Gregory; Torr, Philip H. S.; Boehmer, Wendelin; Whiteson, Shimon
Published in: Issue 1, 2019
Publisher: NeurIPS

Provably Convergent Two-Timescale Off-Policy Actor-Critic with Function Approximation

Author(s): Zhang, Shangtong; Liu, Bo; Yao, Hengshuai; Whiteson, Shimon
Published in: Issue 1, 2020
Publisher: ICML

Exploration in Approximate Hyper-State Space for Meta Reinforcement Learning

Author(s): Zintgraf, Luisa; Feng, Leo; Lu, Cong; Igl, Maximilian; Hartikainen, Kristian; Hofmann, Katja; Whiteson, Shimon
Published in: Issue 1, 2021
Publisher: ICML

Generalized Off-Policy Actor-Critic

Author(s): Zhang, Shangtong; Boehmer, Wendelin; Whiteson, Shimon
Published in: Issue 1, 2019
Publisher: NeurIPS

Deep Residual Reinforcement Learning

Author(s): Zhang, Shangtong; Boehmer, Wendelin; Whiteson, Shimon
Published in: Issue 1, 2020
Publisher: AAMAS

Optimistic Exploration even with a Pessimistic Initialisation

Author(s): Rashid, Tabish; Peng, Bei; Böhmer, Wendelin; Whiteson, Shimon
Published in: Issue 1, 2020
Publisher: ICLR

Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning

Author(s): Rashid, Tabish; Samvelyan, Mikayel; de Witt, Christian Schroeder; Farquhar, Gregory; Foerster, Jakob; Whiteson, Shimon
Published in: JMLR, Issue 2, 2020, ISSN 1533-7928
Publisher: JMLR

Robust Reinforcement Learning with Bayesian Optimisation and Quadrature

Author(s): Paul, Supratik; Chatzilygeroudis, Konstantinos; Ciosek, Kamil; Mouret, Jean-Baptiste; Osborne, Michael A.; Whiteson, Shimon
Published in: Journal of Machine Learning Research, Microtome Publishing, vol. 21, Issue 3, pp. 1-31, 2020, ISSN 1533-7928
Publisher: JMLR

Expected Policy Gradients for Reinforcement Learning

Author(s): Ciosek, Kamil; Whiteson, Shimon
Published in: JMLR, Issue 1, 2020, ISSN 1533-7928
Publisher: JMLR
