Coevolutionary Policy Search

Informacje na temat projektu

CoPS

Identyfikator umowy o grant: 637713

DOI

10.3030/637713

Projekt został zamknięty

Data podpisania przez KE 2 Czerwca 2015

Data rozpoczęcia 1 Października 2015

Data zakończenia 30 Września 2021

Finansowanie w ramach

EXCELLENT SCIENCE - European Research Council (ERC)

Koszt całkowity

€ 1 480 632,00

Wkład UE

€ 1 480 632,00

1 480 632,00

Koordynowany przez

THE CHANCELLOR, MASTERS AND SCHOLARS OF THE UNIVERSITY OF OXFORD
United Kingdom

CORDIS oferuje możliwość skorzystania z odnośników do publicznie dostępnych publikacji i rezultatów projektów realizowanych w ramach programów ramowych HORYZONT.

Odnośniki do rezultatów i publikacji związanych z poszczególnymi projektami 7PR, a także odnośniki do niektórych konkretnych kategorii wyników, takich jak zbiory danych i oprogramowanie, są dynamicznie pobierane z systemu OpenAIRE .

Publikacje

Alternating Optimisation and Quadrature for Robust Control

Autorzy: Paul, Supratik; Chatzilygeroudis, Konstantinos; Ciosek, Kamil; Mouret, Jean-Baptiste; Osborne, Michael A.; Whiteson, Shimon
Opublikowane w: AAAI 2018 - The Thirty-Second AAAI Conference on Artificial Intelligence, Numer 1, 2018
Wydawca: AAAI

Growing Action Spaces

Autorzy: Farquhar, Gregory; Gustafson, Laura; Lin, Zeming; Whiteson, Shimon; Usunier, Nicolas; Synnaeve, Gabriel
Opublikowane w: Numer 1, 2020
Wydawca: ICML

TreeQN and ATreeC: Differentiable Tree-Structured Models for Deep Reinforcement Learning

Autorzy: Farquhar, Gregory; Rocktäschel, Tim; Igl, Maximilian; Whiteson, Shimon
Opublikowane w: Numer 1, 2018
Wydawca: ICLR

DAC: The Double Actor-Critic Architecture for Learning Options

Autorzy: Zhang, Shangtong; Whiteson, Shimon
Opublikowane w: Numer 1, 2019
Wydawca: NeurIPS

VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning

Autorzy: Zintgraf, Luisa; Shiarlis, Kyriacos; Igl, Maximilian; Schulze, Sebastian; Gal, Yarin; Hofmann, Katja; Whiteson, Shimon
Opublikowane w: Numer 1, 2020
Wydawca: ICLR

Fast Context Adaptation via Meta-Learning

Autorzy: Zintgraf, Luisa M; Shiarlis, Kyriacos; Kurin, Vitaly; Hofmann, Katja; Whiteson, Shimon
Opublikowane w: Numer 1, 2019
Wydawca: ICML

Fingerprint Policy Optimisation for Robust Reinforcement Learning

Autorzy: Paul, Supratik; Osborne, Michael A.; Whiteson, Shimon
Opublikowane w: Numer 1, 2019
Wydawca: ICML

MAVEN: Multi-Agent Variational Exploration

Autorzy: Mahajan, Anuj; Rashid, Tabish; Samvelyan, Mikayel; Whiteson, Shimon
Opublikowane w: Numer 1, 2019
Wydawca: NeurIPS

Learning Retrospective Knowledge with Reverse Reinforcement Learning

Autorzy: Zhang, Shangtong; Veeriah, Vivek; Whiteson, Shimon
Opublikowane w: Numer 1, 2020
Wydawca: NeurIPS

Fast Efficient Hyperparameter Tuning for Policy Gradients

Autorzy: Paul, Supratik; Kurin, Vitaly; Whiteson, Shimon
Opublikowane w: Numer 1, 2019
Wydawca: NeurIPS

A Survey of Reinforcement Learning Informed by Natural Language

Autorzy: Luketina, Jelena; Nardelli, Nantas; Farquhar, Gregory; Foerster, Jakob; Andreas, Jacob; Grefenstette, Edward; Whiteson, Shimon; Rocktäschel, Tim
Opublikowane w: Numer 1, 2019
Wydawca: IJCAI

Mean-Variance Policy Iteration for Risk-Averse Reinforcement Learning

Autorzy: Zhang, Shangtong; Liu, Bo; Whiteson, Shimon
Opublikowane w: Numer 1, 2021
Wydawca: AAAI

Breaking the Deadly Triad with a Target Network

Autorzy: Zhang, Shangtong; Yao, Hengshuai; Whiteson, Shimon
Opublikowane w: Numer 1, 2021
Wydawca: ICML

GradientDICE: Rethinking Generalized Offline Estimation of Stationary Values

Autorzy: Zhang, Shangtong; Liu, Bo; Whiteson, Shimon
Opublikowane w: Numer 1, 2020
Wydawca: ICML

UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning

Autorzy: Gupta, Tarun; Mahajan, Anuj; Peng, Bei; Böhmer, Wendelin; Whiteson, Shimon
Opublikowane w: Numer 1, 2021
Wydawca: ICML

Maximizing Information Gain in Partially Observable Environments via Prediction Reward

Autorzy: Satsangi, Yash; Lim, Sungsu; Whiteson, Shimon; Oliehoek, Frans; White, Martha
Opublikowane w: Numer 1, 2020
Wydawca: AAMAS

VIREL: A Variational Inference Framework for Reinforcement Learning

Autorzy: Fellows, Matthew; Mahajan, Anuj; Rudner, Tim G. J.; Whiteson, Shimon
Opublikowane w: Numer 1, 2019
Wydawca: NeurIPS

Loaded DiCE: Trading off Bias and Variance in Any-Order Score Function Estimators for Reinforcement Learning

Autorzy: Farquhar, Gregory; Whiteson, Shimon; Foerster, Jakob
Opublikowane w: Numer 1, 2019
Wydawca: NeurIPS

Average-Reward Off-Policy Policy Evaluation with Function Approximation

Autorzy: Zhang, Shangtong; Wan, Yi; Sutton, Richard S.; Whiteson, Shimon
Opublikowane w: Numer 1, 2021
Wydawca: ICML

Expected Policy Gradients

Autorzy: Kamil Ciosek Shimon Whiteson
Opublikowane w: 2018
Wydawca: AAAI

Alternating Optimisation and Quadrature for Robust Control

Autorzy: Supratik Paul‚ Konstantinos Chatzilygeroudis‚ Kamil Ciosek‚ Jean−Baptiste Mouret‚ Michael Osborne and Shimon Whiteson
Opublikowane w: 2018
Wydawca: AAAI

Learning with Opponent−Learning Awareness

Autorzy: Jakob Foerster‚ Richard Chen‚ Maruan Al−Shedivat‚ Shimon Whiteson‚ Pieter Abbeel and Igor Mordatch
Opublikowane w: 2018
Wydawca: AAMAS

TreeQN and ATreeC: Differentiable Tree−Structured Models for Deep Reinforcement Learning

Autorzy: Gregory Farquhar‚ Tim Rocktaschel‚ Maximilian Igl and Shimon Whiteson
Opublikowane w: 2018
Wydawca: ICLR

QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning

Autorzy: Rashid, Tabish; Samvelyan, Mikayel; de Witt, Christian Schroeder; Farquhar, Gregory; Foerster, Jakob; Whiteson, Shimon
Opublikowane w: Numer 2, 2018
Wydawca: ICML

Expected Policy Gradients for Reinforcement Learning

Autorzy: Ciosek, Kamil; Whiteson, Shimon
Opublikowane w: Numer 2, 2018
Wydawca: AAAI

TACO: Learning Task Decomposition via Temporal Alignment for Control

Autorzy: Kyriacos Shiarlis‚ Markus Wulfmeier‚ Sasha Salter‚ Shimon Whiteson and Ingmar Posner
Opublikowane w: 2018
Wydawca: ICML

DiCE: The Infinitely Differentiable Monte-Carlo Estimator

Autorzy: Foerster, Jakob; Farquhar, Gregory; Al-Shedivat, Maruan; Rocktäschel, Tim; Xing, Eric P.; Whiteson, Shimon
Opublikowane w: Numer 2, 2018
Wydawca: ICML

Deep Variational Reinforcement Learning for POMDPs

Autorzy: Igl, Maximilian; Zintgraf, Luisa; Le, Tuan Anh; Wood, Frank; Whiteson, Shimon
Opublikowane w: Numer 1, 2018
Wydawca: ICML

Fourier Policy Gradients

Autorzy: Matthew Fellows‚ Kamil Ciosek and Shimon Whiteson
Opublikowane w: 2018
Wydawca: ICML

OFFER: Off−Environment Reinforcement Learning

Autorzy: Kamil Ciosek and Shimon Whiteson
Opublikowane w: 2017
Wydawca: AAAI

Stabilising Experience Replay for Deep Multi−Agent Reinforcement Learning

Autorzy: Jakob Foerster‚ Nantas Nardelli‚ Greg Farquhar‚ Phil Torr‚ Pushmeet Kohli and Shimon Whiteson
Opublikowane w: 2017
Wydawca: ICML

Learning to Communicate with Deep Multi−Agent Reinforcement Learning

Autorzy: Jakob Foerster‚ Yannis Assael‚ Nando de Freitas and Shimon Whiteson
Opublikowane w: 2016
Wydawca: NIPS

Counterfactual Multi-Agent Policy Gradients

Autorzy: Jakob Foerster, Gregory Farquhar, Triantafyllos Afouras, Nantas Nardelli, Shimon Whiteson
Opublikowane w: 2018
Wydawca: AAAI

Multi-Agent Common Knowledge Reinforcement Learning

Autorzy: de Witt, Christian A. Schroeder; Foerster, Jakob N.; Farquhar, Gregory; Torr, Philip H. S.; Boehmer, Wendelin; Whiteson, Shimon
Opublikowane w: Numer 1, 2019
Wydawca: NeurIPS

Provably Convergent Two-Timescale Off-Policy Actor-Critic with Function Approximation

Autorzy: Zhang, Shangtong; Liu, Bo; Yao, Hengshuai; Whiteson, Shimon
Opublikowane w: Numer 1, 2020
Wydawca: ICML

Exploration in Approximate Hyper-State Space for Meta Reinforcement Learning

Autorzy: Zintgraf, Luisa; Feng, Leo; Lu, Cong; Igl, Maximilian; Hartikainen, Kristian; Hofmann, Katja; Whiteson, Shimon
Opublikowane w: Numer 1, 2021
Wydawca: ICML

Generalized Off-Policy Actor-Critic

Autorzy: Zhang, Shangtong; Boehmer, Wendelin; Whiteson, Shimon
Opublikowane w: Numer 1, 2019
Wydawca: NeurIPS

Deep Residual Reinforcement Learning

Autorzy: Zhang, Shangtong; Boehmer, Wendelin; Whiteson, Shimon
Opublikowane w: Numer 1, 2020
Wydawca: AAMAS

Optimistic Exploration even with a Pessimistic Initialisation

Autorzy: Rashid, Tabish; Peng, Bei; Böhmer, Wendelin; Whiteson, Shimon
Opublikowane w: Numer 1, 2020
Wydawca: ICLR

Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning

Autorzy: Rashid, Tabish; Samvelyan, Mikayel; de Witt, Christian Schroeder; Farquhar, Gregory; Foerster, Jakob; Whiteson, Shimon
Opublikowane w: JMLR, Numer 2, 2020, ISSN 1533-7928
Wydawca: JMLR

Robust Reinforcement Learning with Bayesian Optimisation and Quadrature

Autorzy: Paul, Supratik; Chatzilygeroudis, Konstantinos; Ciosek, Kamil; Mouret, Jean-Baptiste; Osborne, Michael,; Whiteson, Shimon
Opublikowane w: Journal of Machine Learning Research, Microtome Publishing, 2020, 21, pp.1 - 31, Numer 3, 2020, ISSN 1533-7928
Wydawca: JMLR

Expected Policy Gradients for Reinforcement Learning

Autorzy: Ciosek, Kamil; Whiteson, Shimon
Opublikowane w: JMLR, Numer 1, 2020, ISSN 1533-7928
Wydawca: JMLR

Wyszukiwanie danych OpenAIRE...

Publikacje

Pobierz Pobierz zawartość strony