Skip to main content

Coevolutionary Policy Search

Searching for OpenAIRE data...

Publications

Alternating Optimisation and Quadrature for Robust Control

Author(s): Paul, Supratik; Chatzilygeroudis, Konstantinos; Ciosek, Kamil; Mouret, Jean-Baptiste; Osborne, Michael A.; Whiteson, Shimon
Published in: AAAI 2018 - The Thirty-Second AAAI Conference on Artificial Intelligence, 1, 2018
Publisher: AAAI

Growing Action Spaces

Author(s): Farquhar, Gregory; Gustafson, Laura; Lin, Zeming; Whiteson, Shimon; Usunier, Nicolas; Synnaeve, Gabriel
Published in: 1, 2020
Publisher: ICML

TreeQN and ATreeC: Differentiable Tree-Structured Models for Deep Reinforcement Learning

Author(s): Farquhar, Gregory; Rocktäschel, Tim; Igl, Maximilian; Whiteson, Shimon
Published in: 1, 2018
Publisher: ICLR

DAC: The Double Actor-Critic Architecture for Learning Options

Author(s): Zhang, Shangtong; Whiteson, Shimon
Published in: 1, 2019
Publisher: NeurIPS

VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning

Author(s): Zintgraf, Luisa; Shiarlis, Kyriacos; Igl, Maximilian; Schulze, Sebastian; Gal, Yarin; Hofmann, Katja; Whiteson, Shimon
Published in: 1, 2020
Publisher: ICLR

Fast Context Adaptation via Meta-Learning

Author(s): Zintgraf, Luisa M; Shiarlis, Kyriacos; Kurin, Vitaly; Hofmann, Katja; Whiteson, Shimon
Published in: 1, 2019
Publisher: ICML

Fingerprint Policy Optimisation for Robust Reinforcement Learning

Author(s): Paul, Supratik; Osborne, Michael A.; Whiteson, Shimon
Published in: 1, 2019
Publisher: ICML

MAVEN: Multi-Agent Variational Exploration

Author(s): Mahajan, Anuj; Rashid, Tabish; Samvelyan, Mikayel; Whiteson, Shimon
Published in: 1, 2019
Publisher: NeurIPS

Learning Retrospective Knowledge with Reverse Reinforcement Learning

Author(s): Zhang, Shangtong; Veeriah, Vivek; Whiteson, Shimon
Published in: 1, 2020
Publisher: NeurIPS

Fast Efficient Hyperparameter Tuning for Policy Gradients

Author(s): Paul, Supratik; Kurin, Vitaly; Whiteson, Shimon
Published in: 1, 2019
Publisher: NeurIPS

A Survey of Reinforcement Learning Informed by Natural Language

Author(s): Luketina, Jelena; Nardelli, Nantas; Farquhar, Gregory; Foerster, Jakob; Andreas, Jacob; Grefenstette, Edward; Whiteson, Shimon; Rocktäschel, Tim
Published in: 1, 2019
Publisher: IJCAI

Mean-Variance Policy Iteration for Risk-Averse Reinforcement Learning

Author(s): Zhang, Shangtong; Liu, Bo; Whiteson, Shimon
Published in: 1, 2021
Publisher: AAAI

Breaking the Deadly Triad with a Target Network

Author(s): Zhang, Shangtong; Yao, Hengshuai; Whiteson, Shimon
Published in: 1, 2021
Publisher: ICML

GradientDICE: Rethinking Generalized Offline Estimation of Stationary Values

Author(s): Zhang, Shangtong; Liu, Bo; Whiteson, Shimon
Published in: 1, 2020
Publisher: ICML

UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning

Author(s): Gupta, Tarun; Mahajan, Anuj; Peng, Bei; Böhmer, Wendelin; Whiteson, Shimon
Published in: 1, 2021
Publisher: ICML

Maximizing Information Gain in Partially Observable Environments via Prediction Reward

Author(s): Satsangi, Yash; Lim, Sungsu; Whiteson, Shimon; Oliehoek, Frans; White, Martha
Published in: 1, 2020
Publisher: AAMAS

VIREL: A Variational Inference Framework for Reinforcement Learning

Author(s): Fellows, Matthew; Mahajan, Anuj; Rudner, Tim G. J.; Whiteson, Shimon
Published in: 1, 2019
Publisher: NeurIPS

Loaded DiCE: Trading off Bias and Variance in Any-Order Score Function Estimators for Reinforcement Learning

Author(s): Farquhar, Gregory; Whiteson, Shimon; Foerster, Jakob
Published in: 1, 2019
Publisher: NeurIPS

Average-Reward Off-Policy Policy Evaluation with Function Approximation

Author(s): Zhang, Shangtong; Wan, Yi; Sutton, Richard S.; Whiteson, Shimon
Published in: 1, 2021
Publisher: ICML

Expected Policy Gradients

Author(s): Kamil Ciosek Shimon Whiteson
Published in: 2018
Publisher: AAAI

Alternating Optimisation and Quadrature for Robust Control

Author(s): Supratik Paul‚ Konstantinos Chatzilygeroudis‚ Kamil Ciosek‚ Jean−Baptiste Mouret‚ Michael Osborne and Shimon Whiteson
Published in: 2018
Publisher: AAAI

Learning with Opponent−Learning Awareness

Author(s): Jakob Foerster‚ Richard Chen‚ Maruan Al−Shedivat‚ Shimon Whiteson‚ Pieter Abbeel and Igor Mordatch
Published in: 2018
Publisher: AAMAS

TreeQN and ATreeC: Differentiable Tree−Structured Models for Deep Reinforcement Learning

Author(s): Gregory Farquhar‚ Tim Rocktaschel‚ Maximilian Igl and Shimon Whiteson
Published in: 2018
Publisher: ICLR

QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning

Author(s): Rashid, Tabish; Samvelyan, Mikayel; de Witt, Christian Schroeder; Farquhar, Gregory; Foerster, Jakob; Whiteson, Shimon
Published in: 2, 2018
Publisher: ICML

Expected Policy Gradients for Reinforcement Learning

Author(s): Ciosek, Kamil; Whiteson, Shimon
Published in: 2, 2018
Publisher: AAAI

TACO: Learning Task Decomposition via Temporal Alignment for Control

Author(s): Kyriacos Shiarlis‚ Markus Wulfmeier‚ Sasha Salter‚ Shimon Whiteson and Ingmar Posner
Published in: 2018
Publisher: ICML

DiCE: The Infinitely Differentiable Monte-Carlo Estimator

Author(s): Foerster, Jakob; Farquhar, Gregory; Al-Shedivat, Maruan; Rocktäschel, Tim; Xing, Eric P.; Whiteson, Shimon
Published in: 2, 2018
Publisher: ICML

Deep Variational Reinforcement Learning for POMDPs

Author(s): Igl, Maximilian; Zintgraf, Luisa; Le, Tuan Anh; Wood, Frank; Whiteson, Shimon
Published in: 1, 2018
Publisher: ICML

Fourier Policy Gradients

Author(s): Matthew Fellows‚ Kamil Ciosek and Shimon Whiteson
Published in: 2018
Publisher: ICML

OFFER: Off−Environment Reinforcement Learning

Author(s): Kamil Ciosek and Shimon Whiteson
Published in: 2017
Publisher: AAAI

Stabilising Experience Replay for Deep Multi−Agent Reinforcement Learning

Author(s): Jakob Foerster‚ Nantas Nardelli‚ Greg Farquhar‚ Phil Torr‚ Pushmeet Kohli and Shimon Whiteson
Published in: 2017
Publisher: ICML

Learning to Communicate with Deep Multi−Agent Reinforcement Learning

Author(s): Jakob Foerster‚ Yannis Assael‚ Nando de Freitas and Shimon Whiteson
Published in: 2016
Publisher: NIPS

Counterfactual Multi-Agent Policy Gradients

Author(s): Jakob Foerster, Gregory Farquhar, Triantafyllos Afouras, Nantas Nardelli, Shimon Whiteson
Published in: 2018
Publisher: AAAI

Multi-Agent Common Knowledge Reinforcement Learning

Author(s): de Witt, Christian A. Schroeder; Foerster, Jakob N.; Farquhar, Gregory; Torr, Philip H. S.; Boehmer, Wendelin; Whiteson, Shimon
Published in: 1, 2019
Publisher: NeurIPS

Provably Convergent Two-Timescale Off-Policy Actor-Critic with Function Approximation

Author(s): Zhang, Shangtong; Liu, Bo; Yao, Hengshuai; Whiteson, Shimon
Published in: 1, 2020
Publisher: ICML

Exploration in Approximate Hyper-State Space for Meta Reinforcement Learning

Author(s): Zintgraf, Luisa; Feng, Leo; Lu, Cong; Igl, Maximilian; Hartikainen, Kristian; Hofmann, Katja; Whiteson, Shimon
Published in: 1, 2021
Publisher: ICML

Generalized Off-Policy Actor-Critic

Author(s): Zhang, Shangtong; Boehmer, Wendelin; Whiteson, Shimon
Published in: 1, 2019
Publisher: NeurIPS

Deep Residual Reinforcement Learning

Author(s): Zhang, Shangtong; Boehmer, Wendelin; Whiteson, Shimon
Published in: 1, 2020
Publisher: AAMAS

Optimistic Exploration even with a Pessimistic Initialisation

Author(s): Rashid, Tabish; Peng, Bei; Böhmer, Wendelin; Whiteson, Shimon
Published in: 1, 2020
Publisher: ICLR