Provably Efficient Algorithms for Large-Scale Reinforcement Learning

Informacje na temat projektu

SCALER

Identyfikator umowy o grant: 950180

Strona internetowa projektu

DOI

10.3030/950180

Data podpisania przez KE 3 Września 2020

Data rozpoczęcia 1 Października 2021

Data zakończenia 30 Września 2026

Finansowanie w ramach

EXCELLENT SCIENCE - European Research Council (ERC)

Koszt całkowity

€ 1 493 990,00

Wkład UE

€ 1 493 990,00

1 493 990,00

Koordynowany przez

UNIVERSIDAD POMPEU FABRA
Spain

CORDIS oferuje możliwość skorzystania z odnośników do publicznie dostępnych publikacji i rezultatów projektów realizowanych w ramach programów ramowych HORYZONT.

Odnośniki do rezultatów i publikacji związanych z poszczególnymi projektami 7PR, a także odnośniki do niektórych konkretnych kategorii wyników, takich jak zbiory danych i oprogramowanie, są dynamicznie pobierane z systemu OpenAIRE .

Rezultaty

Data Management Plan (DMP)

The Open Research Data Pilot will be prepared and submitted to the European Commission

Publikacje

Offline Primal-Dual Reinforcement Learning for Linear MDPs

Autorzy: G. Gabbianelli, G. Neu, N. Okolo, M. Papini
Opublikowane w: Proceedings of the Twenty-seventh International Conference on Artificial Intelligence and Statistics (AISTATS), 2024
Wydawca: Proceedings of Machine Learning Research

Scalable Representation Learning in Linear Contextual Bandits with Constant Regret Guarantees

Autorzy: Tirinzoni A.; Papini M.; Touati A.; Lazaric A.; Pirotta M.
Opublikowane w: Advances in Neural Information Processing Systems 35 (NeurIPS 2022), 2022
Wydawca: NeurIPS foundation
DOI: 10.48550/arxiv.2210.13083

Efficient Global Planning in Large MDPs via Stochastic Primal-Dual Optimization

Autorzy: Gergely Neu, Nneka Okolo
Opublikowane w: Proceedings of The 34th International Conference on Algorithmic Learning Theory (ALT 2023), 2023
Wydawca: Proceedings of Machine Learning Research

Lifting the Information Ratio: An Information-Theoretic Analysis of Thompson Sampling for Contextual Bandits

Autorzy: Gergely Neu, Julia Olkhovskaya, Matteo Papini, Ludovic Schwartz
Opublikowane w: Advances in Neural Information Processing Systems 35 (NeurIPS 2022), 2022
Wydawca: NeurIPS foundation

Nonstochastic Contextual Combinatorial Bandits

Autorzy: L. Zierahn, D. van der Hoeven, N. Cesa-Bianchi, G. Neu
Opublikowane w: Proceedings of the Twenty-sixth International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Wydawca: Proceedings of Machine Learning Research

Optimistic Information-Directed Sampling

Autorzy: G. Neu, M. Papini, L. Schwartz
Opublikowane w: Proceedings of the 36th Annual Conference on Learning Theory (COLT), 2024
Wydawca: Proceedings of Machine Learning Research

Dealing with Unbounded Gradients in Stochastic Saddle-Point Optimizaiton

Autorzy: G. Neu, N. Okolo
Opublikowane w: Proceedings of the 41st International Conference on Machine Learning (ICML), 2024
Wydawca: Proceedings of Machine Learning Research

Proximal Point Imitation Learning

Autorzy: Luca Viano, Angeliki Kamoutsi, Gergely Neu, Igor Krawczuk, Volkan Cevher
Opublikowane w: Advances in Neural Information Processing Systems 35 (NeurIPS 2022), 2022
Wydawca: NeurIPS foundation

Importance-Weighted Offline Learning Done Right

Autorzy: G. Gabbianelli, G. Neu, M. Papini
Opublikowane w: Proceedings of the 34th International Conference on Algorithmic Learning Theory (ALT), 2024
Wydawca: Proceedings of Machine Learning Research

Online learning with off-policy feedback

Autorzy: Germano Gabbianelli, Matteo Papini, Gergely Neu
Opublikowane w: Proceedings of The 34th International Conference on Algorithmic Learning Theory (ALT 2023), 2023
Wydawca: Proceedings of Machine Learning Research

First-and Second-Order Bounds for Adversarial Linear Contextual Bandits

Autorzy: J. Olkhovskaya, J. Mayo, T. van Erven, G. Neu, C.-Y. Wei
Opublikowane w: Advances in Neural Information Processing Systems 36 (NeurIPS), 2023
Wydawca: NeurIPS foundation

Optimistic Planning by Regularized Dynamic Programming

Autorzy: Antoine Moulin, Gergely Neu
Opublikowane w: International Conference on Machine Learning (ICML 2022), 2023
Wydawca: Proceedings of Machine Learning Research

Generalization bounds via convex analysis

Autorzy: Gabor Lugosi, Gergely Neu
Opublikowane w: Proceedings of Thirty Fifth Conference on Learning Theory (COLT 2022), 2022
Wydawca: Proceedings of Machine Learning Research

Adversarial Contextual Bandits Go Kernelized

Autorzy: G. Neu, J. Olkhovskaya, S. Vakili
Opublikowane w: Proceedings of the 34th International Conference on Algorithmic Learning Theory (ALT), 2024
Wydawca: Proceedings of Machine Learning Research

Smoothing policies and safe policy gradients

Autorzy: Matteo Papini; Matteo Pirotta; Marcello Restelli
Opublikowane w: Machine Learning, Numer 111, 2022, Strona(/y) 4081–4137, ISSN 1573-0565
Wydawca: Springer
DOI: 10.1007/s10994-022-06232-6

A note on regularised NTK dynamics with an application to PAC-Bayesian training

Autorzy: Clerico, Eugenio; Guedj, Benjamin
Opublikowane w: Transactions on Machine Learning Research, 2024, ISSN 2835-8856
Wydawca: Transactions on Machine Learning Research
DOI: 10.48550/arxiv.2312.13259

Wyszukiwanie danych OpenAIRE...

Rezultaty

Publikacje

Udostępnij tę stronę Udostępnij tę stronę w mediach społecznościowych

Pobierz Pobierz zawartość strony