Periodic Reporting for period 1 - DESlRE (Data-Efficient Scalable Reinforcement Learning for Practical Robotic Environments)
Reporting period: 2018-04-01 to 2020-03-31
From a practical perspective, we have proposed easy-to-implement algorithms. As we discussed in recent works, one strength of our methods is its wide applicability. Many of today's learning tasks suffer from manifestations of distributional ambiguity. We believe practitioners from industry and business that wish to gain robustness in their learning or decision-making tasks can apply our kernel distributionally robust optimization algorithms.