Recent Posts

2023 in AutoRL

6 minute read

TL;DR: From combining RL with LLMs through more efficient MetaRL and updates in an environment design to classic hyperparameter optimization, these are some ...

TempoRL - Learning When to Act Permalink

less than 1 minute read

TL;DR: Jointly learning when and how to act improves sample efficiency of RL agents through better exploration and improved exploitation.