Recent Posts

TempoRL - Learning When to Act

TL;DR: Jointly learning when and how to act improves the sample efficiency of RL agents through better exploration and improved exploitation.