[D] SOTA in continuous control
Is the state of the art in continuous control environments (e.g. Mujoco) still model-free algorithms like PPO/SAC? Are there any promising/competitive model-based or hierarchical methods?
submitted by /u/arltep
[link] [comments]