maximum entropy framework

Soft Actor-Critic (SAC)
https://arxiv.org/abs/1801.01290
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Model-free deep reinforcement learning (RL) algorithms have been demonstrated on a range of challenging decision making and control tasks. However, these methods typically suffer from two major challenges: very high sample complexity and brittle convergenc ..
2024. 1. 27.