본문 바로가기

maximum entropy framework1

Soft Actor-Critic (SAC) https://arxiv.org/abs/1801.01290 Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor Model-free deep reinforcement learning (RL) algorithms have been demonstrated on a range of challenging decision making and control tasks. However, these methods typically suffer from two major challenges: very high sample complexity and brittle convergenc arxiv.org .. 2024. 1. 27.

이전 1 다음

티스토리툴바