Advantage Actor-Critic

GPTKB entity

Statements (27)
Predicate Object
gptkbp:instanceOf reinforcement learning algorithm
gptkbp:abbreviation gptkb:A2C
gptkbp:actorRole updates policy
gptkbp:appliesTo gptkb:Atari_games
robotics
continuous control tasks
gptkbp:category on-policy algorithm
gptkbp:component actor
literary criticism
gptkbp:criticRole estimates value function
https://www.w3.org/2000/01/rdf-schema#label Advantage Actor-Critic
gptkbp:improves actor-critic method
gptkbp:introducedIn 2016
gptkbp:objective expected return
gptkbp:purpose policy optimization
value estimation
gptkbp:relatedTo gptkb:A3C
policy gradient methods
value-based methods
gptkbp:usedBy gptkb:OpenAI_Baselines
gptkb:Stable_Baselines
gptkbp:usedIn deep reinforcement learning
gptkbp:uses temporal difference learning
advantage function
stochastic policy
gptkbp:bfsParent gptkb:A2C
gptkbp:bfsLayer 7