gptkbp:instanceOf
|
machine learning paradigm
|
gptkbp:application
|
autonomous vehicles
robotics
resource management
game playing
recommendation systems
|
gptkbp:challenge
|
credit assignment problem
exploration-exploitation tradeoff
sample efficiency
stability and convergence
|
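The exploration-exploitation tradeoff listed above is commonly handled with an epsilon-greedy action-selection rule. The sketch below is a minimal illustration only; the names q_values and epsilon are assumptions for the example, not taken from the source.

import random

def epsilon_greedy(q_values, epsilon):
    # q_values: estimated action values for the current state.
    # With probability epsilon, explore with a random action;
    # otherwise exploit the current greedy estimate.
    if random.random() < epsilon:
        return random.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda a: q_values[a])

# Example: mostly exploit, occasionally explore.
action = epsilon_greedy([0.1, 0.5, 0.2], epsilon=0.1)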
gptkbp:conference
|
gptkb:AAAI
gptkb:ICML
gptkb:NeurIPS
gptkb:IJCAI
|
gptkbp:field
|
gptkb:artificial_intelligence
|
gptkbp:firstMajorBookPublished
|
1998
|
gptkbp:focusesOn
|
learning by trial and error
|
gptkbp:goal
|
maximize cumulative reward
|
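In the standard discounted formulation, "maximize cumulative reward" is usually made precise as maximizing the expected return; a common textbook form (assumed here, not quoted from the source) is:

G_t = \sum_{k=0}^{\infty} \gamma^{k} r_{t+k+1}, \qquad \max_{\pi} \; \mathbb{E}_{\pi}[G_t], \qquad 0 \le \gamma < 1

where \gamma is the discount factor and \pi is the agent's policy.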
gptkbp:hasSubfield
|
deep reinforcement learning
hierarchical reinforcement learning
inverse reinforcement learning
model-based reinforcement learning
model-free reinforcement learning
multi-agent reinforcement learning
|
https://www.w3.org/2000/01/rdf-schema#label
|
reinforcement learning
|
gptkbp:involves
|
policy
environment
states
actions
reward function
value function
agent
|
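The entities listed above (agent, environment, states, actions, policy, reward) fit together in the standard interaction loop. The sketch below is a minimal, self-contained Python illustration; CoinFlipEnv and RandomAgent are invented for the example and are not part of any named library.

import random

class CoinFlipEnv:
    # Toy environment: reward 1 if the action matches a hidden coin flip.
    def reset(self):
        return 0  # single dummy state

    def step(self, action):
        coin = random.randint(0, 1)
        reward = 1.0 if action == coin else 0.0
        return 0, reward, False  # next state, reward, done

class RandomAgent:
    # Uniform random policy over two actions.
    def act(self, state):
        return random.randint(0, 1)

env, agent = CoinFlipEnv(), RandomAgent()
state = env.reset()
total_reward = 0.0
for t in range(100):                        # trial-and-error interaction
    action = agent.act(state)               # policy: state -> action
    state, reward, done = env.step(action)  # environment returns a reward
    total_reward += reward
print("cumulative reward:", total_reward)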
gptkbp:notableBook
|
gptkb:Reinforcement_Learning:_An_Introduction
|
gptkbp:notableContributor
|
gptkb:Andrew_Barto
gptkb:Richard_S._Sutton
|
gptkbp:notableFor
|
gptkb:Monte_Carlo_methods
gptkb:Actor-Critic
gptkb:SARSA
gptkb:Deep_Q-Network_(DQN)
|
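As a reference point for the algorithms listed above, the on-policy SARSA update (named for the tuple (s, a, r, s', a')) is typically written as follows; this is the standard textbook form, not quoted from the source:

Q(s_t, a_t) \leftarrow Q(s_t, a_t) + \alpha \left[ r_{t+1} + \gamma\, Q(s_{t+1}, a_{t+1}) - Q(s_t, a_t) \right]

where \alpha is the step size and \gamma the discount factor. Deep Q-Networks replace the table Q with a neural network and regress toward an analogous temporal-difference target.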
gptkbp:relatedTo
|
gptkb:Q-learning
Markov decision process
dynamic programming
policy gradient methods
temporal difference learning
|
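Q-learning and temporal difference learning, both listed above, can be illustrated with a short tabular sketch. The function and variable names below are assumptions for the example; the update itself is the standard off-policy TD rule.

from collections import defaultdict

def q_learning_update(Q, s, a, r, s_next, n_actions, alpha=0.1, gamma=0.99):
    # Off-policy TD update: bootstrap on the greedy value of the next state.
    best_next = max(Q[(s_next, a2)] for a2 in range(n_actions))
    td_error = (r + gamma * best_next) - Q[(s, a)]
    Q[(s, a)] += alpha * td_error

# Example: one update on a Q-table initialized to zero.
Q = defaultdict(float)
q_learning_update(Q, s=0, a=1, r=1.0, s_next=0, n_actions=2)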
gptkbp:usedBy
|
gptkb:DeepMind
gptkb:AlphaGo
gptkb:OpenAI_Five
|
gptkbp:uses
|
rewards and punishments
|
gptkbp:bfsParent
|
gptkb:artificial_intelligence
gptkb:machine_learning
gptkb:Theory_of_Machine_Learning
|
gptkbp:bfsLayer
|
4
|