Statements (51)
Predicate | Object |
---|---|
gptkbp:instanceOf |
Person
|
gptkbp:affiliation |
gptkb:University_of_Alberta
|
gptkbp:author |
gptkb:Andrew_G._Barto
|
gptkbp:awards |
ACM Fellow
IEEE Fellow |
gptkbp:birthPlace |
gptkb:Canada
|
gptkbp:birthYear |
1958
|
gptkbp:contribution |
gptkb:Deep_Reinforcement_Learning
gptkb:Dynamic_Programming Q-learning Actor-Critic Methods Exploration Strategies Learning from Demonstration Hierarchical Reinforcement Learning Inverse Reinforcement Learning Multi-Agent Reinforcement Learning Reward Shaping Finance Applications of RL Benchmarking RL Algorithms OpenAI Gym Contributions Real-Time Decision Making in RL Robotics Applications of RL Policy_Gradient_Methods Function_Approximation_in_RL Autonomous_Systems_Applications_of_RL Computer_Vision_Applications_of_RL Education_Applications_of_RL Ethics_in_AI_and_RL Explainability_in_RL Game_Theory_in_RL Generalization_in_RL Healthcare_Applications_of_RL Natural_Language_Processing_Applications_of_RL RL_in_Real-World_Applications RL_in_Simulations RL_in_Video_Games Robustness_in_RL Sample_Efficiency_in_RL Stochastic_Control Transfer_Learning_in_RL |
gptkbp:field |
Artificial Intelligence
Machine Learning |
https://www.w3.org/2000/01/rdf-schema#label |
Richard Sutton
|
gptkbp:influencedBy |
gptkb:Herbert_A._Simon
gptkb:John_McCarthy gptkb:Marvin_Minsky |
gptkbp:knownFor |
Reinforcement Learning
|
gptkbp:publishes |
Reinforcement Learning: An Introduction
|
gptkbp:researchInterest |
Neuroscience
Markov Decision Processes Temporal Difference Learning |