Partially Observable Markov Decision Processes

GPTKB entity

Statements (51)
Predicate Object
gptkbp:instanceOf mathematical framework
gptkbp:A set of actions
gptkbp:abbreviation gptkb:POMDP
gptkbp:describes decision making under uncertainty
gptkbp:formalizedAs tuple (S, A, O, T, Z, R, γ)
gptkbp:generalizes gptkb:Markov_Decision_Processes
gptkbp:hasApplication gptkb:dialogue_systems
gptkb:game_AI
autonomous vehicles
resource management
diagnosis
robot navigation
gptkbp:hasComponent policy
reward function
transition function
belief state
observation function
https://www.w3.org/2000/01/rdf-schema#label Partially Observable Markov Decision Processes
gptkbp:includes actions
reward function
transition probabilities
hidden states
observations
observation probabilities
gptkbp:introduced gptkb:Anthony_R._Cassandra
gptkb:Michael_L._Littman
gptkb:Leslie_Pack_Kaelbling
gptkbp:introducedIn 1998
gptkbp:limitation gptkb:curse_of_dimensionality
computational complexity
curse of history
gptkbp:O set of observations
gptkbp:R reward function
gptkbp:relatedTo gptkb:Hidden_Markov_Models
gptkb:Bayesian_networks
gptkb:Reinforcement_Learning
Partially Observable Stochastic Games
gptkbp:S set of states
gptkbp:solvedBy gptkb:Monte_Carlo_methods
policy iteration
value iteration
point-based value iteration
gptkbp:T state transition probability function
gptkbp:usedIn gptkb:artificial_intelligence
gptkb:reinforcement_learning
decision theory
robotics
gptkbp:Z observation probability function
gptkbp:γ discount factor
gptkbp:bfsParent gptkb:Markov_Decision_Processes
gptkbp:bfsLayer 7
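
Illustrative sketch (not part of the GPTKB statements): the components listed above, the belief state, transition function T, and observation function Z, combine in the standard Bayesian belief update b'(s') ∝ Z(o | s', a) · Σ_s T(s' | s, a) · b(s). The Python below is a minimal sketch of that update on a toy two-state problem. The function name `belief_update`, the nested-list layout of `T` and `Z`, and all numeric values are assumptions chosen for illustration only.

```python
def belief_update(belief, action, observation, T, Z):
    """Return the posterior belief after taking `action` and seeing `observation`.

    belief:        list of probabilities over hidden states
    T[a][s][s2]:   probability of moving from state s to s2 under action a
    Z[a][s2][o]:   probability of observing o after landing in s2 under action a
    """
    n = len(belief)
    # Predict: push the current belief through the transition function.
    predicted = [
        sum(T[action][s][s2] * belief[s] for s in range(n)) for s2 in range(n)
    ]
    # Correct: weight each predicted state by the observation likelihood.
    unnormalized = [Z[action][s2][observation] * predicted[s2] for s2 in range(n)]
    total = sum(unnormalized)
    if total == 0.0:
        raise ValueError("observation has zero probability under this belief")
    # Normalize so the posterior sums to 1.
    return [p / total for p in unnormalized]


# Toy example (hypothetical numbers).
# States: 0 and 1. Action 0 = "listen", which leaves the hidden state unchanged.
T = [[[1.0, 0.0], [0.0, 1.0]]]
# Observations: 0 and 1. Listening reports the true state 85% of the time.
Z = [[[0.85, 0.15], [0.15, 0.85]]]

b0 = [0.5, 0.5]                                   # uniform prior
b1 = belief_update(b0, action=0, observation=0, T=T, Z=Z)
b2 = belief_update(b1, action=0, observation=0, T=T, Z=Z)
print(b1, b2)
```

Starting from a uniform prior, one observation of 0 moves the belief to roughly 0.85 on state 0, and a second matching observation sharpens it further. The agent never sees the hidden state directly; it acts on this belief, which is why a POMDP policy maps beliefs (rather than states) to actions.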