Partially Observable Markov Decision Process

GPTKB entity

Statements (50)
Predicate Object
gptkbp:instanceOf Mathematical model
gptkbp:abbreviation gptkb:POMDP
gptkbp:assumes Agent has incomplete information about the state
gptkbp:component Observations
States
Actions
Transition function
Discount factor
Observation function
Reward function
gptkbp:describes Decision making under uncertainty
gptkbp:field gptkb:artificial_intelligence
Reinforcement learning
Operations research
gptkbp:formedBy Mathematical notation
gptkbp:generalizes gptkb:Markov_Decision_Process
gptkbp:hasApplication gptkb:Autonomous_vehicles
Finance
Speech recognition
Game playing
Resource management
Medical diagnosis
Dialogue systems
Robot navigation
gptkbp:hasProperty gptkb:NP-hard
PSPACE-complete
Computationally hard
https://www.w3.org/2000/01/rdf-schema#label Partially Observable Markov Decision Process
gptkbp:originatedIn 1960s
gptkbp:relatedConcept gptkb:stochastic_process
gptkb:Hidden_Markov_Model
Bayesian filtering
Belief state
gptkbp:relatedTo gptkb:Markov_Decision_Process
gptkbp:solvedBy gptkb:Monte_Carlo_methods
gptkb:Heuristic_search
gptkb:Policy_iteration
Point-based value iteration
Value iteration
gptkbp:studiedBy Researchers in AI
Researchers in control theory
Researchers in operations research
gptkbp:usedIn gptkb:robot
gptkb:Control_theory
Natural language processing
Planning
Autonomous systems
Medical decision making
gptkbp:bfsParent gptkb:Markov_Decision_Process
gptkbp:bfsLayer 6