Markov Decision Process

URI: https://gptkb.org/entity/Markov_Decision_Process

GPTKB entity

Predicate	Object
gptkbp:instanceOf	gptkb:logic gptkb:stochastic_process
gptkbp:abbreviation	gptkb:MDP
gptkbp:component	gptkb:public_policy states actions rewards transition probabilities
gptkbp:field	gptkb:artificial_intelligence gptkb:reinforcement_learning decision theory operations research
gptkbp:generalizes	gptkb:Markov_chain
gptkbp:hasApplication	gptkb:machine_learning gptkb:navigation gptkb:customer_service autonomous systems control theory economics finance game theory healthcare marketing planning queueing theory robotics telecommunications resource management resource allocation supply chain management traffic control energy management scheduling inventory control maintenance planning network optimization portfolio optimization medical decision making resource scheduling
gptkbp:introducedIn	1950s
gptkbp:namedAfter	gptkb:Andrey_Markov
gptkbp:property	gptkb:Markov_property
gptkbp:relatedTo	gptkb:Partially_Observable_Markov_Decision_Process gptkb:Bellman_equation gptkb:stochastic_game
gptkbp:solvedBy	dynamic programming policy iteration value iteration
gptkbp:usedFor	modeling decision making sequential decision problems
gptkbp:bfsParent	gptkb:MDP
gptkbp:bfsLayer	5
http://www.w3.org/2000/01/rdf-schema#label	Markov Decision Process