Statements (53)
| Predicate | Object |
|---|---|
| gptkbp:instanceOf |
gptkb:logic
gptkb:stochastic_process |
| gptkbp:abbreviation |
gptkb:MDP
|
| gptkbp:component |
gptkb:public_policy
states actions rewards transition probabilities |
| gptkbp:field |
gptkb:artificial_intelligence
gptkb:reinforcement_learning decision theory operations research |
| gptkbp:generalizes |
gptkb:Markov_chain
|
| gptkbp:hasApplication |
gptkb:machine_learning
gptkb:navigation gptkb:customer_service autonomous systems control theory economics finance game theory healthcare marketing planning queueing theory robotics telecommunications resource management resource allocation supply chain management traffic control energy management scheduling inventory control maintenance planning network optimization portfolio optimization medical decision making resource scheduling |
| gptkbp:introducedIn |
1950s
|
| gptkbp:namedAfter |
gptkb:Andrey_Markov
|
| gptkbp:property |
gptkb:Markov_property
|
| gptkbp:relatedTo |
gptkb:Partially_Observable_Markov_Decision_Process
gptkb:Bellman_equation gptkb:stochastic_game |
| gptkbp:solvedBy |
dynamic programming
policy iteration value iteration |
| gptkbp:usedFor |
modeling decision making
sequential decision problems |
| gptkbp:bfsParent |
gptkb:MDP
|
| gptkbp:bfsLayer |
5
|
| https://www.w3.org/2000/01/rdf-schema#label |
Markov Decision Process
|