Statements (23)
| Predicate | Object |
|---|---|
| gptkbp:instanceOf | gptkb:logic |
| gptkbp:appliesTo | gptkb:reinforcement_learning, control theory, operations research |
| gptkbp:describes | decision making over time |
| gptkbp:field | gptkb:machine_learning, decision theory, statistics |
| gptkbp:focusesOn | expected value, uncertainty, utility maximization, policies, information gathering |
| gptkbp:notableContributor | gptkb:Abraham_Wald |
| gptkbp:originatedIn | 20th century |
| gptkbp:relatedTo | gptkb:multi-armed_bandit_problem, gptkb:Markov_chain, dynamic programming, optimal stopping |
| gptkbp:studies | sequences of decisions |
| gptkbp:bfsParent | gptkb:Universal_Artificial_Intelligence:_Sequential_Decisions_based_on_Algorithmic_Probability |
| gptkbp:bfsLayer | 8 |
| https://www.w3.org/2000/01/rdf-schema#label | sequential decision theory |