Statements (23)
Predicate | Object |
---|---|
gptkbp:instanceOf | gptkb:logic |
gptkbp:appliesTo | gptkb:reinforcement_learning, control theory, operations research |
gptkbp:describes | decision making over time |
gptkbp:field | gptkb:machine_learning, decision theory, statistics |
gptkbp:focusesOn | expected value, uncertainty, utility maximization, policies, information gathering |
https://www.w3.org/2000/01/rdf-schema#label | sequential decision theory |
gptkbp:notableContributor | gptkb:Abraham_Wald |
gptkbp:originatedIn | 20th century |
gptkbp:relatedTo | gptkb:multi-armed_bandit_problem, Markov chain, dynamic programming, optimal stopping |
gptkbp:studies | sequences of decisions |
gptkbp:bfsParent | gptkb:Universal_Artificial_Intelligence:_Sequential_Decisions_based_on_Algorithmic_Probability |
gptkbp:bfsLayer | 8 |