Statements (23)
| Predicate | Object |
|---|---|
| gptkbp:instanceOf |
gptkb:machine_learning_technique
|
| gptkbp:alternativeTo |
inverse reinforcement learning
|
| gptkbp:compatibleWith |
reward function
|
| gptkbp:firstDescribed |
1989
|
| gptkbp:improves |
data augmentation
|
| gptkbp:input |
state-action pairs
|
| gptkbp:learnsFrom |
demonstrations
|
| gptkbp:limitation |
compounding errors
covariate shift |
| gptkbp:notableFor |
self-driving cars
robot manipulation video game AI |
| gptkbp:output |
gptkb:public_policy
|
| gptkbp:relatedTo |
gptkb:reinforcement_learning
supervised learning |
| gptkbp:requires |
expert demonstrations
|
| gptkbp:trainer |
minimizing prediction error
|
| gptkbp:usedIn |
robotics
autonomous driving imitation learning |
| gptkbp:bfsParent |
gptkb:DAgger
|
| gptkbp:bfsLayer |
8
|
| https://www.w3.org/2000/01/rdf-schema#label |
behavior cloning
|