Statements (23)
| Predicate | Object |
|---|---|
| gptkbp:instanceOf |
gptkb:reinforcement_learning_algorithm
|
| gptkbp:approach |
hierarchical reinforcement learning
|
| gptkbp:author |
gptkb:Ilya_Sutskever
gptkb:Peter_Abbeel gptkb:John_Schulman gptkb:Xi_Chen gptkb:Yan_Duan |
| gptkbp:citation |
gptkb:Meta-Learning_Shared_Hierarchies
gptkb:arXiv_preprint_arXiv:1710.09767 |
| gptkbp:field |
gptkb:machine_learning
gptkb:reinforcement_learning meta-learning |
| gptkbp:fullName |
gptkb:Meta_Learning_Shared_Hierarchies
|
| gptkbp:introduced |
gptkb:OpenAI
gptkb:Meta-Learning_Shared_Hierarchies |
| gptkbp:publicationYear |
2017
|
| gptkbp:purpose |
improve sample efficiency in RL
learn reusable sub-policies |
| gptkbp:url |
https://arxiv.org/abs/1710.09767
|
| gptkbp:uses |
policy gradient methods
|
| gptkbp:bfsParent |
gptkb:Miaoli_Senior_High_School
|
| gptkbp:bfsLayer |
6
|
| https://www.w3.org/2000/01/rdf-schema#label |
MLSH
|