Risks from Learned Optimization in Advanced Machine Learning Systems

GPTKB entity

Statements (27)
Predicate Object
gptkbp:instanceOf academic paper
gptkbp:arXivID 1906.01820
gptkbp:author gptkb:Evan_Hubinger
Chris van Merwijk
Vladimir Mikulik
Joar Skalse
Scott Garrabrant
gptkbp:citation widely cited in the AI safety community
over 300 citations (as of 2024)
gptkbp:describes inner alignment
outer alignment
base optimizer
mesa-optimizer
https://www.w3.org/2000/01/rdf-schema#label Risks from Learned Optimization in Advanced Machine Learning Systems
gptkbp:influenced AI alignment research
gptkbp:language English
gptkbp:proposes taxonomy of optimization risks
gptkbp:publicationYear 2019
gptkbp:publishedIn gptkb:arXiv
gptkbp:topic gptkb:machine_learning
optimization
AI safety
mesa-optimization
alignment problem
gptkbp:url https://arxiv.org/abs/1906.01820
gptkbp:bfsParent gptkb:Evan_Hubinger
gptkbp:bfsLayer 8