Musings on the Alignment Problem

Home
Archive
About

Sitemap - 2023 - Musings on the Alignment Problem

Combining weak-to-strong generalization with scalable oversight

Self-exfiltration is a key dangerous capability

A proposal for importing society’s values

© 2025 Jan Leike
Privacy ∙ Terms ∙ Collection notice
Start writingGet the app
Substack is the home for great culture

Share