Musings on the Alignment Problem

Home
Archive
About
A proposal for importing society’s values
Building towards Coherent Extrapolated Volition with language models
Mar 9 • 
Jan Leike
13
6
New
Top
Community
Distinguishing three alignment taxes
The impact of different alignment taxes depends on the context
Dec 19, 2022 • 
Jan Leike
7
5
Why I’m optimistic about our alignment approach
Some arguments in favor and responses to common objections
Dec 5, 2022 • 
Jan Leike
34
10
What could a solution to the alignment problem look like?
A high-level view on the elusive once-and-for-all solution
Sep 27, 2022 • 
Jan Leike
10
7
What is inner alignment?
An explanation using the language of machine learning
May 8, 2022 • 
Jan Leike
10
A minimal viable product for alignment
Bootstrapping a solution to the alignment problem
Mar 29, 2022 • 
Jan Leike
11
Why I’m excited about AI-assisted human feedback
How to scale alignment techniques to hard tasks
Mar 29, 2022 • 
Jan Leike
15
What is the alignment problem?
My attempt at clarifying a confusing topic
Mar 29, 2022 • 
Jan Leike
11
3
Musings on the Alignment Problem

Musings on the Alignment Problem

AboutArchiveSitemap
Share this publication

Musings on the Alignment Problem

aligned.substack.com

Musings on the Alignment Problem

By Jan Leike
By registering you agree to Substack's Terms of Service, our Privacy Policy, and our Information Collection Notice
© 2023 Jan Leike
Privacy ∙ Terms ∙ Collection notice
Start WritingGet the app
Substack is the home for great writing

Our use of cookies

We use necessary cookies to make our site work. We also set performance and functionality cookies that help us make improvements by measuring traffic on our site. For more detailed information about the cookies we use, please see our privacy policy. ✖