Some Thoughts on AI Alignment: Using AI to Control AI

eigenvalue

Some Thoughts on AI Alignment: Using AI to Control AI

post by eigenvalue · 2024-06-21T17:44:19.263Z · LW · GW · 1 comments

This is a link post for https://github.com/Dicklesworthstone/some_thoughts_on_ai_alignment

1 comment

Recent news has caused me to think through some questions about AI alignment, so I collected my thoughts here. While I'm sure a lot of this stuff isn't new, I haven't seen all these ideas presented together in one place. I think that some of the approaches that are used in designing decentralized systems can also be useful in constructing alignment systems, so I've tried to do that here. Anyway, I welcome feedback on my ideas.

1 comments

Comments sorted by top scores.

comment by Raemon · 2024-06-21T17:44:43.469Z · LW(p) · GW(p)

FYI, these sorts of posts generally get more readership/responses if they copy over the text of the post here.

Some Thoughts on AI Alignment: Using AI to Control AI

Contents

1 comments