Posts
Comments
Comment by
Kayla Brown (kayla-brown) on
Order Matters for Deceptive Alignment ·
2023-02-16T12:37:00.175Z ·
LW ·
GW
I found this post fascinating and although I myself am not engaged within the x-risk space relative to artificial intelligence, the delivery of the information skillfully explored some of my key discomforts with the seemingly high risk of deceptive alignment.