0 comments
Comments sorted by top scores.
comment by Charlie Steiner · 2022-10-06T01:25:27.744Z · LW(p) · GW(p)
I'd love to claim credit for helping to boost talk about meta-preferences in the zeitgeist (regular plug for Reducing Goodhart [? · GW]).
But sadly, I think if I had actually been influential, people would be more freaking leery of reifying a "True Utility Function" for humans.