0 comments

Comments sorted by top scores.

comment by Charbel-Raphaël (charbel-raphael-segerie) · 2023-01-29T11:38:26.610Z · LW(p) · GW(p)

There is also davidad's Open Agency Architecture

https://www.alignmentforum.org/posts/pKSmEkSQJsCSTK6nH/an-open-agency-architecture-for-safe-transformative-ai [AF · GW]

comment by Mitchell_Porter · 2023-02-14T05:28:50.179Z · LW(p) · GW(p)

Nice to see someone who wants to directly tackle the big problem. Also nice to see someone who appreciates June Ku's work.

comment by Roman Leventov · 2023-05-06T15:00:37.804Z · LW(p) · GW(p)

the core motivation for formal alignment, for me, is that a working solution is at least eventually aligned: there is an objective answer to the question "will maximizing this with arbitrary capabilities produce desirable outcomes?" where the answer does not depend, at the limit, on what does the maximization.

I don't know about other proposals because I'm not familiar with them, but Methaethical AI actually describes the machinery of the agent, hence "the answer" does depend "on what does the maximisation".