Gwern's "Why Tool AIs Want to Be Agent AIs: The Power of Agency"

habryka4

Gwern's "Why Tool AIs Want to Be Agent AIs: The Power of Agency"

post by habryka (habryka4) · 2019-05-05T05:11:45.805Z · LW · GW · 3 comments

This is a link post for https://www.gwern.net/Tool-AI

3 comments

I somehow hadn't read this post until now, so I am posting this here in case I am not the only one (and I wasn't able to find a previous linkpost for it). Relevant to relatively recent discussion on AI-as-a-service, but also just good as a broad reference.

3 comments

Comments sorted by top scores.

comment by Donald Hobson (donald-hobson) · 2019-05-05T21:40:00.046Z · LW(p) · GW(p)

Agenty AI's can be well defined mathematically. We have enough understanding of what an agent is that we can start dreaming up failure modes. Most of what we have for tool ASI is analogies to systems to stupid fail catastrophically anyway, and pleasant imaginings.

Some possible programs will be tool ASI's, much as some programs will be agent ASI's. The question is, what are the relative difficulties in humans building, and benefits of, each kind of AI. Conditional on friendly AI, I would consider it more likely to be an agent than a tool, with a lot of probability on "neither", "both" and "that question isn't mathematically well defined". I wouldn't be surprised if tool AI and corrigible AI turned out to be the same thing or something.

There have been attempts to define tool-like behavior, and they have produced interesting new failure modes. We don't have the tool AI version of AIXI yet, so its hard to say much about tool AI.

comment by David Scott Krueger (formerly: capybaralet) (capybaralet) · 2021-07-12T04:54:26.047Z · LW(p) · GW(p)

I wonder if gwern has changed their view on RL/meta-learning at all given GPT, scaling laws, and current dominance of training on big offline datasets. This would be somewhat in line with skybrian's comment on Hacker News: https://news.ycombinator.com/item?id=13231808

Replies from: capybaralet

↑ comment by David Scott Krueger (formerly: capybaralet) (capybaralet) · 2021-07-12T04:57:58.913Z · LW(p) · GW(p)

I see that is has references to papers from this year, so presumably has been updated to reflect any changes in view.

Gwern's "Why Tool AIs Want to Be Agent AIs: The Power of Agency"

Contents

3 comments