How Do You Quantify [Physics Interfacing] Real World Capabilities?

post by DragonGod · 2022-06-14T14:49:47.264Z · LW · GW · 1 comment

This is a question post.

Contents

      Disclaimer
  Introduction
  Humans
    Quantifying Human Interfacing Capabilities
      Defence of the Method
  Human Infrastructure
    Quantifying Human Infrastructure Interfacing Capabilities
      Defence of the Method
  Physics
  My Question
        At a very high level of abstraction (consider the methods I proposed for the other interfaces) how would you quantify the ability of an AI System to influence bare reality?
  Desiderata for an Answer
None
1 comment

Disclaimer

Sorry for the length of this "question". The post in which I cover the preliminary context hasn't been published yet (4,000+ words and I'm < 40% through my outline).


 

Introduction

I have done some thinking of how one might quantify the capabilities of an AI system to influence the "real world" environment.

I have identified three broad interfaces for executing capabilities:

  1. Humans (and groups thereof)
  2. Human infrastructure
  3. Physics/Bare reality


 

Humans

The AI could influence the world by convincing humans to do things that it wants.

Some relevant skills:

The idea here is influence the real world via influencing individual humans or groups of humans

 

Quantifying Human Interfacing Capabilities

To a first approximation, a function to quantify human capabilities might be a positive (aggregate) function of something like: “the likelihood that the AI could convince a(n arbitrary) human to perform a(n arbitrary) act” (let's call this the likelihood of successful persuasion [LSP]). The function may apply suitable modifications to LSP such as:

 

Defence of the Method

The human interface is just the AI influencing the world via influencing humans. A naive way to quantify the AI's ability to influence humans is something like an aggregate of LSP. Upon thinking on it for a few seconds, you'd like to modify the LSP in some ways.

This is vague, because I am not trying to design a concrete measure of "ability to influence humans", at this stage of my thinking, I'm fine with a high level abstraction of what such a measure might look like.


 

Human Infrastructure

The AI could influence the world via the levers of human civilisation. A non-exhaustive list of relevant human infrastructure follows:

 

An example of influencing the world via human infrastructure is hacking into a web server. I'm not going to attempt to list the relevant classes of skills for interacting with human infrastructure because they are far too many. 

Some ways of interfacing with human infrastructure

 

Quantifying Human Infrastructure Interfacing Capabilities

To a first approximation, you want something like "economic power". One might try to capture related concepts like social power, but I think "economic power" will capture it adequately.

Some ways to operationalise economic power:

At the level of abstraction at which I'm thinking of things, I find his vague notion satisfactory.

 

Defence of the Method

My intuition for quantifying "human infrastructure interfacing capabilities" in economic terms is something like:


 

Physics

AKA the "bare interface".

The AI can also attempt to manipulate its environment directly without using the humans or human infrastructure interfaces. Any way of interacting with the real world that doesn't use the "human"/"human infrastructure" proxies — interfacing with bare reality itself — are interacting via the "physics" interface.

One way of an AI system interacting with a human through the physics interface is shooting them down with a lethal autonomous weapon.

The other natural sciences (chemistry, biology, geology, meteorology etc.) apply here (but they seem to be emergent physics/physics at higher levels of abstraction, so I still think of it as the "physics interface" [but I'm willing to change/drop the name if people are sufficiently opposed to it]).

Some relevant skills to influence physics include:

 

My Question

At a very high level of abstraction (consider the methods I proposed for the other interfaces) how would you quantify the ability of an AI System to influence bare reality?

 

Desiderata for an Answer

A good abstraction for quantifying the physics interfacing capabilities of an AI system should have the following properties:

 

Money is very good a measure for quantifying capability deployed via the human infrastructure interface because it satisfies the above criteria (when reinterpreted to fit human infrastructure).

So for clues on where to look, what fits the analogy: "money but for physics"? What's the currency with which arbitrary physical capability can be purchased?

Answers

1 comment

Comments sorted by top scores.

comment by DragonGod · 2022-06-14T16:28:25.174Z · LW(p) · GW(p)

To be clear, I do have a few ideas, but they're hypotheses I privileged [? · GW], so I don't want to poison your thinking by mentioning them.

Maybe in three days (or whenever I finish the mega essay), I'll lay them out later and explore why I'm dissatisfied with them.