Updateless Intelligence Metrics in the Multiverse

post by Squark · 2014-03-08T00:25:59.903Z · LW · GW · Legacy · 5 comments

Contents

  Problems with UIM 1.0
  UIM 2.0
    Discussion
  Generic Intelligence
  Utility Functions in the Multiverse
    Discussion
  TLDR

Followup to: Intelligence Metrics with Naturalized Induction using UDT

In the previous post I defined an intelligence metric that addresses the duality (aka naturalized induction) and ontology problems in AIXI. That model used a formalization of UDT based on Benja's model of logical uncertainty. In the current post I describe some problems with that model (UIM 1.0), propose a revised metric (UIM 2.0) that avoids them, construct a utility-independent ("generic") version of the metric, and discuss how to define utility functions on the entire multiverse.

Problems with UIM 1.0

The previous model postulated that naturalized induction uses a version of Solomonoff induction updated in the direction of an innate model N, with a temporal confidence parameter t. This entails several problems, which motivate the revised construction below.

UIM 2.0

I now propose the following intelligence metric (the formula goes first and then I explain the notation):

I_U(q) := E_T[E_D[E_L[U(Y(D)) | Q(X(T)) = q]] | N]
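To make the nesting of the three expectations and the conditioning concrete, here is a minimal Python sketch. It is not an implementation (all the quantities involved are uncomputable); the reading of the symbols follows the TLDR below: T is the process creating the agent, drawn from the Solomonoff measure conditioned on the innate model N; D is the hypothesis generating the physical universe Y, drawn from the raw Solomonoff measure; E_L is the expectation with respect to logical uncertainty; and Q(X(T)) = q conditions on the process writing the program q in the ideal universe X. Every stub sampler and toy function below (sample_T_given_N, sample_D, sample_L, X, Y, Q, U) is my own hypothetical placeholder.

# Purely illustrative Monte Carlo sketch of the nesting structure of I_U(q).
# All samplers are hypothetical stubs; their only purpose is to make the
# order of the expectations and of the conditioning explicit.
import random

def sample_T_given_N():
    """Hypothetical stub: a creation process T drawn from the Solomonoff
    measure conditioned on the innate model N (the laws of the hardware)."""
    return random.random()

def sample_D():
    """Hypothetical stub: a hypothesis D about the physical universe, drawn
    from the raw (unconditioned) Solomonoff measure."""
    return random.random()

def sample_L():
    """Hypothetical stub: a completion resolving logical uncertainty."""
    return random.random()

def X(t, l):
    """Hypothetical stub: the ideal (Cartesian) history produced by process t
    under logical completion l."""
    return (t + l) / 2

def Y(d, l):
    """Hypothetical stub: the physical history produced by hypothesis d under
    logical completion l."""
    return (d + l) / 2

def Q(x):
    """Hypothetical stub: the program that the creation process writes into
    the agent's hardware in the ideal history x."""
    return "q" if x > 0.5 else "other"

def U(y):
    """Hypothetical stub: the utility of physical history y."""
    return y

def inner(t, d, q, n=200):
    """E_L[ U(Y(D)) | Q(X(T)) = q ] for fixed t and d, by rejection sampling
    over logical completions (0 if the condition never fires in the sample)."""
    total, hits = 0.0, 0
    for _ in range(n):
        l = sample_L()
        if Q(X(t, l)) == q:
            total += U(Y(d, l))
            hits += 1
    return total / hits if hits else 0.0

def I_U(q="q", n_T=100, n_D=100):
    """E_T[ E_D[ E_L[ U(Y(D)) | Q(X(T)) = q ] ] | N ], outermost to innermost."""
    outer = 0.0
    for _ in range(n_T):            # E_T[ . | N ]
        t = sample_T_given_N()
        mid = 0.0
        for _ in range(n_D):        # E_D[ . ]
            mid += inner(t, sample_D(), q)
        outer += mid / n_D
    return outer / n_T

print(I_U())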

Discussion

Generic Intelligence

It seems conceptually desirable to have a notion of intelligence independent of the specifics of the utility function. Such an intelligence metric can be constructed in a way analogous to what I did in UIM 1.0; however, it is no longer a special case of the utility-specific metric.

Assume N consists of a machine M connected to a special storage device E. Assume further that at X-time 0, E contains a valid C-program u realizing a utility function U, but that this is the only constraint on the initial content of E imposed by N. Define

I(q) := E_T[E_D[E_L[u(Y(D); X(T)) | Q(X(T)) = q]] | N]

Here, u(Y(D); X(T)) means that we decode u from X(T) and evaluate it on Y(D). Thus utility depends both on the physical universe Y and on the ideal universe X. This means G is not precisely a UDT agent but rather a "proto-agent": only once a realization of G reads u from E does it know which other realizations of G in the multiverse (the Solomonoff ensemble from which Y is selected) should be considered the "same" agent UDT-wise.
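To isolate the single change relative to I_U, here is the corresponding modification of the sketch above (it reuses the stubs sample_L, X, Y and Q defined there): the utility program u is decoded from the ideal history X(T), i.e. read out of the storage device E, and only then evaluated on the physical history Y(D). The decode_u stand-in is, again, a hypothetical placeholder of my own.

def decode_u(x):
    """Hypothetical stand-in: read the utility program u out of the storage
    device E at X-time 0 of the ideal history x and return it as a callable.
    Here the "decoded program" is just a toy scaling of its argument."""
    scale = 1.0 + x
    return lambda y: scale * y

def inner_generic(t, d, q, n=200):
    """E_L[ u(Y(D); X(T)) | Q(X(T)) = q ] for fixed t and d: the utility
    function itself is decoded from the ideal universe, then evaluated on the
    physical one.  Reuses sample_L, X, Y, Q from the previous sketch."""
    total, hits = 0.0, 0
    for _ in range(n):
        l = sample_L()
        x = X(t, l)
        if Q(x) == q:
            u = decode_u(x)        # u comes from the ideal universe X(T)...
            total += u(Y(d, l))    # ...but is evaluated on the physical Y(D)
            hits += 1
    return total / hits if hits else 0.0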

Incidentally, this can be used as a formalism for reasoning about agents that don't know their own utility functions. I believe this has important applications in metaethics, which I will discuss in another post.

Utility Functions in the Multiverse

UIM 2.0 is a formalism that solves the diseases of UIM 1.0, at the price of losing N as the ontology for utility functions. We therefore need the utility function to be defined on the entire multiverse, i.e. on arbitrary sequences of natural numbers. I will outline a way to extend "ontology-specific" utility functions to the multiverse through a simple example.

Suppose G is an agent that cares about universes realizing the Game of Life, its utility function U corresponding to e.g. some sort of glider maximization with exponential temporal discount. Fix a specific way DC to decode any Y into a history of a 2D cellular automaton with two cell states ("dead" and "alive"). Our multiversal utility function U* assigns the value U(DC(Y)) to any Y for which DC(Y) is a legal Game of Life history. All other Ys are handled by dividing the cells of DC(Y) into cells O obeying the rules of Life and cells V violating them. We can then evaluate U on O alone (assuming it has some sort of locality) and assign V utility by some other rule.

U*(Y) is then defined to be the sum of the values assigned to O(Y) and V(Y).
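As a toy illustration of this construction, here is a short Python sketch. It assumes (my assumptions, not part of the construction above) that DC(Y) is already given as a list of 2D 0/1 grids, that the utility on the obeying cells O is a discounted count of live cells standing in for the glider-maximization U, and that each violating cell in V contributes a fixed discounted penalty, since the rule for V is left open here.

# Toy sketch of U* for the Game of Life example.  The stand-in utility on O
# and the per-cell penalty on V are hypothetical choices for illustration.

def life_step_cell(grid, i, j):
    """State the Game of Life rules dictate for cell (i, j) at the next step
    (toroidal wrapping at the grid edges)."""
    n_rows, n_cols = len(grid), len(grid[0])
    live_neighbours = sum(
        grid[(i + di) % n_rows][(j + dj) % n_cols]
        for di in (-1, 0, 1) for dj in (-1, 0, 1)
        if (di, dj) != (0, 0)
    )
    return live_neighbours == 3 or (grid[i][j] and live_neighbours == 2)

def u_star(history, gamma=0.9, violation_penalty=-1.0):
    """Sum of a discounted stand-in utility on the obeying cells O and a
    penalty on the violating cells V, as in the U* construction."""
    obeying_value, violating_value = 0.0, 0.0
    for t in range(1, len(history)):
        prev, cur = history[t - 1], history[t]
        discount = gamma ** t
        for i in range(len(cur)):
            for j in range(len(cur[0])):
                if cur[i][j] == life_step_cell(prev, i, j):
                    # cell belongs to O: credit it with the stand-in utility
                    obeying_value += discount * (1.0 if cur[i][j] else 0.0)
                else:
                    # cell belongs to V: charge the hypothetical penalty
                    violating_value += discount * violation_penalty
    return obeying_value + violating_value

# A tiny example history: a 2x2 block (a still life), with one cell flipped
# at the last step to create a violation.
block = [[0, 0, 0, 0],
         [0, 1, 1, 0],
         [0, 1, 1, 0],
         [0, 0, 0, 0]]
broken = [row[:] for row in block]
broken[1][1] = 0
print(u_star([block, block, broken]))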

Discussion

  • The construction of U* depends on the choice of DC. However, U* depends on DC only weakly: given a hypothesis D which produces a Game of Life with respect to some other low-complexity encoding, there is a corresponding hypothesis D' producing a Game of Life with respect to DC. D' is obtained from D by appending a corresponding "transcoder", and is therefore only less Solomonoff-likely than D by an O(1) factor (the complexity bookkeeping is written out after this list).
  • Since the accumulation between O and V is additive rather than e.g. multiplicative, a U*-agent doesn't behave as if it a priori expects the universe to follow the rules of Life, but it may have strong preferences about the universe actually doing so.
  • This construction is reminiscent of Egan's dust theory in the sense that all possible encodings contribute. However, here they are weighted by the Solomonoff measure.
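To spell out the complexity bookkeeping behind the first point: writing tau for the transcoder from the alternative encoding to DC, D' runs D and then transcodes its output, so

K(D') <= K(D) + K(tau) + O(1), hence 2^(-K(D')) >= 2^(-K(tau) - O(1)) * 2^(-K(D)),

i.e. D' carries at most a constant factor less Solomonoff weight than D (taking the weight of a hypothesis to be roughly 2 to the minus its description length), with the constant depending only on the pair of encodings and not on D.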

TLDR

The intelligence of a physicalist agent is defined to be the UDT-value of the "decision" to create the agent by the process creating the agent. The process is selected randomly from a Solomonoff measure conditional on obeying the laws of the hardware on which the agent is implemented. The "decision" is made in an "ideal" universe in which the agent is Cartesian, but the utility function is evaluated on the real universe (raw Solomonoff measure). The interaction between the two "universes" is purely via logical conditional probabilities (acausal).

If we want to discuss intelligence without specifying a utility function up front, we allow the "ideal" agent to read a program describing the utility function from a special storage immediately after "booting up".

Utility functions in the Tegmark level IV multiverse are defined by specifying a "reference universe" together with an encoding of it, and then extending a utility function defined on the reference universe to encodings which violate the reference laws: the utility of the portion of the universe that obeys the reference laws is summed with some function of the space-time shape of the violation.

5 comments


comment by Fivehundred · 2014-03-10T19:48:25.720Z · LW(p) · GW(p)

tl;dr?

Replies from: Squark
comment by Squark · 2014-03-11T22:46:02.075Z · LW(p) · GW(p)

The intelligence of a physicalist agent is defined to be the UDT-value of the "decision" to create the agent by the process creating the agent. The process is selected randomly from a Solomonoff measure conditional on obeying the laws of the hardware on which the agent is implemented. The "decision" is made in an "ideal" universe in which the agent is Cartesian, but the utility function is evaluated on the real universe (raw Solomonoff measure). The interaction between the two "universes" is purely via logical conditional probabilities (acausal).

If we want to discuss intelligence without specifying a utility function up front, we allow the "ideal" agent to read a program describing the utility function from a special storage immediately after "booting up".

Utility functions in the Tegmark level IV multiverse are defined by specifying a "reference universe" together with an encoding of it, and then extending a utility function defined on the reference universe to encodings which violate the reference laws: the utility of the portion of the universe that obeys the reference laws is summed with some function of the space-time shape of the violation.

Replies from: cousin_it
comment by cousin_it · 2014-03-12T15:31:56.760Z · LW(p) · GW(p)

For what it's worth, even though you reference some of my work, I've been procrastinating on reading your post for several days now because it seems so difficult. It feels like I need to memorize an alphabet's worth of abbreviations to figure out what's going on and what's actually new. The tl;dr helps a little, but I'm still mostly lost. Since there might be valuable stuff in there, maybe you could try to reformulate parts of it, using more standard terminology from logic or CS, or even just more standard LW jargon? Maybe then more people would be able to participate.

Replies from: Squark, Squark
comment by Squark · 2014-03-22T20:22:16.585Z · LW(p) · GW(p)

I've written down the next version of the formalism without introducing any notation until the very last moment, when I must do so in order to write the result in the form of a formula. I would be extremely grateful if you would read it and tell me what you think.

comment by Squark · 2014-03-12T20:41:42.133Z · LW(p) · GW(p)

Maybe you can ask a few specific questions?

Some of the background appeared in the previous posts, so I was too lazy to repeat it here, which might have been a mistake.

To recap what I'm doing here, regardless of how I do it:

Legg and Hutter defined an intelligence metric for agents which are input-output mappings. I.e., each input-output map gets a number which says how intelligent it is. AIXI is the maximally intelligent agent by this definition. Of course this intelligence metric is completely Cartesian in nature.
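For reference, the Legg-Hutter formula is (roughly)

Upsilon(pi) := Sum_mu 2^(-K(mu)) * V_mu(pi),

where the sum ranges over computable environments mu, K(mu) is the Kolmogorov complexity of mu, and V_mu(pi) is the expected total reward the agent pi obtains in mu; AIXI is the policy maximizing Upsilon.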

The Legg-Hutter metric is defined for any mapping, regardless of the computing resources needed to evaluate it or even if it is uncomputable (like AIXI). Of course there is nothing stopping us from restricting to mappings that can be realized by a given computing model. For example, we can assume the agent to be a universal Turing machine augmented by special "input" registers and map some bits from the tape to the output. This way we get an intelligence number for every program that we can write into the universal Turing machine.

I define a different, physicalist way to assign "intelligence numbers" to such programs. The construction is inspired by UDT; indeed, most of the ideas already exist in UDT. However, UDT is a different type of mathematical object: it speaks of decision algorithms and/or the values of decisions made by an algorithm, whereas I speak of quantifying the intelligence of a program for a fixed "robot" (an abstract computing device with input/output channels).