What does Functional Decision Theory say to do in imperfect Newcomb situations?

post by Daniel_Eth · 2022-05-07T22:26:33.469Z · LW · GW · No comments

This is a question post.

Contents

  Answers
    2 Stephen Bennett (Previously GWS)
    1 JBlack

Imagine a variation of Newcomb's problem where the Oracle is correct 99% of the time (say, half of people 1-box, half 2-box, and the Oracle is correct 99% of the time on each). What would FDT imply is the correct course of action then?

Answers

answer by Stephen Bennett (Previously GWS) · 2022-05-07T22:34:44.738Z · LW(p) · GW(p)

The expected value under FDT would be:

  • 1-boxing: 0.99 * big prize + 0.01 * 0

  • 2-boxing: 0.99 * small prize + 0.01 * (small prize + big prize)

Making a decision based on these values will depend on the specifics of the problem (how much bigger is the big prize than the small prize?) and on your circumstances (what is your utility function with respect to the big and small prizes?).
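As a minimal sketch of that calculation, here it is with the standard Newcomb payoffs plugged in ($1,000,000 in the opaque box, $1,000 in the transparent one; the specific numbers are an assumption for illustration):

```python
# Expected value of each policy under FDT, assuming the Oracle's 99% accuracy
# applies to whichever policy you (and hence its model of you) actually follow.
BIG = 1_000_000    # opaque box: filled iff the Oracle predicts you 1-box (assumed payoff)
SMALL = 1_000      # transparent box: always present (assumed payoff)
ACCURACY = 0.99    # Oracle's probability of predicting your policy correctly

ev_one_box = ACCURACY * BIG + (1 - ACCURACY) * 0
ev_two_box = ACCURACY * SMALL + (1 - ACCURACY) * (SMALL + BIG)

print(f"EV(1-box) = {ev_one_box:,.0f}")  # 990,000
print(f"EV(2-box) = {ev_two_box:,.0f}")  # 11,000
```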

comment by Daniel_Eth · 2022-05-07T22:48:18.925Z · LW(p) · GW(p)

Hmm, does this not depend on how the Oracle is making its decision? I feel like there might be versions of this that look more like the smoking lesion problem – for instance, what if the Oracle is simply using a (highly predictive) proxy to determine whether you'll 1-box or 2-box? (Say, imagine if people from cities 1-box 99% of the time, and people from the country 2-box 99% of the time, and the Oracle is just looking at where you're from).

comment by Stephen Bennett (GWS) · 2022-05-07T23:41:29.647Z · LW(p) · GW(p)

It seems like this might become a discussion of Aleatory vs Epistemic Uncertainty. I like this way of describing the distinction between the two (from here - pdf):

In distinguishing between aleatory variability and epistemic uncertainty it can be helpful to think how you would describe, in words, the parameter under consideration. If the parameter sometimes has one value and sometimes has another value, then it has aleatory variability. That is, the variability is random. If the parameter always has either one value or another, but we are not sure which it is, then the parameter has epistemic uncertainty.

I believe that the differences between classical decision theory and FDT only occur in the context of aleatory uncertainty (although in some formulations of Newcomb's paradox there's no actual uncertainty). That is, if you are in an epistemically uncertain environment, then FDT and classical decision theory will agree on all problems (hopefully by saying this I can cause someone to come up with a juicy counterexample).

In your example, it is unclear to me what sort of uncertainty the problem possesses because I don't know enough about the oracle.

In the simple example where a quantum coin with a 99% chance of coming up heads is flipped to determine whether the oracle gives the right answer or the wrong answer, the answer I gave above is right: use expected value under the assumptions of FDT. Classical decision theory will lead you to 2-box, and that would lower your expected gains.
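As a concrete check of that claim, here is a minimal simulation sketch of the quantum-coin version; the payoffs, and the assumption that the Oracle's underlying prediction tracks whatever policy you actually follow, are mine for illustration:

```python
import random

def average_winnings(one_box: bool, trials: int = 100_000, accuracy: float = 0.99) -> float:
    """Average payoff for an agent whose policy the Oracle models, where a quantum
    coin corrupts the Oracle's prediction 1% of the time."""
    BIG, SMALL = 1_000_000, 1_000  # assumed standard Newcomb payoffs
    total = 0
    for _ in range(trials):
        # The Oracle's raw prediction matches your policy; the coin flips it to
        # the wrong answer with probability 1 - accuracy.
        predicted_one_box = one_box if random.random() < accuracy else not one_box
        big_box = BIG if predicted_one_box else 0
        total += big_box if one_box else big_box + SMALL
    return total / trials

print("1-boxing:", average_winnings(True))   # ~990,000
print("2-boxing:", average_winnings(False))  # ~11,000
```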

In your example relying on demographic information, it will depend a bit on what sorts of information count as "demographic" in nature. If you are, in this moment, by reading this comment on LessWrong, forming the sort of self that will result in you 1-boxing or 2-boxing, and that information is also an input to this sort of oracle, then I encourage you to 1-box on the oracle you had in mind.

answer by JBlack · 2022-05-08T02:52:22.308Z · LW(p) · GW(p)

The exact expected payoffs depend on the unknown details of how the Oracle achieves its 99% accuracy. Here is an inconvenient scenario that is consistent with the given description:

Suppose that 99% of the population don't even think about the decision; they just follow inherent preferences that are nearly equally prevalent. The Oracle gets almost all of these correct, but fails on 1% of inherent one-boxers, giving them nothing instead of $1,000,000. In the remaining cases, where people do actually think about it, the Oracle is always wrong, and everyone who thinks about the situation knows this.
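A back-of-the-envelope check that this scenario is at least roughly consistent with the stated accuracy (the specific population split below is my own assumption):

```python
# Assumed split: 10,000 people, 99% with inherent preferences divided evenly,
# 1% who think it through and (since the Oracle is always wrong about them) two-box.
population = 10_000
inherent_one_boxers = 4_950   # Oracle wrong about 1% of these
inherent_two_boxers = 4_950   # Oracle right about all of these
thinkers = 100                # Oracle wrong about all of these

errors = 0.01 * inherent_one_boxers + thinkers
print(f"overall accuracy ≈ {1 - errors / population:.1%}")  # ≈ 98.5%, close to the stated 99%
```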

Since you are actually thinking about it, you're one of the people for whom the Oracle's prediction is always wrong. If you end up taking one box, then it will be empty. If you take both boxes, you will find $1,001,000. In this scenario CDT, EDT, and FDT all agree that you should take both boxes.

In many other scenarios, possibly even "most" in some ill-defined sense, FDT says you should take one box.

No comments
