Should you refuse this bet in Technicolor Sleeping Beauty?
post by Ape in the coat · 2024-04-04T08:55:09.206Z · LW · GW · 13 commentsThis is a question post.
Contents
Answers 11 faul_sname 2 Dagon None 13 comments
This is the question for people who didn't read my latest post [LW · GW]. Please, try to answer it yourself without spoiling the solution, and then post it in the comments with your reasoning and whether you consider yourself a halfer or a thirder in regular Sleeping Beauty problem.
Technicolor Sleeping Beauty experiment goes mostly as regular Sleeping Beauty experiment:
The participant is put to sleep on Sunday. Then the coin is tossed. If it's Heads the participant will be awakened on Monday. If it's Tails the participant will be awaken both on Monday and on Tuesday, and between these awakenings their memories will be erased. Therefore, while awakening on Tuesday the participants doesn't remember whether they awakened on Monday or not, and they can never be sure on which day (Monday or Tuesday) their current awakening is happening.
The only difference is that in Technicolor, the walls of the room that the participant awakens in changle their color every day from red to blue and vice versa. The initial color is determined randomly: 1/2 that it's red and 1/2 that it's blue.
While awakened during the experiment you are asked whether you would like to take a specific once per experiment bet:
You may bet that the coin is Tails at 2:3 odds. That is: if you bet 300$ and the coin is indeed Tails you win 200$. The bet will be resolved on Wednesday, after the experiment has ended.
You may take this bet only once per experiment and one agreement is enough. If you have two awakenings in this experiment and agreed on any of them - the bet counts as taken. If you agreed on both of them, the bet counts as taken only once.
For reference, in regular Sleeping Beauty problem utility neutral betting odds for once per experiment bet are 1:1, regardless of whether you are a halfer or a thirder, so taking such bet would be a bad idea.
Answers
A probability is an attribute of your model of the world, not an attribute of the world of the world itself. In this case, we're looking for a world model which cares about the outcome of this particular bet. Specifically, on Sunday, before the experiment starts, the participant wants to choose a policy which produces a good outcome on Wednesday.
On Sunday, the participant determines that the probabilities are
- 0.5 they will be woken on Monday and offered the bet (value -$300 if they take it)
- 0.5 they will be woken on Monday and offered the bet, and then woken again on Tuesday and offered the bet. (value +$200 if they take it)
If this were the entire situation, and the participant had to have a single deterministic policy, they should obviously not take the bet (0.5 to accept the bet once and lose $300, 0.5 to accept the bet twice and win $200, so total EV is -$50).
If the participant had access to a source of randomness which would return a random number r
between 0 and 1 on demand, with the results of that randomness independent between times of getting a random number, they could do better, by saying that they will take the bet if r <= p
on that roll, for some p
they determined on Sunday. Now there are 6 cases:
- 0.5*(1-p) they will be woken on Monday and see
r > p
(don't take bet, $0) - 0.5*p they will be woken on Monday and see
r <= p
(take bet, -$300) - 0.25*(1-p)*(1-p) they will be woken on Monday and see
r > p
, and then woken again on Tuesday and seer > p
again (don't take bet either time, $0) - 0.25*(1-p)*p they will be woken on Monday and see
r > p
, and then woken again on Tuesday and seer <= p
(take bet on Tuesday, +$200) - 0.25*p*(1-p) they will be woken on Monday and see
r <= p
, and then woken again on Tuesday and seer > p
(take bet on Monday, +$200) - 0.25*p*p they will be woken on Monday and see
r <= p
, and then woken again on Tuesday and seer <= p
again (take bet on both Monday and Tuesday, only one of which counts, +$200)
The degenerate case of p=1
is the "always take the bet" case above, EV=-$50. p=0
is "never take the bet", EV=$0. But in between those is a curve
It turns out the maximal EV is obtained when p=0.25, which gives an EV of +$6.25.
If we want to do better than that, we need a way for the participant to ensure that their decision on Tuesday is not only uncorrelated with their decision on Monday, but anticorrelated with their decision on Monday. The only possible source of anticorrelation would be the color of the walls. So the participant can now make a decision on Sunday "with what probability do I take the bet if the walls are red" (p_r
) and separately "with what probability do I take the bet if the walls are blue" (p_b
). There are now 12 cases to work through (would be 20, but "two wakes, room red Mon, room red Tue" is not possible, nor is "two wakes, room blue Mon, room blue Tue").
Computing the EV for every possible p_r
and p_b
we see that there are two maxima, one where the participant always bets if in a red room and never bets in a blue room, and the other where the participant always bets if in a blue room and never bets in a red room. Both of these maxima have an EV of +$25.
... what was the philosophical question, again?
I like this question a lot. My base position is "It's a modeling choice - either one can make appropriate betting choices, so it's unclear what true probability even means in correlated-existence cases". Though in private, I'll admit that 1/3 is simpler most of the time, and in public I argue for 1/2 fairly often because die-hard thirders seem to be so common.
The difficulty with betting analyses is how to handle the duplicate days, where Beauty's experience is doubled on tails, but you want that to be irrelevant to the wager (to put focus on the fact that it's a coin flip, so you have to be indifferent on Wednesday). Saying "you must agree or be put to death" is one way, which leads to a static decision that's indifferent to heads or tails, which is justified by either halfer or thirder. Saying "bets are off if they disagree" doesn't work, since only tails can disagree. Saying "Monday takes precedence" kind of obviates the whole excercise.
Your mechanism of "days are ORed together" would normally show the weirdness of the thirder position, because a random choice by Beauty would lead to taking the bet 50% of the time for heads and 75% for tails. But painting the walls gives a coordination mechanism between Monday and Tuesday, without revealing which day it actually is.
I think "bet if the wall is red" works here. If it's heads (50% of the time, to an outside observer, and to the bet payoff), you'll take the bet 50% of the time and lose $200. If it's tails, you'll take the bet 100% of the time and win $300. 0.5*0.5*-200 + 0.5*300 = +100.
I don't think it sheds any light on the halfer vs thirder debate - this encourages an outside view, because the payoff is outside and the coordination is outside (which color to pick in any world on any day). But I guess that means it supports my position that it's a modeling choice rather than a true distinction.
13 comments
Comments sorted by top scores.
comment by Viktor Rehnberg (viktor.rehnberg) · 2024-04-04T15:43:43.464Z · LW(p) · GW(p)
The possible observer instances and their probability are:
- Heads 50 %
- Red room 25 %
- Blue room 25 %
- Tails 50 %
- Red room 50 % (On Monday or Tuesday)
- Blue room 50 % (On Monday or Tuesday)
If I choose a strategy "bet only if blue" (or equivalentely "bet only if red") then expected value for this strategy is so I choose to follow this strategy.
I don't remember what halfer and thirder were or what position I consider to be correct.
Replies from: malentropicgizmo, Ape in the coat↑ comment by Malentropic Gizmo (malentropicgizmo) · 2024-04-04T16:14:44.511Z · LW(p) · GW(p)
What is the probability of tails given it's Monday for your observer instances?
Replies from: viktor.rehnberg↑ comment by Viktor Rehnberg (viktor.rehnberg) · 2024-04-08T07:25:11.753Z · LW(p) · GW(p)
Good formulation. "Given it's Monday" can have two different meanings:
- you learn that you will only be awoken on Monday, then it's 50%
- you awake assign 1/3 probability to each instance and then make the update
So it turns out to 50 % for both but it wasn't initially obvious to me that these two ways would have the same result.
↑ comment by Ape in the coat · 2024-04-04T16:07:07.475Z · LW(p) · GW(p)
Well done!
Halfer and thirder are about answer to the initial question of the Sleeping Beauty problem: What is the probability that the coin landed tails when you awake in the experiment?
Replies from: viktor.rehnberg↑ comment by Viktor Rehnberg (viktor.rehnberg) · 2024-04-08T07:14:53.752Z · LW(p) · GW(p)
I'd say
comment by avturchin · 2024-07-09T12:18:26.352Z · LW(p) · GW(p)
I think this is a toy problem about how to cooperate with own copies, especially in the cases where it is acausal cooperation in which different copies have to act differently. Similar to "Death in Damascus". Random generator is a good thing to diverge, but naturally flipping process is better.
comment by Yair Halberstadt (yair-halberstadt) · 2024-04-04T13:26:23.855Z · LW(p) · GW(p)
I don't see why the colour makes a difference...
So I refuse the bet same as in regular sleeping beauty.
↑ comment by Yair Halberstadt (yair-halberstadt) · 2024-04-04T13:55:06.892Z · LW(p) · GW(p)
Taking a thirder position:
The reason you don't usually accept a bet which you can only accept at most once is that you have a 2/3s credence the coin is tails, but given that the coin is tails there's a 50% chance the bet is fake (since you've already accepted it the first time you woke up). This halves your expected earnings, so the bet is net negative.
But this gives you a way to break the symmetry. You only take the bet if the room is blue, which means your expected takings are now = 1/3 * -$200 + 2/3 * $100 = $0. So you still don't take the bet, but would if slightly better odds are available.
Taking a halfer position:
When you wake up and see the wall is blue, your credence that the coin was tails must remain %50 (by conservation of expected evidence - you knew the wall would be either red or blue before you opened your eyes). So why should you take the bet?
↑ comment by Markvy · 2024-04-06T14:57:02.473Z · LW(p) · GW(p)
I think this is much easier to analyze if you think about your plans before the experiment starts, like on Sunday. In fact, let’s pretend we are going to write down a game plan on Sunday, and we will simply consult that plan wherever we wake up and do what it says. This sidesteps the whole half vs third debate, since both sides agree about how things look better the experiment begins.
Furthermore, let’s say we’re going to participate in this experiment 100 times, just so I don’t have to deal with annoying fractions. Now, consider the following tentative game plan: agree to the bet if and only if the room is red. Let’s see what happens. Out of 100 experiments, 50 will result in just one awakening. In 25 of them you will refuse the bet (costing you zero dollars), and in the other twenty five you will accept, which costs you $7500. So far so good. (Actually pretty bad, since we just lose lots of money.) The other 50 will result in two awakenings. Here, we don’t need to worry about probabilities anymore. It is guaranteed we will see a red room once and a blue room once. Thus, we will agree to the bet once, and thus the bet will be in effect. So we will win $200 fifty times, for a total of $10k. Once we subtract what we lost when the coin landed for one awakening, our net profit is $2500, or an average of $25 dollars per experiment.
comment by Malentropic Gizmo (malentropicgizmo) · 2024-04-04T15:08:12.392Z · LW(p) · GW(p)
You may bet that the coin is Tails at 2:3 odds. That is: if you bet 200$ and the coin is indeed Tails you win 300$. The bet will be resolved on Wednesday, after the experiment has ended.
I think the second sentence should be: "That is: if you bet 300$ and the coin is indeed Tails you win 200$."
Replies from: Dagon, Ape in the coat↑ comment by Dagon · 2024-04-04T16:44:35.651Z · LW(p) · GW(p)
Oh, good point! I'm not sure if the intent was "pays 1:2, meaning you end up with +300 or -200 total" or "pays 2:3, which indeed means a $300 bet ends with +500 or -300).
Replies from: malentropicgizmo↑ comment by Malentropic Gizmo (malentropicgizmo) · 2024-04-04T16:47:57.882Z · LW(p) · GW(p)
I wasn't sure either, but looked at the previous post to check which one is intended.
↑ comment by Ape in the coat · 2024-04-04T15:15:32.222Z · LW(p) · GW(p)
Yes, you are correct, thanks!