Prisoner's Dilemma on game show Golden Balls

post by atorm · 2012-04-21T00:31:48.741Z · LW · GW · Legacy · 32 comments

I found this to be a very interesting method of dealing with a modified Prisoner's Dilemma. In this situation, if both players cooperate they split a cash prize, but if only one defects, he gets the entire prize. The difference from the normal Prisoner's Dilemma is that if both defect, neither gets anything, so a player gains nothing by defecting if he knows his opponent will defect; he merely has the option to hurt him out of spite. Watch and see how one player deals with this.
http://www.youtube.com/watch?v=S0qjK3TWZE8
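
To make the difference from the normal Prisoner's Dilemma concrete, here is a minimal sketch that compares the two payoff tables. The jackpot is normalised to 1.0 and the classic Prisoner's Dilemma numbers (3, 0, 5, 1) are illustrative assumptions, not figures from the show; `gain_from_stealing` is a hypothetical helper name.

```python
# Payoffs are (row player, column player).
# Assumption: jackpot normalised to 1.0; classic PD values are illustrative only.
SPLIT, STEAL = "split", "steal"

golden_balls = {
    (SPLIT, SPLIT): (0.5, 0.5),
    (SPLIT, STEAL): (0.0, 1.0),
    (STEAL, SPLIT): (1.0, 0.0),
    (STEAL, STEAL): (0.0, 0.0),  # both defect: nobody gets anything
}

classic_pd = {
    (SPLIT, SPLIT): (3, 3),
    (SPLIT, STEAL): (0, 5),
    (STEAL, SPLIT): (5, 0),
    (STEAL, STEAL): (1, 1),  # both defect: still better than being the lone cooperator
}

def gain_from_stealing(game, opponent_move):
    """Row player's payoff from stealing minus the payoff from splitting."""
    return game[(STEAL, opponent_move)][0] - game[(SPLIT, opponent_move)][0]

for name, game in (("Golden Balls", golden_balls), ("classic PD", classic_pd)):
    for opponent_move in (SPLIT, STEAL):
        print(name, "| opponent plays", opponent_move,
              "| gain from stealing:", gain_from_stealing(game, opponent_move))
```

Against an opponent who steals, the gain from stealing is exactly zero in the show's version and strictly positive in the classic one, so defecting is only weakly dominant here.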

32 comments

Comments sorted by top scores.

comment by cousin_it · 2012-04-25T18:33:08.028Z · LW(p) · GW(p)

This game has multiple Nash equilibria and cheap talk is allowed, so correlated equilibria are possible. Here's how you implement a correlated equilibrium if your opponent is smart enough:

"We have two minutes to talk, right? I'm going to ask you to flip a coin (visibly to both of us) at the last possible moment, the exact second where we must cease talking. If the coin comes up heads, I promise I'll cooperate, you can just go ahead and claim the whole prize. If the coin comes up tails, I promise I'll defect. Please cooperate in this case, because you have nothing to gain by defecting, and anyway the arrangement is fair, isn't it?"
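
A rough sketch of why the coin-flip proposal holds together: given the recommendation attached to each coin outcome, neither player can strictly gain by deviating on his own. The prize is normalised to 1.0, and the names `RECOMMENDATION`, `deviation_gains` and `expected_payoffs` are made up for this illustration.

```python
SPLIT, STEAL = "split", "steal"

# Payoffs are (proposer, opponent); assumption: prize normalised to 1.0.
PAYOFF = {
    (SPLIT, SPLIT): (0.5, 0.5),
    (SPLIT, STEAL): (0.0, 1.0),
    (STEAL, SPLIT): (1.0, 0.0),
    (STEAL, STEAL): (0.0, 0.0),
}

# Coin outcome -> (move recommended to proposer, move recommended to opponent).
RECOMMENDATION = {"heads": (SPLIT, STEAL), "tails": (STEAL, SPLIT)}

def other(move):
    return STEAL if move == SPLIT else SPLIT

def deviation_gains():
    """Payoff change for each player from ignoring his recommendation."""
    gains = {}
    for coin, (a, b) in RECOMMENDATION.items():
        base = PAYOFF[(a, b)]
        gains[(coin, "proposer")] = PAYOFF[(other(a), b)][0] - base[0]
        gains[(coin, "opponent")] = PAYOFF[(a, other(b))][1] - base[1]
    return gains

def expected_payoffs():
    """Average payoff per player over a fair coin."""
    outcomes = [PAYOFF[rec] for rec in RECOMMENDATION.values()]
    return tuple(sum(o[i] for o in outcomes) / len(outcomes) for i in range(2))

print(deviation_gains())
print(expected_payoffs())
```

No deviation gain is positive, and each player expects half the prize. The player who loses the flip has a gain of exactly zero from deviating, which is why the speech has to ask him to "please cooperate" rather than rely on self-interest alone.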

Replies from: Alsadius
comment by Alsadius · 2012-04-26T02:05:41.033Z · LW(p) · GW(p)

This neglects diminishing marginal utility - few people would actually prefer a 50% chance at everything to a 100% chance at half of it. It does solve the coordination problem, though. Interesting approach.
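
A small sketch of the diminishing-marginal-utility point, assuming a square-root utility function and a hypothetical prize of 10,000 (both chosen purely for illustration):

```python
import math

def expected_utility(lottery, utility):
    """lottery is a list of (probability, amount) pairs."""
    return sum(p * utility(amount) for p, amount in lottery)

prize = 10_000.0                          # hypothetical prize, not the show's figure
coin_flip = [(0.5, prize), (0.5, 0.0)]    # 50% chance at everything
sure_half = [(1.0, prize / 2)]            # 100% chance at half

def linear(x):
    return x                              # risk-neutral: only the expected amount matters

concave = math.sqrt                       # assumed concave utility (diminishing marginal utility)

print(expected_utility(coin_flip, linear), expected_utility(sure_half, linear))
print(expected_utility(coin_flip, concave), expected_utility(sure_half, concave))
```

With linear utility the two options tie; with the concave one the sure half comes out ahead, which is the preference described above.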

comment by buybuydandavis · 2012-04-21T03:42:35.526Z · LW(p) · GW(p)

he merely has the option to hurt him out of spite.

Merely? Never underestimate the power of Spite.

comment by Sniffnoy · 2012-04-22T04:32:04.829Z · LW(p) · GW(p)

The changed payoff matrix makes this unlike the Prisoner's Dilemma even without the addition of communication; more like a restricted bargaining game. One noteworthy difference from the Prisoner's Dilemma is that this game lacks a pure Nash equilibrium.

Edit: Apparently not quite; see below.

Replies from: Pfft
comment by Pfft · 2012-04-23T01:06:19.059Z · LW(p) · GW(p)

The usual definition of Nash equilibrium requires only ≤, not <, so Defect-Defect, Cooperate-Defect and Defect-Cooperate are Nash equilibria (and pure), but they are not strict Nash equilibria (the < version, sometimes loosely called "strong"). You want the ≤ definition, because games need not have strict Nash equilibria, even if you allow mixed strategies.

(Apparently the game is called "weak prisoner's dilemma" in the literature).
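
The ≤ versus < distinction can be checked by brute force. This is only a sketch: the prize is normalised to 1.0, and `is_equilibrium` and `pure_equilibria` are names invented for the example. The `strict=True` flag switches to the < definition, usually called a strict equilibrium.

```python
from itertools import product

SPLIT, STEAL = "split", "steal"
MOVES = (SPLIT, STEAL)

# Payoffs are (row player, column player); assumption: prize normalised to 1.0.
PAYOFF = {
    (SPLIT, SPLIT): (0.5, 0.5),
    (SPLIT, STEAL): (0.0, 1.0),
    (STEAL, SPLIT): (1.0, 0.0),
    (STEAL, STEAL): (0.0, 0.0),
}

def is_equilibrium(a, b, strict=False):
    """Weak (<=) test by default; with strict=True every deviation must strictly lose."""
    current = PAYOFF[(a, b)]
    deviations = [PAYOFF[(alt, b)][0] - current[0] for alt in MOVES if alt != a]
    deviations += [PAYOFF[(a, alt)][1] - current[1] for alt in MOVES if alt != b]
    if strict:
        return all(gain < 0 for gain in deviations)
    return all(gain <= 0 for gain in deviations)

def pure_equilibria(strict=False):
    return [(a, b) for a, b in product(MOVES, repeat=2) if is_equilibrium(a, b, strict)]

print(pure_equilibria())             # the three profiles in which at least one player steals
print(pure_equilibria(strict=True))  # empty: every one of them has a zero-gain deviation
```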

Replies from: Sniffnoy
comment by Sniffnoy · 2012-04-23T02:01:44.248Z · LW(p) · GW(p)

Oops, didn't realize that.

comment by Clarity1992 · 2012-04-21T10:23:54.143Z · LW(p) · GW(p)

I love the bit at the end where Ibrahim (market trader) says it's "the hardest money I've ever had to work for" and Nick (charity worker) jokes "he obviously hasn't worked in the charity sector to try and get money", then the look on Nick's face when Ibrahim says he's going to respray his yacht!

I felt that Nick displayed a good mix of hot and cold rationality.

comment by b1shop · 2012-04-21T14:20:17.632Z · LW(p) · GW(p)

The next contestant needs to say:

"I'm going to choose steal. If you choose split, I'll give you 25 percent after the show. I promise."

Replies from: ArisKatsaris
comment by ArisKatsaris · 2012-04-21T16:33:33.454Z · LW(p) · GW(p)

I don't think that's gonna work even a tenth as well as a promise of 50 percent. Promises work on the basis of the kind of honor that's also correlated to concepts of justice and fairness.

Once someone has already effectively proclaimed themselves to be unfair and dishonorable, their promises would seem worthless. At that point the other contestant would be even more likely than before to choose "steal" for the purposes of retribution.

comment by RomeoStevens · 2012-04-21T02:47:32.707Z · LW(p) · GW(p)

WTF why did he (the precommitter) choose split after all?

The WHOLE point is to change it from a difficult decision with close outcomes to one where split is the only choice with positive utility (the chance that the other person will keep their word).

Replies from: ArisKatsaris
comment by ArisKatsaris · 2012-04-21T03:07:41.169Z · LW(p) · GW(p)

WTF why did he (the precommitter) choose split after all?

If the other guy (Ibrahim) had chosen "steal", the supposed precommitter (Nick) would not have gotten anything either way, under any scenario.
If Ibrahim had chosen "split", it saved time to have the gameshow hosts divide the money between them rather than to gift him half of it afterwards.

So Nick's whole ploy was to effectively scare Ibrahim into saying "split" -- by making him think that, contrary to normal expectations, it was saying "steal" that would ensure he would get nothing. Once Nick had convinced him of that, there was no longer any need to say "steal": Nick had either managed to convince him or he hadn't.

Nick sacrifices credibility for future claimed precommitments of course.

Replies from: AlexMennen
comment by AlexMennen · 2012-04-21T06:49:53.222Z · LW(p) · GW(p)

Nick sacrifices credibility for future claimed precommitments of course.

He sacrifices credibility in future threats against people, but maintains credibility in future promises to act in others' benefit just as much as if he had decided to steal and then give Ibrahim half the money. This latter credibility is probably much more useful in most real situations.

Replies from: RomeoStevens
comment by RomeoStevens · 2012-04-26T03:35:27.877Z · LW(p) · GW(p)

no no no.

The moment I found out I was going to be on this show I would obtain two notarized contracts.

When the time comes to deliberate I whip out the first contract. It states that if I choose "split" I must donate $10k to the KKK + any prize money I get (the $10k at least is held in trust).

Then I ask my opponent to split and whip out a second contract stating that any and all prize money I receive is going 100% to medical research. His choice now has nothing to do with money for me or him, only with whether the television studio keeps the money or it goes to medical research.

I hope this is novel enough to land me a talk show appearance where I pimp my ebook on using cognitive science and game theory to improve your life.

Replies from: fmgn
comment by fmgn · 2017-07-19T01:43:00.329Z · LW(p) · GW(p)

And then your opponent still steals, gets all the money, and nothing goes to medical research. woopty doo.

Replies from: arundelo
comment by arundelo · 2017-07-19T04:35:57.954Z · LW(p) · GW(p)

No, Romeo chooses steal. If his opponent also chooses steal (in spite of Romeo's credible commitment to choosing steal himself), the opponent does not get any money.

comment by Tuxedage · 2012-04-21T17:10:45.222Z · LW(p) · GW(p)

To be a little technical, this is not actually a Prisoner's Dilemma, because they are allowed to communicate. The whole point of the Prisoner's Dilemma is that they cannot communicate, and thus must choose to cooperate or defect based upon their knowledge of the Nash equilibrium alone.

Although this is honestly quite an interesting solution to this kind of problem. I'll be using it the next time I'm offered a situation similar to this.

Replies from: AShepard
comment by AShepard · 2012-04-22T15:48:38.547Z · LW(p) · GW(p)

To be even more technical, "Prisoner's Dilemma" is actually used as a generic term in game theory. It refers to the set of two-player games with this kind of payoff matrix (see here). The classic Prisoner's Dilemma also adds in the inability to communicate (as well as a bunch of backstory which isn't relevant to the math), but not all Prisoner's Dilemmas need to follow that pattern.

Replies from: DavidAgain
comment by DavidAgain · 2012-04-22T20:48:10.397Z · LW(p) · GW(p)

In game theory terms, I'm not sure communication would do much for a one-off prisoner's dilemma.

comment by Raemon · 2012-04-21T04:08:45.467Z · LW(p) · GW(p)

Anyone watch this show regularly, and know how Split/Steal normally plays out?

Replies from: MileyCyrus
comment by MileyCyrus · 2012-04-21T06:20:46.557Z · LW(p) · GW(p)

Someone actually wrote a paper on it.

comment by trlkly · 2012-04-25T05:36:23.744Z · LW(p) · GW(p)

I can say, without hindsight interfering, that this strategy would not have worked on me, because I can explain exactly what I was thinking as it happened.

You see, when I see someone alter the rules of a game, my instinct is that they are trying to do so for their own gain, and thus are not altruistic. Thus I immediately assumed the promise was a lie (which was right), and that he would not be splitting the money with me (which was wrong).

The question then becomes rather simple. My choices are to choose SPLIT, receive $0, and reward the treachery, or choose STEAL, still receive $0, and punish the treachery. Obviously, the latter is more valuable to me.

Now, in the short time required, I did not have time to check if my assumptions were correct. But let's say they aren't. The most likely way for me to be wrong would not be that he wasn't lying, but that he was going to choose SPLIT. Well, that's still a winning outcome for me. And if I feel guilty for winning all the money, I can always split after the fact with him. So that's not a problem either. The only option that is a possible problem is if he's telling the entire truth. But I see this as highly unlikely, as what does he have to gain from splitting after the fact rather than just using the balls?

I honestly was surprised that this worked. I actually thought the other guy was foolish for choosing SPLIT until the reveal. As I do not know the other possible solutions I cannot say the first guy's solution was rational, but I am fairly confident in saying the second guy's decision was not.

Replies from: RomeoStevens
comment by RomeoStevens · 2012-04-26T03:36:57.445Z · LW(p) · GW(p)

Split is not 0. It is worth the probability that he will give you money out of gratitude, plus the probability that he is lying and will actually choose split.
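
A back-of-the-envelope sketch of that calculation, with the prize normalised to 1.0. The two belief parameters are hypothetical inputs you would have to estimate yourself, and the function name is made up for the example.

```python
def expected_payoffs(prize, p_opponent_splits, p_shares_after_steal):
    """Expected winnings for each of my moves, given my beliefs about the opponent.

    p_opponent_splits:     belief that he secretly picks split despite his announcement
    p_shares_after_steal:  belief that, having stolen from a splitter, he hands over half
    """
    p_opponent_steals = 1.0 - p_opponent_splits
    ev_split = (p_opponent_splits * prize / 2
                + p_opponent_steals * p_shares_after_steal * prize / 2)
    ev_steal = p_opponent_splits * prize  # steal vs steal pays nothing
    return {"split": ev_split, "steal": ev_steal}

# Assuming the promise is a lie and he will certainly steal: both moves pay 0.
print(expected_payoffs(1.0, 0.0, 0.0))

# Illustrative beliefs: 10% he secretly splits, 60% he shares after stealing.
print(expected_payoffs(1.0, 0.1, 0.6))
```

Under these assumptions split beats steal whenever the chance that he secretly splits is smaller than the chance that he steals and then shares; if you are sure he will never share, steal is never worse.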

comment by wedrifid · 2012-04-21T04:07:23.594Z · LW(p) · GW(p)

Watch and see how one player deals with this.

Wow. That guy is a loony! I'd have walked away with a lot of money in that game (in the other guy's shoes).

Replies from: ShardPhoenix, Will_Newsome
comment by ShardPhoenix · 2012-04-21T09:05:05.817Z · LW(p) · GW(p)

Why would you choose steal here, if you actually believed he had precommitted to it? There's a decent enough chance he would honor his word. I doubt you would gain much, if anything, from appearing spiteful on TV.

Replies from: wedrifid
comment by wedrifid · 2012-04-21T17:51:48.873Z · LW(p) · GW(p)

Why would you choose steal here, if you actually believed he had precommitted to it? There's a decent enough chance he would honor his word. I doubt you would gain much, if anything, from appearing spiteful on TV.

I'd gain all of the money for a start. You are not going to have much luck convincing me that 'steal' is irrational here given that it has better overall strategic properties (than this guy's ploy) and happens to be the winning move in this case.

I doubt you would gain much, if anything, from appearing spiteful on TV.

Choosing 'steal' vs 'steal' doesn't seem particularly spiteful to me.

Replies from: Sketch, ShardPhoenix
comment by Sketch · 2012-04-23T01:24:37.217Z · LW(p) · GW(p)

Welcome to the land of Hindsight Bias. Enjoy your stay.

http://en.wikipedia.org/wiki/Hindsight

Replies from: wedrifid
comment by wedrifid · 2012-04-23T01:55:57.888Z · LW(p) · GW(p)

Welcome to the land of Hindsight Bias. Enjoy your stay.

What on earth are you trying to say? There is no instance of hindsight bias here.

Replies from: ArisKatsaris
comment by ArisKatsaris · 2012-04-23T02:16:16.514Z · LW(p) · GW(p)

I think he's saying that your statement "I'd gain all of the money for a start" only works if you knew that Nick was going to "split".

You know that in hindsight, but you wouldn't have known it at the time.

Replies from: wedrifid
comment by wedrifid · 2012-04-23T04:17:03.682Z · LW(p) · GW(p)

I think he's saying that your statement "I'd gain all of the money for a start" only works if you knew that Nick was going to "split".

Prior to the actual decisions it is of course merely one possibility of several - but that is enough for there to be a clear reason apart from 'spite' to make the move. The significance of the other-shares outcome actually occurring is inasmuch as it denies any "stealing is spiteful" advocates the inevitable fallback of claiming that the 'steal could win' option is unrealistic and that the word of the ultimatum guy should be accepted at face value. I.e., it is a rejection of the premise of the preceding comment.

comment by ShardPhoenix · 2012-04-22T01:29:55.126Z · LW(p) · GW(p)

You seem to be working on the assumption that you knew or strongly suspected that he would actually choose "split" despite apparently pre-committing to "steal". If you believed him to any significant extent it is clearly better to choose split and hope he keeps his word (unless for some reason you think this makes you look bad enough that you'd rather willingly give up an expected large fraction of 6000 pounds instead).

comment by Will_Newsome · 2012-04-21T06:14:00.299Z · LW(p) · GW(p)

Presumably Nick wouldn't have tried to split with you, because he'd learned about your personality earlier in the game or was able to gauge your responses as he discussed his certain intent to steal. Nick's ploy was especially good because it put Ibrahim in an unexpected situation where it's a lot harder to appear genuine while simultaneously thinking through optimal strategy, thus giving Nick unusually reliable information about whether he could expect Ibrahim to actually try to split or not.

ETA: Retracted because I'm not sure this is relevant to User:wedrifid's point.

Replies from: wedrifid
comment by wedrifid · 2012-04-21T06:21:37.822Z · LW(p) · GW(p)

Presumably Nick wouldn't have tried to split with you, because he'd learned about your personality earlier in the game or was able to gauge your responses as he discussed his certain intent to steal. Nick's ploy was especially good because it put Ibrahim in an unexpected situation where it's a lot harder to appear genuine while simultaneously thinking through optimal strategy, thus giving Nick unusually reliable information about whether he could expect Ibrahim to actually try to split or not.

I'm not sufficiently impressed with Nick's decision making that I'd be willing to assume he would be able to distinguish my response from Ibrahim's to any remarkable degree. It's possible that he could, in which case he gets points for incidentally being good at reading people but loses points because he still gets no money.

The best that can be said for the gambit is that it made good television.