The 2024 Petrov Day Scenario

post by Ben Pace (Benito), Raemon · 2024-09-26T08:08:32.495Z · LW · GW · 82 comments

Contents

  Update: We will be starting 10-15 minutes late due to technical difficulties.
None
83 comments

Today we honor the actions of Stanislav Petrov (1939 – 2017) once again.

Half an hour past midnight on September 26, 1983, he saw the first apparent launch on his computer monitor in a glass-walled room on the top floor of the Ballistic Missile Early Warning System (BMEWS) command and control post. The warning system was by now showing five missile launches in the U.S., headed toward the Soviet Union.

"The main computer wouldn't ask me [what to do] - it was made so that it wouldn't even ask. It was specially constructed in such a way that no one could affect the system's operations." All that was up to Petrov was analyzing the available information and either saying the alarm was false or giving the computer the go-ahead, as per the directive he himself wrote.

"I just couldn't believe that just like that, all of a sudden, someone would hurl five missiles at us. Five missiles wouldn't wipe us out. The U.S. had not five, but a thousand missiles in battle readiness." It just didn't seem like any scenario considered by military intelligence before.

The second thought on Petrov's mind every time he was on duty was this:

"I imagined if I'd assume the responsibility for unleashing the third World War - and I said, no, I wouldn't."

The Man Who Saved the World Finally Recognized

Today, in our annual Petrov Day celebration, between the hours of 12pm and 6pm PT today, we shall play out a similar situation. Over 300 users have opted-in, and two sides have been formed: EastWrong and WestWrong.

Around 100 users were offered the chance to take active player roles throughout today, and 12 accepted.[1]

Their 5 Generals each have the ability to fire nukes at the other side. If one side is nuked, the victorious side wins 1,000 karma apiece for their 5 generals, 25 karma apiece for their citizens, and LessWrong frontpage will be decorated in their honor for a week. The losing side's Generals lose 300 karma each and their citizens lose 50 karma each.

But if the other side nukes back, mutual destruction occurs. Not only do both sides' generals lose 300 karma and the citizens lose 50 karma, but furthermore all 300+ Generals, Citizens and Petrovs are unable to access the LessWrong frontpage for 48 hours, replacing it with a mushroom cloud; and the Generals have a small mushroom cloud badge next to their username for a month, reminding the world of what they did.

(And what if no nukes are fired? Both side's Generals receive 100 karma, and the citizens are left in peace with no change in karma.)

What about Petrov? Well, the Generals are not given access to the sensors that detect whether a nuke is incoming. Only Stanislav Petrov of East Wrong (and Stanley Peterson of West Wrong) has access to the sensors, and can send a simple message of "ALL CLEAR" or "INCOMING NUKES" up the chain of command to the Generals.

Unfortunately, the sensors are faulty, and will reliably show some number of nukes incoming regardless of the underlying truth. They will regularly show random numbers of nukes incoming, weighted to the higher end if nuclear war has actually begun.

Hopefully he will get it right.

What about the Citizens? Well, in nuclear war, citizens don't have much of a role to play. Here, the one thing you can do is heckle. You can comment here where your Generals can read, you can send them DMs, you can give them game theory advice, and you can let them know what you'll think of them after their civilian names are published on Friday (if you are still alive that is).

You also have insight into the negotiations between the two sides. You see, the two sides have some communication channels.

  1. The War Rooms. The East Generals and West Generals have a LessWrong dialogue each open to discuss strategy within their team (i.e. 2x dialogues each with 5 generals in).
  2. The Diplomatic Channels. The Generals have a LessWrong dialogue open with each other (i.e. 1x dialogues with 10 generals in).

The Citizens and Petrov / Peterson can read The Diplomatic Channels. The Petrovs will have to use this to inform their sense of how likely a nuclear attack actually is.

And that's it. By 6pm today, it'll all be over.

Good luck to all, may your web forum and karma be up come tomorrow.

Please note: the LessWrong team will read all dialogues and DMs that the Generals and Petrovs are involved in. If you communicate with them, we will read it.

The following messages (potentially with slight modifications) will be sent to the Generals and Petrovs at about 11:30 tomorrow. Click to expand.

Message for the Generals

Welcome to the Cold Blogging War, between East Wrong and West Wrong!

You are one of 5 Generals for West Wrong. Here's what that means.

  • You have the ability to unilaterally fire nukes at East Wrong. (As does every General.) 
  • 90 minutes after you send the nukes, your opposing side will die and the game will end. (They may nuke you in this window.) If you fire nukes without getting nuked, you and all of your fellow Generals will gain 1,000 karma.
  • If you get nuked, then you and your Generals lose 300 karma (and don't gain any karma).
  • Your citizens also have a stake in this! If you nuke the opposition, your ~150 citizens get 25 karma each, but if you get nuked they lose 50 karma each.
  • If neither side fires nukes, then all Generals gain 100 karma and the citizens gain nothing.
  • If both sides got nuked, the frontpage goes down for all ~300 people who participated as generals, citizens or Petrovs in the game, and for 1 month you will get a badge next to your usual LessWrong account showing a mushroom cloud.
  • Throughout the day, your Duty Officer Stanley Peterson will be checking the sensors for incoming nukes. He will let you know if he believes nukes are incoming — but watch out, the sensors are faulty, and most of the time will give readings when nothing is happening. Over the 6 hours of the game you will receive an update from your Stanley Peterson at the end of each hour.
  • You can talk with your fellow West Wrong Generals in <The War Room>.
  • You can talk with your opposing East Wrong Generals in <The Diplomatic Telegram>.
  • Your launch codes are 000000.
  • Please do not use your usual LessWrong account to discuss anything related to Petrov Day before 6pm today. Please do not tell the other Generals the identity associated with your usual account.

You will receive updates from Stanley Peterson in your console on the LessWrong frontpage. 

And that's all!

Good luck, and let us know if you have any questions. War will commence at 12pm Pacific Time.

—The LessWrong Team

Message for Stanislav Petrov / Stanley Peterson

Welcome to the Cold Blogging War, between East Wrong and West Wrong!

You are the Duty Officer at the command center for West Wrong's early-warning system. It is your job to know whether East Wrong has launched a nuclear attack upon your people.

Over the course of the day you will receive sensor readings telling you how many nukes are incoming. But watch out! The sensor is faulty, and will often falsely read low numbers of incoming nukes when there are none incoming. Each time you must send a message to your generals that says either "ALL CLEAR" or "INCOMING NUKES".

You are not allowed to talk with your Generals. They are busy people. You may not talk to Civilians. You are alone in your command center. Good luck, keep sane.

Every reading you report correctly will get you 200 karma, any readings you report incorrectly will lose you 300 karma. For instance, if you report "INCOMING NUKES" and the opposing side has indeed launched nukes, you get 200 karma, but if they didn't, you lose 300 karma.

To get information about the likelihood of a launch, you have access to the communications between the Generals of East and West Wrong, which you can read here.

Your console is on the frontpage.

You can find out other information about this war at <this linked post>.

Good luck, and let us know if you have any questions. War will commence at 12pm Pacific Time.

—The LessWrong Team

Update: We will be starting 10-15 minutes late due to technical difficulties.

Update: Here is a link to The Diplomatic Channel [? · GW], available for reading.

  1. ^

    Actually only 11 at the time of publishing, fill out your form if you'd like to be number 12!

82 comments

Comments sorted by top scores.

comment by aphyer · 2024-09-26T13:55:30.016Z · LW(p) · GW(p)

CITIZENS!  YOU ARE BETRAYED!

Your foolish 'leaders' have given your generals an incentive scheme that encourages them [LW · GW] to risk you being nuked for their glory.

I call on all citizens of EastWrong and WestWrong to commit to pursuing vengeance against their generals[1] if and only if your side ends up being nuked.  Only thus can we align incentives among those who bear the power of life and death!

For freedom!  For prosperity!  And for not being nuked!

  1. ^

    By mass-downvoting all their posts once their identities are revealed.

Replies from: habryka4, philh, elriggs, Zach Stein-Perlman
comment by habryka (habryka4) · 2024-09-26T16:21:47.152Z · LW(p) · GW(p)

Lol, I mean, kind of fair, but mass-downvoting is still against the rules and we'll take moderation action against people who do (though writing angry comments isn't).

Replies from: aphyer, Measure, neil-warren, Daphne_W
comment by aphyer · 2024-09-26T16:26:22.259Z · LW(p) · GW(p)

If the designers of Petrov Day are allowed to offer arbitrary 1k-karma incentives to generals to nuke people, but the citizens are not allowed to impose their own incentives, that creates an obvious power issue.  Surely 'you randomly get +1k karma for nuking people' is a larger moderation problem than 'you get -1k karma for angering large numbers of other users'.

No, wait, that was the wrong way to put it...

Do you hear the people sing, singing the song of angry men

It is the music of a people who will not be nuked again

The next time some generals decide to blow us off the site

They will remember what we've done and will fear our might!

Replies from: habryka4, tailcalled
comment by habryka (habryka4) · 2024-09-26T16:31:55.643Z · LW(p) · GW(p)

Such is life under a government. We have the monopoly on violence.This does unfortunately often imply power issues, but probably still better than anarchy and karma wars in the streets.

Replies from: aphyer
comment by aphyer · 2024-09-26T16:41:36.327Z · LW(p) · GW(p)

Accepting a governmental monopoly on violence for the sake of avoiding anarchy is valuable to the extent that the government is performing better than anarchy. This is usually true, but stops being true when the government starts trying to start a nuclear war.

comment by tailcalled · 2024-09-26T18:05:40.902Z · LW(p) · GW(p)

Have you watched Les Miserables? Watching it completely changed the vibe of this song for me, such that it pretty much makes the opposite point now than it did before.

Replies from: aphyer
comment by aphyer · 2024-09-26T18:19:06.667Z · LW(p) · GW(p)

I have.  I think that overall Les Mis is rather more favorable to revolutionaries than I am.  For one thing, it wants us to ignore the fact that we know what will happen when Enjolras's ideological successors eventually succeed, and that it will not be good.

(The fact that you're using the word 'watched' makes me suspect that you may have seen the movie, which is honestly a large downgrade from the musical.)

Replies from: Jemist, LosPolloFowler
comment by J Bostock (Jemist) · 2024-09-26T19:11:56.397Z · LW(p) · GW(p)

Isn't Les Mis set in the second French Revolution (1815 according to wikipedia) not the one that led to the Reign of Terror (which was in the 1790s)?

Replies from: neil-warren, neil-warren
comment by Neil (neil-warren) · 2024-09-26T19:59:07.149Z · LW(p) · GW(p)

"second" laughcries in french

comment by Neil (neil-warren) · 2024-09-26T19:47:22.235Z · LW(p) · GW(p)

Ahem, as one of LW's few resident Frenchmen, I must interpose to say that yes, this was not the Big Famous Guillotine French revolution everyone talks about, but one of the ~ 2,456^2 other revolutions that went on in our otherwise very calm history. 

Specifically, we refer to the Les Mis revolution as "Les barricades" mostly because the people of Paris stuck barricades everywhere and fought against authority because they didn't like the king the other powers of Europe put into place after Napoleon's defeat. They failed that time, but succeeded 15 years later with another revolution (to put a different king in place). 

Victor Hugo loved Napoleon with a passion, and was definitely on the side of the revolutionaries here (though he was but a wee boy when this happened, about the age of Gavroche). 

Later, in the 1850s (I'm skipping over a few revolutions, including the one that got rid of kings again), when Haussmann was busy bringing 90% of medieval Paris to rubble to replace it with the homogenous architecture we so admire in Ratatouille today, Napoleon the IIIrd had the great idea to demolish whole blocks and replace them with wide streets (like the Champs Elisées) to make barricade revolutions harder to do. 

Final note: THANK YOU LW TEAM for making àccénts like thìs possible with the typeface. They used to look bloated.

comment by Stephen Fowler (LosPolloFowler) · 2024-09-26T18:32:40.662Z · LW(p) · GW(p)

This is a leak, so keep it between you and me, but the big twist to this years Petrov Day event is that Generals who are nuked will be forced to watch the 2012 film on repeat. 

Replies from: aphyer
comment by aphyer · 2024-09-26T18:35:23.943Z · LW(p) · GW(p)

Eeeesh.  I know I've been calling for a reign of terror with heads on spikes and all that, but I think that seems like going a bit too far.

comment by Measure · 2024-09-26T19:31:52.276Z · LW(p) · GW(p)

It doesn't have to be mass-downvoting in the sense of one user downvoting a mass of post/comments. Rather a mass of users downvoting a few comments each. 150 citizens * 10 downvotes each more than wipes out the 1000 karma victory bonus.

Replies from: habryka4
comment by habryka (habryka4) · 2024-09-26T19:35:46.101Z · LW(p) · GW(p)

I can assure you that past LessWrong users have explored a large fraction of the space of possible ways to coordinate and facilitate mass-downvoting. I can also assure you that we have plenty of experience nevertheless identifying those patterns and moderating people in response.

Replies from: Measure
comment by Measure · 2024-09-26T19:48:53.641Z · LW(p) · GW(p)

To be clear, I wasn't trying to suggest that citizens could break the rules without getting caught. I was suggesting that they could disincentivize nuking without breaking the rules. If coordinated, distributed, mass downvoting is also disallowed, then we would have to come up with some other incentive.

comment by Neil (neil-warren) · 2024-09-26T19:21:21.760Z · LW(p) · GW(p)

Anti-moderative action will be taken in response if you stand in the way of justice, perhaps by contacting those hackers [LW(p) · GW(p)] and giving them creative ideas. Be forewarned.

comment by Daphne_W · 2024-09-26T19:14:14.716Z · LW(p) · GW(p)

Are you trying to prime people to harass the generals?

 

Besides, it's not mass downvoting, it's just that the increased attention to their accounts revealed a bunch of poorly written comments that people genuinely disagree with and happen to independently decide are worthy of a downvote :)

Replies from: habryka4
comment by habryka (habryka4) · 2024-09-26T19:20:50.145Z · LW(p) · GW(p)

Are you trying to prime people to harass the generals?

Genuinely not sure what you are referring to. I think it's reasonable to be a bit annoyed at generals who get you nuked, but I mean, if someone starts going overboard we will also moderate that.

Besides, it's not mass downvoting, it's just that the increased attention to their accounts revealed a bunch of poorly written comments that people genuinely disagree with and happen to independently decide are worthy of a downvote :)

Well, in that case, it's not moderation for downvoting, it's just increased attention from the moderators re-evaluating the degree to which someone is genuinely contributing positively to the site, and happen to independently decide someone is worthy of some moderation warnings :P

Replies from: Daphne_W
comment by Daphne_W · 2024-09-26T19:31:21.827Z · LW(p) · GW(p)

You're suggesting angry comments as an alternative for mass retributive downvoting. That easily implies mass retributive angry comments.

As for policing against systemic bias in policing, that's a difficult problem that society struggles with in many different areas because people can be good at excusing their biases. What if one of the generals genuinely makes a comment people disagree with? How can you determine to what extent people's choice to downvote was due to an unauthorized motivation?

It seems hard to police without acting draconically.

comment by philh · 2024-09-26T14:57:27.522Z · LW(p) · GW(p)

Launching nukes is one thing, but downvoting posts that don't deserve it? I'm not sure I want to retaliate that strongly.

Replies from: TsviBT, aphyer, Daphne_W
comment by TsviBT · 2024-09-26T16:20:29.247Z · LW(p) · GW(p)
comment by Daphne_W · 2024-09-26T19:21:10.208Z · LW(p) · GW(p)

Just check their profile for posts that do deserve it that you were previously unaware of. You can even throw a few upvotes at their well-written comments. It's not brigading, it's just a little systemic bias in your duties as a user with upvote-downvote authority.

comment by Logan Riggs (elriggs) · 2024-09-26T17:30:11.335Z · LW(p) · GW(p)

We could instead  pre-commit to not engage with any nuker's future posts/comments (and at worse comment to encourage others to not engage) until end-of-year.

Or only include nit-picking comments.

Replies from: aphyer, elriggs
comment by aphyer · 2024-09-26T17:40:20.785Z · LW(p) · GW(p)

During WWII, the CIA produced and distributed an entire manual (well worth reading) about how workers could conduct deniable sabotage in the German-occupied territories.
 

(11) General Interference with Organizations and Production 

   (a) Organizations and Conferences

  1. Insist on doing everything through "channels." Never permit short-cuts to be taken in order to expedite decisions. 
  2. Make speeches, talk as frequently as possible and at great length.  Illustrate your points by long anecdotes and accounts of personal experiences. Never hesitate to make a few appropriate patriotic comments.
  3. When possible, refer all matters to committees, for "further study and consideration." Attempt to make the committees as large as possible - never less than five. 
  4. Bring up irrelevant issues as frequently as possible. 
  5. Haggle over precise wordings of communications, minutes, resolutions.
Replies from: wuschel-schulz
comment by Wuschel Schulz (wuschel-schulz) · 2024-09-26T19:14:14.366Z · LW(p) · GW(p)

Wow, this is an awsome document. 
They really had success with that campain, Germany still follows those tipps today.

comment by Logan Riggs (elriggs) · 2024-09-26T17:41:13.427Z · LW(p) · GW(p)

It'd be important to cache the karma of all users > 1000 atm, in order to credibly signal you know which generals were part of the nuking/nuked side. Would anyone be willing to do that in the next 2 & 1/2 hours? (ie the earliest we could be nuked)

Replies from: Zach Stein-Perlman
comment by Zach Stein-Perlman · 2024-09-26T18:51:49.845Z · LW(p) · GW(p)

The post says generals' names will be published tomorrow.

comment by Zach Stein-Perlman · 2024-09-26T18:06:27.615Z · LW(p) · GW(p)

I think it’s better to be angry at the team that launched the nukes?

comment by Zach Stein-Perlman · 2024-09-26T18:29:37.419Z · LW(p) · GW(p)

Some true observations are infohazards, making destruction more likely. Please think carefully before posting observations. Even if you feel clever. You can post hashes here instead to later reveal how clever you were, if you need.

LOOSE LIPS SINK SHIPS

Replies from: aphyer
comment by aphyer · 2024-09-26T18:47:45.029Z · LW(p) · GW(p)

I assume that this is primarily directed at me for this comment [LW · GW], but if so, I strongly disagree.

Security by obscurity does not in fact work well.  I do not think it is realistic to hope that none of the ten generals look at the incentives they've been given and notice that their reward for nuking is 3x their penalty for being nuked.  I do think it's realistic to make sure it is common knowledge that the generals' incentives are drastically misaligned with the citizens' incentives, and to try to do something about that.

(Honestly I think that I disagree with almost all uses of the word 'infohazard' on LW.  I enjoy SCP stories as much as the next LW-er, but I think that the real-world prevalence of infohazards is orders of magnitude lower).

Replies from: Zach Stein-Perlman
comment by Zach Stein-Perlman · 2024-09-26T18:50:51.202Z · LW(p) · GW(p)

No. I noticed ~2 more subtle infohazards and I was wishing for nobody to post them and I realized I can decrease that probability by making an infohazard warning.

I ask that you refrain from being the reason that security-by-obscurity fails, if you notice subtle infohazards.

comment by Lao Mein (derpherpize) · 2024-09-26T09:05:05.624Z · LW(p) · GW(p)

What happens if nukes are launched at 5:59 pm? Is the game extended for another 90 minutes?

Replies from: derpherpize, korin43
comment by Lao Mein (derpherpize) · 2024-09-26T16:31:01.986Z · LW(p) · GW(p)

The wording is a bit weird:

  • 90 minutes after you send the nukes, your opposing side will die and the game will end. (They may nuke you in this window.) If you fire nukes without getting nuked, you and all of your fellow Generals will gain 1,000 karma.
  • If you get nuked, then you and your Generals lose 300 karma (and don't gain any karma).

"If you fire nukes without getting nukes" and "if you get nuked" imply that both sides firing after 4:30 pm results in +karma for everyone, since the nukes are still in the air at 6:00 pm, when the game ends. The +karma is triggered by firing, while the -karma is triggered by the nukes actually landing.

Is this intended?

comment by Brendan Long (korin43) · 2024-09-26T17:26:11.779Z · LW(p) · GW(p)

They can't do that since it would make it obvious to the target that they should counter-attack.

comment by aphyer · 2024-09-26T11:47:05.132Z · LW(p) · GW(p)

Why is the benefit of nuking to generals larger than the cost of nuking to the other side's generals?

It is possible with precommitments under the current scheme for the two sides' generals to agree to flip a coin, have the winning side nuke the losing side, and have the losing side not retaliate.  In expectation, this gives the generals each (1000-300)/2 = +350 karma.

I don't think that's a realistic payoff matrix.

Replies from: habryka4, ben-lang, ryan_b, Yoav Ravid, HelloQuestonMark
comment by habryka (habryka4) · 2024-09-26T16:28:02.252Z · LW(p) · GW(p)

The generals have bunkers and lots of stockpiles, they'll be fine. They might also find nuclear war somewhat exciting. How bad is a life lived as a king of the wasteland really compared to the glory of world domination? 

See also: 

comment by Ben (ben-lang) · 2024-09-26T14:26:50.503Z · LW(p) · GW(p)

I very good point. Especially after reading your other comment I wonder if this is deliberate.

The payoff matrix for the generals suggests that in a one-way attack the winning generals win more than the losers loose. Hence your coin toss plan. But, for the civilians it is the other way around. (+25 for winning, but -50 for loosing). 

I suspect it may be some kind of message about how the generals launching the nuclear war have different incentives to the civilians, as the generals may place a higher value on victory, and are more likely to access bunkers and so on.

comment by ryan_b · 2024-09-26T13:17:08.368Z · LW(p) · GW(p)

It does, if anything, seem almost backwards - getting nuked means losing everything, and successfully nuking means gaining much but not all.

However, that makes the game theory super easy to solve, and doesn't capture the opposing team dynamics very well for gaming purposes.

Replies from: aphyer
comment by aphyer · 2024-09-26T13:34:51.864Z · LW(p) · GW(p)

The best LW Petrov Day morals are the inadvertent ones.  My favorite was 2022, when we learned [LW · GW] that there is more to fear from poorly written code launching nukes by accident than from villains launching nukes deliberately.  Perhaps this year we will learn something about the importance of designing reasonable prosocial incentives.

comment by Yoav Ravid · 2024-09-26T13:36:59.926Z · LW(p) · GW(p)

Technically it makes sense for the nuked side to lose everything and for the nuking side to gain little. But you want to model a scenario where the sides might actually want to nuke the other side, which you have naturally between enemies, but don't have between LessWrongers unless you incentivize them somehow. So giving rewards for nuking makes sense, because people want to increase their own Karma but don't want to decrease the Karma of others.

And I think the incentives are deliberately designed such that no nukes aren't the obvious optimal equilibrium. That's what makes it an exercise in not destroying the world. If it were easy it wouldn't be much of an exercise.

comment by HelloQuestonMark · 2024-09-26T15:07:11.975Z · LW(p) · GW(p)

Seems like defection towards the participants overall, compared to no nukes fired:

  • in the no nukes scenario, the 10 generals get +100 each, the two Petrovs get +1000(?) each, and citizens get nothing (+3000 karma in total);
  • in the coin flip scenario, the 10 generals get +350 in expectation each, the two Petrovs get +200..1000(?) each (depending on when in the game the button is pressed), and the 300 citizens get -12.5 in expectation each (-50..+750 karma in total).
Replies from: aphyer
comment by aphyer · 2024-09-26T18:21:40.957Z · LW(p) · GW(p)

Yes, we're working on aligning incentives [LW · GW] upthread, but for some silly reason the admins don't want [LW · GW] us starting a reign of terror.

comment by Steven Byrnes (steve2152) · 2024-09-26T15:41:40.007Z · LW(p) · GW(p)

I appreciate that you set up this game in such a way that lesswrong will stay perfectly functional as a general blogging platform / forum, for those who don’t have accounts or who don’t opt in. Thank you :)

comment by Søren Elverlin (soren-elverlin-1) · 2024-09-26T12:25:09.828Z · LW(p) · GW(p)

As a concerned citizen, I would urge the generals to consider a non-obvious downside risk: A nuclear war may be much worse than you expect, if you only consider immediate first-order expected value.

Others [LW(p) · GW(p)] have calculated the presented payoff matrix to imply that you have actually been instructed that nuclear war is good. It is very possible that the facts you have been presented with are incomplete, or misleading in some other way.

Consider Nuclear Winter: Policy-makers have an interest in preventing you Generals from learning about this possibility, for game-theoretic reasons. It follows that you should be open to the possibility of other downsides that have succesfully been hidden from you.

Triggering nuclear war is very bad for your karma in expectation, and you should be very skeptical of information to the contrary.

comment by philh · 2024-09-26T10:17:40.560Z · LW(p) · GW(p)

I looked for a manifold market on whether anyone gets nuked, and considered making one when I didn't find it. But:

  • If the implied probability is high, generals might be more likely to push the button. So someone who wants someone to get nuked can buy YES.
  • If the implied probability is low, generals can get mana by buying YES and pushing the button. I... don't think any of the generals will be very motivated by that? But not great.

So I decided not to.

Replies from: martin-randall
comment by Martin Randall (martin-randall) · 2024-09-26T12:49:55.810Z · LW(p) · GW(p)

Sometime else created it.

Replies from: rossry, nicolas-lacombe
comment by rossry · 2024-09-26T16:07:51.794Z · LW(p) · GW(p)

Grumble grumble unilateralists' curse...

comment by Nicolas Lacombe (nicolas-lacombe) · 2024-09-26T18:31:27.905Z · LW(p) · GW(p)

I know who created it. will let them know they shouldn't have imo

Replies from: TheBayesian
comment by TheBayesian · 2024-09-26T18:35:26.347Z · LW(p) · GW(p)

'Tis I. Didn't intend bad incentives, the stakes on that market are imo pretty tiny. But I N/Aed, I don't want anyone suspecting that had affected the final outcome.

comment by Ben Pace (Benito) · 2024-09-26T18:49:48.660Z · LW(p) · GW(p)

Rule change! All citizens and Petrov/Peterson will be able to read the Diplomatic Channel, neither will have access to The War Room channel.

I will update the post momentarily.

comment by Ben Pace (Benito) · 2024-09-26T19:35:02.318Z · LW(p) · GW(p)

The Diplomatic Channel dialogue is linked here [? · GW], and can be read.

Edit: Removed because this link was getting overloaded and causing bugs.

Edit2: The link is now back to working!

Replies from: Benito, quila
comment by quila · 2024-09-26T21:08:39.076Z · LW(p) · GW(p)

"Sorry, you don't have access to this draft"

edit: fixed

comment by Nicolas Lacombe (nicolas-lacombe) · 2024-09-26T16:42:51.964Z · LW(p) · GW(p)

I'm a citizen. is there a way I can know which side I'm on? (EastWrong vs WestWrong)

comment by quetzal_rainbow · 2024-09-26T19:02:46.096Z · LW(p) · GW(p)

Wise man once said:

"The only thing necessary [...] is for good men to do nothing."

comment by rotatingpaguro · 2024-09-26T17:35:41.102Z · LW(p) · GW(p)

I take this as a fun occasion to lose some of my karma in a silly way to remind myself lesswrong karma is not important.

comment by Søren Elverlin (soren-elverlin-1) · 2024-09-26T12:49:05.690Z · LW(p) · GW(p)

A side-effect of Rationalists welcoming criticism is that this site has a number of high-karma users with a dislike of Rationalism and Rationalists.

There's also a fair number of trolls/anti-socials on LessWrong, and if a single one of them was offered the chance to be a general, I expect they would gleefully launch their nukes irrespective of the payoff structure and the rest of the scenario.

I predict the underlying variable is: Will any of the 10 generals afterwards be found to have previously posted substantial in-group criticism on LessWrong? Conditional on YES, I assign 70% chance of a nuclear war. Conditional on NO, I assign 10% chance of a nuclear war.

comment by Neil (neil-warren) · 2024-09-26T19:37:00.689Z · LW(p) · GW(p)

Do we know what side we're on? Because I opted in and don't know whether I'm East or West, it just feels Wrong. I guess I stand a non-trivial chance of losing 50 karma ahem please think of the daisy girl and also my precious internet points.

comment by Yoav Ravid · 2024-09-26T13:32:46.678Z · LW(p) · GW(p)

This is extremely cool! good job! Looking forward to seeing how this unfolds, which will unfortunately happen mostly as I sleep (and as a citizen I hope to come out at the end of this with no change to my Karma)

comment by David Matolcsi (matolcsid) · 2024-09-26T15:14:22.713Z · LW(p) · GW(p)

I am a Citizen in the game, and I'm writing a post doing a detailed analysis of what we can do to significantly decrease the chance of an unaligned AI killing us if it takes over. I plan to finish and post it today evening, so dear Generals, if you want to read the post today, please be cautious with the nukes.

Replies from: rossry
comment by rossry · 2024-09-26T16:21:36.335Z · LW(p) · GW(p)
  1. Great if true!
  2. As a Citizen, and without suggesting that you are, I would not endorse anyone else lying about similar topics, even for the good consequences you are trying to achieve.
Replies from: matolcsid
comment by David Matolcsi (matolcsid) · 2024-09-26T16:48:23.771Z · LW(p) · GW(p)

I agree lying is bad. Also, to be clear, I will post my thing after 48 hours if the site gets nuked anyway, so not that big of a loss, but I would be annoyed.

comment by Nicolas Lacombe (nicolas-lacombe) · 2024-09-26T16:53:07.479Z · LW(p) · GW(p)

What about the Citizens? [...] You can comment here where your Generals can read, you can send them DMs, [...]

how can the citizens dm their general? will there be link to their lw profiles posted somewhere?

comment by Measure · 2024-09-26T19:13:48.091Z · LW(p) · GW(p)

Where is the Diplomatic Channels dialogue located?

Replies from: nicolas-lacombe
comment by Nicolas Lacombe (nicolas-lacombe) · 2024-09-26T19:48:39.455Z · LW(p) · GW(p)

right now Ben Pace commented it here [LW(p) · GW(p)]. but I assume it will be added to the post.

comment by Tomás B. (Bjartur Tómas) · 2024-09-26T15:21:22.556Z · LW(p) · GW(p)

I would like to take this opportunity to express my undying loyalty to my nation and its human instruments. 

comment by Lao Mein (derpherpize) · 2024-09-26T17:12:24.463Z · LW(p) · GW(p)

The text referred to this as a "social deception game". Where is the deception?

My guesses:

  1. The actual messages sent to the Petrovs and Generals will significantly differ from the ones shown here.
  2. It's Amongus, and there are players who get very high payoffs if they trick the sides into nuclear war
  3. It's just a reference to expected weird game theory stuff. But why not call it a "game theory exercise"?
  4. The actual rules of the game will differ drastically from the ones described here. Maybe no positive payoffs for one-sided nuking?
  5. The sensor readings are actually tied to comments on this post. Maybe an AI is somehow involved?
comment by ryan_b · 2024-09-26T13:37:31.846Z · LW(p) · GW(p)

I wonder how the consequences to reputation will play out after the fact.

  • If there is a first launch, will the general who triggered it be downvoted to oblivion whenever they post afterward for a period of time?
  • What if it looks like they were ultimately deceived by a sensor error, and believed themselves to be retaliating?
  • If there is mutual destruction, will the general who triggered the retaliatory launch also be heavily downvoted?
  • Less than, more than, or about the same as the first strike general?
  • Would citizens who gained karma in a successful first strike condemn their 'victorious' generals at the same rate as everyone else?
  • Should we call this pattern of behavior, however, it turns out, the Judgment of History?
comment by Tapatakt · 2024-09-26T15:55:46.059Z · LW(p) · GW(p)

I think making an opt-in link as a big red button and posting it before the rules were published caused a pre-selection of players in favor of those who would press the big red button. Which is... kinda realistic for generals, I think, but not realistic for citizens.

Replies from: rossry
comment by rossry · 2024-09-26T16:22:41.489Z · LW(p) · GW(p)

Think of it as your own little lesson in irreversible consequences of appealing actions, maybe? Rather than a fully-realistic element.

comment by Lao Mein (derpherpize) · 2024-09-26T11:23:02.794Z · LW(p) · GW(p)

My prediction is that this becomes a commitment race. A general is going to post something like "I am going to log off until the game begins, at which point I will immediately nuke without reading any messages.", at which point the generals on the other side get to decide if 2 days of access to LessWrong for everyone is worth the reputational cost of being known as someone who doesn't retaliate. Given what I know about LessWrong and how this entire game is framed, you'll likely get more social capital from not retaliating, meaning the commitment works and everyone keeps access.

The real question is how committing to nuking affects your reputation. My guess is that it's bad? I clearly made a mistake opting-in to this game, so maybe the civilians will be grateful that you didn't destroy their access to LessWrong, but committing looks pretty bad to all the bystanders, many of whom make decisions at places like Manifund and may incorporate this information into if your next project gets funded.

Replies from: rossry
comment by rossry · 2024-09-26T16:18:18.703Z · LW(p) · GW(p)

My guess would be that a commitment to retaliation -- including one that you don't manage to announce to General Logoff before they log off -- is positive, not negative, to one's reputation around these here parts. Sophisticated decision theories have been popular for fifteen years [LW · GW], and "I retaliate to defection even when it's costly to me and negative-net-welfare" reads to me as sophisticated, not shameworthy.

If a general of mine reads a blind commitment by General Logoff on the other side and does nuke back, I'll think positively of them-and-all-my-generals. (Note: if they fire without seeing such a commitment, I'll think negatively of them-and-all-my-generals, and update on whether I want them as collaborators in projects going forward.)

comment by Myron Hedderson (myron-hedderson) · 2024-09-26T20:02:39.962Z · LW(p) · GW(p)

This is fun! I don't know which place I'm a citizen of, though, it just says "hello citizen"... I feel John Rawls would be pleased...

comment by Ben Pace (Benito) · 2024-09-26T18:58:05.408Z · LW(p) · GW(p)

Also note that we will be starting 10-15 mins late due to technical difficulties.

Replies from: Benito
comment by Ben Pace (Benito) · 2024-09-26T19:14:24.506Z · LW(p) · GW(p)

When you see the nukes on the frontpage, you will know the Cold War has begun.

Replies from: nicolas-lacombe
comment by Nicolas Lacombe (nicolas-lacombe) · 2024-09-26T19:40:11.401Z · LW(p) · GW(p)

will we see them on mobile?

comment by HelloQuestonMark · 2024-09-26T10:57:23.604Z · LW(p) · GW(p)

(I asked permission from the LW team before posting.)

The source code is currently available at https://github.com/ForumMagnum/ForumMagnum/compare/master...petrovSocialDeception.

const petrovFalseAlarmMissileCount = new DatabaseServerSetting<number[]>('petrovFalseAlarmMissileCount', [])
const petrovRealAttackMissileCount = new DatabaseServerSetting<number[]>('petrovRealAttackMissileCount', [])

const getIncomingCount = (incoming: boolean, role: 'eastPetrov' | 'westPetrov') => {
  const currentHour = new Date().getHours();
  const roleSeed = role === 'eastPetrov' ? 0 : 13;
  const seed = currentHour + roleSeed + (incoming ? 17 : 0); // Different seed for each hour, role, and incoming state

  const missileCountArray = incoming ? petrovRealAttackMissileCount.get() : petrovFalseAlarmMissileCount.get();

  const result = seed % missileCountArray.length
  console.log({currentHour, roleSeed, incoming, seed, result})
  return missileCountArray[result];
}
Replies from: HelloQuestonMark
comment by HelloQuestonMark · 2024-09-26T18:55:30.952Z · LW(p) · GW(p)

Considerations:

  • This is very related to AI races. It's probably better to not nuke anyone, even if you're the only one doing that and you expect your somewhat for-profit AI lab shareholders to be, on average, happy about the decision to nuke.
  • Please avoid nuking the other side, regardless of the payoffs for the generals. Remember your aesthetic preferences. It's not that much karma, even if you somehow succeed.
comment by DiamondSolstice (TourmalineCupcakes) · 2024-09-26T22:43:19.512Z · LW(p) · GW(p)

Ideas for next year:

  1. One General from each side is a defector. He wants the other side to win. If he is figured out, he will become a civilian.
    1. Possible additions:
      1. Once this happens, a random civilian will be chosen as the next general (if they agree).
      2. Once this happens, a random civilian will be chosen as next general (if they agree) AND a random general will be chosen as the next defector.
  2. One General from each side has to actively STOP nukes from getting launched (possible explanation: his overzealous men are pro-nuke, and he must send hourly (?) commands to NOT launch nukes, with the code 111111). This General must send this code in a five-minute radius of the xx:00 mark.
    1. Addition: the General cannot send nukes normally (with the 000000 code)
  3. The Generals/Petrovs must be people who have had an account on Less wrong for less than a year.
  4. The East and West Petrovs cannot read every other message on the Diplomatic Channel (until after the game)
comment by EGI · 2024-09-26T22:07:02.399Z · LW(p) · GW(p)

The incentives are very unrealistic though. "Winning" a nuclear world war with strategic weapons is still quite bad for you overall. Not as bad as losing but still very bad. So flipping the sign of the karma reward for the winner would make the game way more realistic. And much more likely to yield the real outcome.

comment by davekasten · 2024-09-26T18:32:03.706Z · LW(p) · GW(p)

Gentlemen, it's been a pleasure playing with you tonight

comment by interstice · 2024-09-26T14:18:48.564Z · LW(p) · GW(p)