# Moloch games

post by Chris van Merwijk (chrisvm) · 2020-10-16T15:19:04.722Z · LW · GW · 5 comments**tl;dr**: This post suggests a direction for modelling Molochs. The main thing this post does is to rename the concept of “potential games” (an existing concept in game theory) to “Moloch games” to suggest an interpretation of this class of games. I also define "the preferences of a Moloch" to generalize that notion (the preferences may be intransitive).

This post assumes that you have familiarity with game theory and the concept of a Moloch [? · GW].

What do group dynamics want? If a society/group (the Moloch) wants things that are different from what the individuals want, how can we assign preferences or a utility function to that society/group that doesn't model what would be good for the aggregated preference of the individuals of the group, but that describes what the group dynamics is actually "trying" to achieve (even against the interest of its individual members)? Here is a suggested answer.

**Intuition**. A **Moloch game** is a game such that there is a utility function , called “**the Moloch’s utility function**”, such that if the agents behave individually rationally, then they collectively behave as a “Moloch” that controls all players simultaneously and optimizes . In particular, the Nash equilibria correspond to local optima of .

Not all games are Moloch games.

**Definition**. A game with finite number of players and for each player a strategy space is a *cardinal ***Moloch game **(in the game theory literature, a *cardinal potential game*), if there is a utility function such that for all players , and all strategies for the other players,

Intuitively, if you take any strategy-profile for all the players, and adjust the strategy of one player, then the Moloch's utility will increase/decrease by the same amount as the utility function of that particular player. Hence, intuitively, every player behaves always *as if* they are optimizing the Moloch's utility function.

The definition for an *ordinal *Moloch game replaces the condition with

Intuitively, represents the Moloch's preferences ordinally but not cardinally. Obviously, cardinal Moloch games are also ordinal Moloch games.

**Example**. The prisoners dilemma:

Player 2 | |||

Cooperate | Defect | ||

Player 1 | Cooperate | 1, 1 | -1, 2 |

Defect | 2, -1 | 0 , 0 |

We will show that this is a cardinal Moloch game, by just computing the cardinal utility function and showing that there are no inconsistencies:

**How to compute the cardinal utility function of the Moloch**: Pick an arbitrary strategy profile to have utility 0 (I take Defect, Defect). Then iteratively compute the utility of rows and columns by just applying the constraint that the definition gives: Compute the difference in utility *of the player whose row/column you're moving along* (i.e. player 2 for the rows, player 1 for the columns) of each cell in the row/column from a cell of which you know the value of . In this case, we know . So compute as which equals . Similar for . For , there are two ways to compute it: Using player 1's utility function and or player 2's utility function and . If these two give different answers, then the game is not a cardinal Moloch game.

Here is the **cardinal utility function of the Moloch for the prisoner's dilemma **(The above algorithm gives a utility function that is unique up to translations) :

Cooperate | Defect | |

Cooperate | -3 | -1 |

Defect | -1 | 0 |

**Intuition. **In this case, even though all players prefer Cooperate, Cooperate over Defect, Defect, the Moloch prefers the opposite. This corresponds to the fact that it is individually rational for the players to Defect. This Moloch utility function captures the "preferences of the group dynamics" as opposed to the preferences of the individuals. (It is obviously very different from the notion of "aggregate preferences" or "welfare").

A Moloch game assumes in some sense that "the Moloch has transitive preferences". We can generalize to Molochs with possibly intransitive preferences (I don't know of this being defined this way in the literature on potential games):

**Definition**. Let be a game with a finite number of players, each of which has a preference relation over the strategy space (by default derived from a utility function ). Then the **Moloch's preferences** are defined as the preference relation satisfying for all players , and all strategy profiles for the other players:

**Observation**. These preferences are always incomplete (intuitively, the Moloch doesn't have an opinion on the comparison between different players changing their strategies, because it doesn't have this information: players individually make choices given their options). They may be either transitive or intransitive. I'll say a Moloch's preferences are rational if they are transitive (neglecting the usual requirement of completeness).

Just to show that the concepts are what they should be:

**Lemma**. Any game whose Moloch has transitive preferences is an ordinal Moloch game. Any ordinal Moloch game has a Moloch with transitive preferences.

**Proof**. For any transitive relation on a space there is a real-valued function on it that is consistent with that relation. The other direction follows directly.

**Intuition**. If the Moloch has transitive preferences, then the Moloch knows what it wants and the game will have a pure Nash equilibrium (there is a theorem that formalizes this). Conversely, if the Moloch has intransitive preferences, then the Moloch doesn't know what it wants and the game will tend to have cycles (not all of them will because the players might want to move out of them into a "transitive region" of the Moloch's preferences).

I won't show this here, but the literature on potential games (cf. the thing I am calling Moloch games), these are examples:

**Games with rational Molochs (i.e. Moloch games / potential games)**:

- Prisoner's dilemma
- Battle of the sexes
- Coordination game
- Game of Chicken

**Games with irrational Molochs (i.e. not Moloch games / potential games)**:

- Matching pennies
- Rock paper scissors

I probably won't spend much more time on this, but here is a suggestion for taking this as a starting point to modelling Molochs:

- Check if various informal ideas about Molochs can be phrased in this language. Check if the language is satisfying to talk about actual Molochs.
- Look at the literature on potential games to see if it contains much insight. Make a dictionary of concepts named in the terminology of the ontology we're interested in (similar to how I renamed "potential game" to "Moloch game") to make this literature an "efficiently queryable database" for insights into Molochs.
- I suspect that there might be ideas to be had about Moloch games that aren't treated there, because as far as I know, potential games were developed mostly as a trick to make computations easier, not as a conceptual tool for thinking about Molochs, societal inadequacy and so forth. It's plausible that certain obvious questions haven't been asked about them for this reason. Try to actually model Molochs this way and see if these definitions allow us to answer questions we want to ask about them. Use this as a stepping stone and see where it is unsatisfying. Build on top of that to push the analysis further.

Feel free to contact me if you want to think about this.

**Some reading**:

Flows and Decompositions of Games: Harmonic and Potential Games. In the language of this post: decomposing a game into a "rational part" of the Moloch, and an irrational deviation from it. Finding the "closest rational Moloch" of a game.

**Some further ideas and questions to ask**:

- Can real world societies be decomposed into multiple Molochs? In the style of the "Flows and Decompositions of Games" paper, it wouldn't have to be a decomposition in terms of subgroups of players, but of "aspects of the game-theoretic interaction". (e.g. an individual might simultaneously be part of a "capitalism Moloch" and a "politics Moloch"). Maybe Molochs can be approximately decomposed.
- Is there a notion of "Moloch game" for sequential games? Games with limited information? (The potential game literature probably has asked analogous questions).

## 5 comments

Comments sorted by top scores.

## comment by Vanessa Kosoy (vanessa-kosoy) · 2020-10-17T07:09:48.022Z · LW(p) · GW(p)

That's a neat interpretation of potential games

If the Moloch has transitive preferences, then the Moloch knows what it wants and the game will have a Nash equilibrium

You mean the game will have a *pure* Nash equilibrium. *Any* game has *some* (mixed) Nash equilibrium.

## ↑ comment by Chris van Merwijk (chrisvm) · 2020-10-18T09:02:45.074Z · LW(p) · GW(p)

Yes that's what I meant, thanks.

## comment by johnswentworth · 2020-10-16T15:55:15.468Z · LW(p) · GW(p)

This is a great idea, well done.

## comment by philh · 2020-10-20T13:17:22.116Z · LW(p) · GW(p)

Intuition. A Moloch game is a game such that there is a utility function , called “the Moloch’s utility function”, such that if the agents behave individually rationally, then they collectively behave as a “Moloch” that controls all players simultaneously and optimizes . In particular, the Nash equilibria correspond to local optima of .

Minor, but this tripped me up. My read of "controls all players simultaneously" would be that there's no such thing as a local optimum, it can just move directly to the global optimum from any other state. I'm not sure what would be a better wording though, and your non-intuitive definition was clear enough to set me right.

## comment by Dagon · 2020-10-18T18:37:08.734Z · LW(p) · GW(p)

I'm not convinced we gain anything by further anthroporphising (or agent-izing) Moloch. Moloch is the result of misaligned agents (who want different outcomes than each other, and will sacrifice the shared environment to pursue their goals), not a separate entity.