Knightian Uncertainty: Bayesian Agents and the MMEU rulepost by So8res · 2014-08-04T14:05:47.758Z · score: 17 (17 votes) · LW · GW · Legacy · 7 comments
Antagonistic ambiguity Preferring the least convenient world Bayesian ambiguity aversion None 7 comments
Recently, I found myself in a conversation with someone advocating the use of Knightian uncertainty. We both agreed that there's no point in singling out some of your uncertainty as "special" unless you treat it differently.
I am under the impression that most advice from advocates of Knightian uncertainty can be taken to heart in a Bayesian framework, and so I find the concept of "Knightian uncertainty" uncompelling. My friend, who I'm anonymizing as "Sir Percy", claims that he does treat Knightian uncertainty differently from normal uncertainty, and so he needs to make the distinction. Unlike an aspiring Bayesian reasoner, who attempts to maximize expected value, he maximizes the minimum expected value given his Knightian uncertainty. This is the MMEU rule motivated previously.
This surprised me: is it possible for a rational agent to refuse to maximize expected utility? My reflexive reaction was simple:
If you're a rational agent and you don't think you're maximizing expected utility, then you've misplaced your "utility" label.
Sir Percy had a response ready:
That can't be. Remember Sir Percy's coin toss. A coin has been tossed, and the event "H" is "the coin came up heads". Consider the following two bets:
- Pay 50¢ to be payed $1.10 if H
- Pay 50¢ to be payed $1.10 if ¬H
I don't know whether the coin was biased, I know only that my credence is in the interval
[0.4, 0.6]. When considering the first bet, I notice that the probability of H may be 0.4, in which case the bet is expected to lose me 6¢. When considering the second bet, I notice that the probability of H may be 0.6, in which case the bet is expected to lose me 6¢. But when considering both together, I see that I will win 10¢ with certainty. So I reject each bet if presented individually, but I will pay up to 10¢ to play the pair.
As you can see, my preferences change under agglomeration of bets. It's not possible to view me as a Bayesian reasoner maximizing expected utility, because there is no credence you can assign to H such that a Bayesian reasoner shares my preferences. I can't be maximizing expected utility, no matter where you put your labels.
My rejection was vague and unformed at the time. I've since fleshed it out, and it will be presented below. But before continuing, see if you can spot my argument on your own:
My friend believes that H occured with probability in the interval [.4, .6]. He is unwilling to pay 50¢ to be payed $1.10 if H, and he is unwilling to pay 50¢ to be payed $1.10 if ¬H, but he is willing to pay up to 10¢ for the pair. There is no way to assign a credence to the event H such that these actions are traditionally consistent. Yet, if we allow that rational agents can in principle have preferences about ambiguity, then he is acting 'rationally' in some sense. Is the Bayesian framework capable of capturing agents with these preferences?
Short answer: Yes.
A Bayesian agent can act like this, and Sir Percy did put his utility label in the wrong place. In fact, there are two ways that a Bayesian idealization of Sir Percy can exhibit Sir Percy's preferences. In the coin toss game, at least one of two things is happening:
- Sir Percy isn't treating H like an event.
- Sir Percy isn't treating money like utility.
I'll explore both possibilities in turn, but first, let's build a little intuition:
The preferences of a rational agent are not supposed to be invariant under agglomeration of bets. Imagine that there are two potions, a blue potion and a green potion. Each potion, taken alone, makes you sick. Both potions, taken together, give you superpowers. It may well be that you reject both "bet 1: 100% you drink the blue potion" and "bet 2: 100% you drink the green potion", but that you happily pay for the pair.
This is not irrational: You could say there is no assignment of utility to the action "drink the blue potion" that makes my actions consistent, but this is an error. Preferences are not over actions, they are over outcomes. If I take bet 1, I get sick. If I take bet 2, I get sick. If I take them both together, I get superpowers.
The fact that I do not honor agglomeration of bets does not mean I am irrational. Sir Percy looks irrational not because his preferences disobey agglomeration of bets, but only because there's no credence which Sir Percy can assign to H that makes his actions consistent with expected money maximization.
This means that, if Sir Percy is traditionally rational, then he's either not treating H like an event, or he's not treating money like utility. And as it turns out, idealized Bayesian reasoners can display Sir Percy's preferences. I'll illustrate two of them below.
Remember Sir Percy's original motivation for rejecting both bets individually in Sir Percy's coin toss:
I'm maximizing the minimum expected utility. Given bet (1), I notice that perhaps the probability of H is only 40%, in which case the expected utility of bet (1) is -6¢, so I reject it. Given bet (2), I notice that perhaps the probability of H is 60%, in which case the expected utility of bet (2) is -6¢, so I reject that too."
Notice how Sir Percy is acting here: given each bet, he assumes that the coin is weighted in whatever manner is worst for him. Sir Percy's
[0.4, 0.6] credence interval for H implies that he has seen evidence that convinces him the coin isn't weighted to land H less than 40% of the time, and isn't weighted to land H more than 60% of the time, but he can't narrow it down any further than that. And when he's considering bets, he acts assuming that the coin is actually weighted in the least convenient way.
We can design a Bayesian agent that acts like this, but this Bayesian agent (who I'll call Paranoid Perry) doesn't treat H like an event. Rather, Perry acts as if the coin's weighting isn't chosen until after Perry selects a bet. Then, nature gets to choose the weighting of the coin (within the constraints set by Perry's confidence interval), and nature always chooses what's worst for Perry.
Let's say that Perry is a perfectly rational Bayesian agent. If nature has a selection of coins weighted from 40% heads to 60% heads, and nature gets to select one of the coins after Perry selects one of the bets, then rejecting each of the bets individually but accepting both together is precisely how Perry maximizes expected utility. The game seems a bit odd until you realize that Perry isn't treating H as an event: Perry is acting as if it's choice of bets affects the weighting of the coin.
Notice that Perry isn't treating its
[0.4, 0.6] credence interval for H as if it's normal uncertainty. Perry is treating this uncertainty as if it is in the world: the coin could actually be weighted anywhere from 40% H to 60% H, and nature gets to choose. It's unclear how Perry decides which uncertainty is internal (subject to normal Bayesian reasoning) and which uncertainty is external (resolved adversarially by nature), but this distinction gives rise to the difference between "normal" uncertainty and "Knightian" uncertainty. Given any mechanism for differentiating between uncertainty about what world Perry is in and variability that gets to be resolved by nature, we can construct a Bayesian reasoner that acts like Perry.
Take, for example, the Ellsberg urn game. Perry has somehow been convinced that the process of selecting balls from the urn is fair: balls are selected by some stochastic process which Nature cannot affect. However, Perry also things that the urn in use gets to be chosen by nature after Perry has selected a bet.
There are many mechanisms by which Perry could come to this state of knowledge. Perhaps the ball-selection mechanism was designed and verified by Perry, but the urn is selected by an unverified process that is supposedly random but which Perry has reason to believe has actually been infected by the nefarious spirit of the Adversary. Given this knowledge, Perry knows that if it chooses bet 1b then nature will select the urn with no black balls, and if it chooses bet 2a then nature will select the urn without yellow balls, so Perry rationally prefers bets 1a and 2b.
Perry is averse to ambiguity because Perry believes that nature gets to resolve ambiguity, and nature is working against Perry. This paranoia captures some of the original motivation for ambiguity aversion, but it starts to seem strange under close examination. For example, in the coin toss game, Perry acts as if nature gets to pick the weighting of the coin even after the coin has been tossed. Perry believes that nature gets to resolve ambiguity, even if the ambiguity lies in the past.
Furthermore, Perry acts as if it is certain that nature gets to resolve ambiguity disfavorably, and this can lead to pathological behavior in some edge cases. For example, consider Perry reasoning about the unbalanced tennis game: Perry believes that one of the players is far better than the other, capable of winning 99 games out of 100. But Perry doesn't know whether Anabel or Zara is the better player. How might the conversation go when a bookie approaches Perry offering a bet of 2:1 odds on the player of Perry's choice?
Hello, Perry, I'm a rich eccentric bookie who likes giving people money in the form of strange bets. I'd like to offer you a bet on the unbalanced tennis game. I know that you know that one of the players is far better than the other, and I know you have "Knightian uncertainty" about which player is better. To countermand your concerns, I'm going to make you a very good deal: I'm willing to offer you a bet with 2:1 odds on the player of your choice.
"I'm sorry", Perry responds, "I cannot take that bet."
But why not? I understand that if I came to you offering a bet with 2:1 odds on Anabel, you should reject it and then update in favor of Anabel being the better player, under the assumption that I was trying to take advantage of your uncertainty. But I'm offering you a bet with 2:1 odds on the player of your choice! How can you reject this?
"Well, you see", Perry responds, "no matter what choice I make, nature will resolve my ambiguity against me. If I place 2:1 odds on Zara winning, then nature will decide that Zara is the worse player, and Zara will loose 99% of the time. But if I instead place 2:1 odds on Zara losing, then nature will decide that Zara is the better player, and Zara will win 99% of the time. Either way, I lose money.
How can that be? Surely, you believe that one player is already better than the other. Your choice can't change who is better; that was decided in the past! You're acting like nature will retroactively make the one that you bet on be worse!
"Yes, precisely", Perry responds. "See, I have Knightian uncertainty about which is better, and nature gets to resolve Knightian uncertainty, regardless of causality. No matter which player I pick, she will turn out to be the worse player."
Realizing that this may sound crazy, Perry pauses.
"Now, I'm not sure how this happens. Perhaps nature has acausal superpowers, actually can alter the past. Or perhaps you simulated me to see who I would pick, and you're only offering me this bet because you found that I would pick the wrong player. I don't know how nature does it, I do know for a fact that nature will resolve my ambiguity antagonistically. Of this I am certain. And so I'm sorry, but I can't take your bet."
And if I offer you 10:1 odds on the player of your choice?
Perry shakes it's head. "Sorry."
The bookie dejectedly departs.
Perry is a Bayesian agent that shares Sir Percy's preferences, thereby demonstrating that the Bayesian framework can capture agents with preferences over ambiguity (given some method for distinguishing ambiguity from uncertainty). Any Bayesian agent that believes that there is actually variability in the environment which nature gets to acausally adversarially resolve acts precisely as if it is using the MMEU rule.
In fact, though Perry may seem to be reasoning in an odd fashion, Perry captures much of the original motivation for the MMEU rule. "I can't get a precise credence", Sir Percy would say, "I can only get a credence interval, because reality actually still has a little wiggle room about whether the coin was weighted. So assuming the worst case, how do I maximize expected utility?"
The paranoia of Perry is a little disconcerting, though. Is there perhaps another idealized Bayesian agent that captures Sir Perry's preferences in a less abrasive manner?
Preferring the least convenient world
Allow me to introduce another Bayesian agent, Cautious Caul. Like Perry, Caul distinguishes between different types of uncertainty. The method of distinction is left unspecified, but given any means of differentiating between uncertainty and ambiguity ("Knightian uncertainty"), we can specify an agent that acts like Caul.
Like Perry, Caul treats ambiguity differently form normal uncertainty because Caul believes that ambiguity is not internal lack of knowledge, but rather a fact about the external world. But whereas Perry thinks that Knightian uncertainty denotes variability in the world that is resolved by nature, Caul instead thinks that ambiguity denotes worldparts that actually exist.
And Caul is an expected utility maximizer (and a perfect Bayesian), but Caul's preferences are defined according to the least convenient world.
When faced with Sir Percy's coin toss, Caul honestly believes that a continuous set of worlds exist, each with a coin weighted somewhere between 40% H and 60% H, and that in each of these worlds there is a sliver of Caul. But Caul's preferences are defined such that Caul only cares about the sliver in the least convenient world, the world with the worst odds.
In other words, every Caul-sliver is willing to trade unlimited amounts of expected dollars in their own worlpart in order to increase the expected dollars of the Caul in the least convenient worldpart. Thus, Caul refuses the first bet in Sir Percy's coin toss, because conditioned upon Caul taking the first bet, Caul only cares about the worldpart where the coin is weighted 40% H (and so Caul refuses the bet). But conditioned upon Caul taking the second bet, Caul only cares about the worldpart where the coin is weighted 60% H (and so Caul refuses the bet). But when offered both bets simultaneously, all the Caul slivers in all the worldparts will gain 10¢, so Caul takes the combination.
In this case, we see why there is no Bayesian with a single credence for H that can exhibit Caul's preferences. From Caul's point of view, H is not a single event, it is a set of events with varying credence. Any Bayesian attempting to assign a single credence to H is neglecting a crucial part of (what Caul sees as) the world's structure.
Furthermore, Caul also fails to treat the outcomes of the bet (in dollars) as utility: Caul only cares about the least convenient world given Caul's action, so the utility that Caul gets from a bet is dependent upon Caul's action. We can see this when imagining Caul facing the Ellsberg urn game. In this scenario, Caul has actually come to believe that there is a Caul-sliver facing each of the sixty urns. Conditioned upon taking bet 1b, Caul only cares about the Caul-sliver facing the urn without any black balls. But conditioned upon taking bet 2a, Caul only cares about the Caul-sliver facing the urn without any yellow balls. Caul's preferences depend upon the action which Caul chooses.
Caul is an expected utility maximizer with preferences only for the worldpart with the worst odds (where worldparts are determined by ambiguity and odds are determined by uncertainty). Caul may appear to be using the MMEU rule, but call is actually maximizing minimum expected dollars while maximizing actual expected utility. This only looks like MMEU if we confuse the inner utility in each worldpart ("dollars") with the outer utility of Caul itself.
To illustrate, consider what happens when Caul encounters our enthusiastic bookie. As with Perry, Caul believes that one of the tennis players in the unbalanced game could beat the other 99 times out of 100. However, Caul has Knightian uncertainty about whether Anabel or Zara is the superior player. This means that Caul acutally believes there are two worldparts, each with a Caul-sliver: one in which Anabel beats Zara 99 times out of 100, and the other in which Anabel loses to Zara 99 times out of 100.
How might Caul's conversation with the bookie go?
Ah, Caul! I have an offer for you. You know the unbalanced tennis game that is about to be played between Anabel and Zara? I'd like to offer you a bet that's heavily in your favor: I'll give you a bet with 2:1 odds on the player of your choice!
"Sorry", Caul says, "can't do that."
Why not? Look, I'm not trying to screw you over here. I just really like giving people free money, and if I can't give enough away, I'll have to answer to my board of trustees. I seriously don't want to mess with those folks. This really is a bet in your favor! It pays out 2:1. I haven't simulated you, I promise!
"I'm sorry", Caul replies, "but I can't. See, if I place the bet on Anabel, then in the world where Zara is the better player, I expect that version of Caul to lose money."
Yes, but in the world where Anabel is the better player, that version of Caul wins more than the other version loses!
"Perhaps, but I only care about the sliver of me with the worst odds. If I place the bet on Anabel, then I care only about the Caul in the world where she's worse. But if I place the bet on Zara, then I only care about the Caul in the world where she's better." Caul shrugs. "Sorry, there's just no way I can take the bet without making the least convenient world worse off, and I only care about what happens in the least convenient world."
And so our hapless bookie departs, frantically searching for someone to take free money, muttering about irrationality.
Caul captures another portion of the motivation for the MMEU rule: Caul only cares about the least convenient world. Of course, Caul only cares about the least convenient world given Caul's normal uncertainty: Caul cares about the Caul-sliver with the worst odds, not with the worst outcome. To see the difference, imagine that our bookie offered Caul a bet with 100:1 odds on Anabel. Caul would take this bet, because even in the world with the worst odds the expected Caul-value of this bet is 1¢ (99% of the time Zara wins and Caul loses $1, but 1% of the time Anabel wins and Caul wins $100).
An agent that only cared about the actual worst case (rather than only the worldpart with the worst odds) would refuse even this bet, worrying about scenarios where Anabel is a far superior player but loses anyway. This would be an agent in which all uncertainty is "Knightian", and this would turn Caul into a maximizer of minimum actual utility (instead of minimum expected utility), which is wildly impractical (and of dubious usefulness). This raises the uncomfortable question of how Caul decides which uncertainties denote actual worldparts, and which uncertainties denote "normal" uncertainty with which Caul may gamble in the traditional manner.
Perry and Caul show that it's possible for a Bayesian to act according to Sir Percy's preferences, but they are only able to do this because they believe that some of their uncertainty is not internal (caused by incomplete information) but rather external (actually represented in the environment). Perry believes that some of its uncertainty denotes places where Nature gets to choose how the world works after Perry chooses an action. Caul believes that some of its uncertainty actually denotes real worldparts. In both cases, these agents believe that their ambiguity is part of the world, and this is the mechanism by which they can display ambiguity aversion.
Bayesian ambiguity aversion
Perry and Caul help us answer our first question.
Can an agent with preferences about ambiguity be 'rational' in the Bayesian sense?
To this, we can now answer "yes". Ideal Bayesian agents can exhibit the preferences advocated by Sir Percy. The next question is, is this a sane way to act? A rational agent can exhibit ambiguity aversion, and humans seem to exhibit ambiguity aversion… so should we act like this?
Sir Percy advocates this decision rule. He's put forth some interesting arguments and this post has shown that such a viewpoint can be rational in a Bayesian sense. But a Bayesian can also have an anti-Laplacian prior, so that doesn't speak to this viewpoint's sanity. Is the MMEU rule a sane way for humans to act?
In short: No, for the same reason that Caul and Perry are mad. Their views are consistent, and their actions are rational (given their priors), but I am really glad that I don't have their priors. But that's a point I'll explore in more depth in the next (and final) post.
Comments sorted by top scores.