What Do We Mean By "Rationality"?

post by Eliezer Yudkowsky (Eliezer_Yudkowsky) · 2009-03-16T22:33:55.765Z · LW · GW · Legacy · 19 comments

I mean two things:

1. Epistemic rationality: systematically improving the accuracy of your beliefs.

2. Instrumental rationality: systematically achieving your values.

The first concept is simple enough. When you open your eyes and look at the room around you, you’ll locate your laptop in relation to the table, and you’ll locate a bookcase in relation to the wall. If something goes wrong with your eyes, or your brain, then your mental model might say there’s a bookcase where no bookcase exists, and when you go over to get a book, you’ll be disappointed.

This is what it’s like to have a false belief, a map of the world that doesn’t correspond to the territory. Epistemic rationality is about building accurate maps instead. This correspondence between belief and reality is commonly called “truth,” and I’m happy to call it that.¹

Instrumental rationality, on the other hand, is about steering reality—sending the future where you want it to go. It’s the art of choosing actions that lead to outcomes ranked higher in your preferences. I sometimes call this “winning.”

So rationality is about forming true beliefs and making decisions that help you win.

(Where truth doesn't mean “certainty,” since we can do plenty to increase the probability that our beliefs are accurate even though we're uncertain; and winning doesn't mean “winning at others' expense,” since our values include everything we care about, including other people.)

When people say “X is rational!” it’s usually just a more strident way of saying “I think X is true” or “I think X is good.” So why have an additional word for “rational” as well as “true” and “good”?

An analogous argument can be given against using “true.” There is no need to say “it is true that snow is white” when you could just say “snow is white.” What makes the idea of truth useful is that it allows us to talk about the general features of map-territory correspondence. “True models usually produce better experimental predictions than false models” is a useful generalization, and it’s not one you can make without using a concept like “true” or “accurate.”

Similarly, “Rational agents make decisions that maximize the probabilistic expectation of a coherent utility function” is the kind of thought that depends on a concept of (instrumental) rationality, whereas “It’s rational to eat vegetables” can probably be replaced with “It’s useful to eat vegetables” or “It’s in your interest to eat vegetables.” We need a concept like “rational” in order to note general facts about those ways of thinking that systematically produce truth or value—and the systematic ways in which we fall short of those standards.

As we’ve observed in the previous essays, experimental psychologists sometimes uncover human reasoning that seems very strange. For example, someone rates the probability “Bill plays jazz” as less than the probability “Bill is an accountant who plays jazz.” This seems like an odd judgment, since any particular jazz-playing accountant is obviously a jazz player. But to what higher vantage point do we appeal in saying that the judgment is wrong?

Experimental psychologists use two gold standards: probability theory, and decision theory.

Probability theory is the set of laws underlying rational belief. The mathematics of probability applies equally to “figuring out where your bookcase is” and “estimating how many hairs were on Julius Caesar’s head,” even though our evidence for the claim “Julius Caesar was bald” is likely to be more complicated and indirect than our evidence for the claim “there’s a bookcase in my room.” It’s all the same problem of how to process the evidence and observations to update one’s beliefs. Similarly, decision theory is the set of laws underlying rational action, and is equally applicable regardless of what one’s goals and available options are.
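As a rough illustration of "the same problem" in both cases, here is a minimal Python sketch of a single update by Bayes' rule. The function name bayes_update and every number in it are hypothetical, chosen only for illustration; nothing here is data about bookcases or about Julius Caesar.

```python
def bayes_update(prior: float, p_obs_given_h: float, p_obs_given_not_h: float) -> float:
    """Return P(H | observation) from a prior and two likelihoods (Bayes' rule)."""
    numerator = p_obs_given_h * prior
    evidence = numerator + p_obs_given_not_h * (1.0 - prior)
    return numerator / evidence


# Hypothetical numbers: seeing a bookcase is strong, direct evidence.
posterior_bookcase = bayes_update(prior=0.5, p_obs_given_h=0.95, p_obs_given_not_h=0.05)

# Hypothetical numbers: one written source mentioning baldness is weaker,
# more indirect evidence, but it goes through exactly the same function.
posterior_caesar = bayes_update(prior=0.3, p_obs_given_h=0.6, p_obs_given_not_h=0.2)

print(posterior_bookcase, posterior_caesar)
```

The two calls differ only in their inputs. The rule for combining prior and evidence is the same whether the evidence is direct or indirect.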

Let “P(such-and-such)” stand for “the probability that such-and-such happens,” and “P(A,B)” for “the probability that both A and B happen.” Since it is a universal law of probability theory that P(A) ≥ P(A,B), the judgment that P(Bill plays jazz) is less than P(Bill plays jazz, Bill is an accountant) is labeled incorrect.
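The law P(A) ≥ P(A,B) can be checked on any joint distribution. The Python sketch below uses a made-up distribution over the two facts about Bill; the probabilities, the helper prob, and the helper is_coherent are assumptions for illustration, not figures from any experiment.

```python
# Hypothetical joint distribution over (plays_jazz, is_accountant).
# The four probabilities are invented and sum to 1.
joint = {
    (True, True): 0.02,
    (True, False): 0.08,
    (False, True): 0.18,
    (False, False): 0.72,
}


def prob(event) -> float:
    """Sum the probability of every possible world in which `event` holds."""
    return sum(p for world, p in joint.items() if event(world))


def is_coherent(p_a: float, p_a_and_b: float) -> bool:
    """A pair of judgments obeys the conjunction rule iff P(A) >= P(A,B)."""
    return p_a >= p_a_and_b


p_jazz = prob(lambda w: w[0])
p_jazz_and_accountant = prob(lambda w: w[0] and w[1])

# Every world counted in P(A,B) is also counted in P(A), so this always holds.
print(is_coherent(p_jazz, p_jazz_and_accountant))

# A judgment like the one in the experiment: P(jazz) rated below P(jazz, accountant).
print(is_coherent(0.05, 0.10))
```

Whatever numbers go into the table, the worlds where Bill is a jazz-playing accountant are a subset of the worlds where Bill plays jazz, which is why the first check cannot fail.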

To keep it technical, you would say that this probability judgment is non-Bayesian. Beliefs that conform to a coherent probability distribution, and decisions that maximize the probabilistic expectation of a coherent utility function, are called “Bayesian.”
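To make "maximize the probabilistic expectation of a coherent utility function" concrete, here is a minimal Python sketch. The umbrella scenario, the probabilities, and the utility numbers are all hypothetical, and the names expected_utility and best_action are illustrative rather than standard.

```python
def expected_utility(outcomes) -> float:
    """Probability-weighted average utility over (probability, utility) pairs."""
    return sum(p * u for p, u in outcomes)


# Hypothetical decision: 30% chance of rain, with invented utilities per outcome.
actions = {
    "take umbrella": [(0.3, 5), (0.7, 8)],     # dry if it rains, mildly encumbered if not
    "leave umbrella": [(0.3, -10), (0.7, 10)],  # soaked if it rains, unencumbered if not
}

# Choose the action whose expected utility is highest.
best_action = max(actions, key=lambda name: expected_utility(actions[name]))

for name, outcomes in actions.items():
    print(name, expected_utility(outcomes))
print("chosen:", best_action)
```

This is only a toy. As the essay goes on to say, the full formalism is computationally intractable on most real-world problems, so a sketch like this shows what the standard says, not a procedure anyone can carry out in general.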

I should emphasize that this isn't the notion of rationality that's common in popular culture. People may use the same string of sounds, “ra-tio-nal,” to refer to “acting like Mr. Spock of Star Trek” and “acting like a Bayesian”; but this doesn't mean that acting Spock-like helps one hair with epistemic or instrumental rationality.²

All of this does not quite exhaust the problem of what is meant in practice by “rationality,” for two major reasons:

First, the Bayesian formalisms in their full form are computationally intractable on most real-world problems. No one can actually calculate and obey the math, any more than you can predict the stock market by calculating the movements of quarks.

This is why there is a whole site called “Less Wrong,” rather than a single page that simply states the formal axioms and calls it a day. There’s a whole further art to finding the truth and accomplishing value from inside a human mind: we have to learn our own flaws, overcome our biases, prevent ourselves from self-deceiving, get ourselves into good emotional shape to confront the truth and do what needs doing, et cetera, et cetera.

Second, sometimes the meaning of the math itself is called into question. The exact rules of probability theory are called into question by, e.g., anthropic problems in which the number of observers is uncertain. The exact rules of decision theory are called into question by, e.g., Newcomblike problems in which other agents may predict your decision before it happens.³

In cases where our best formalizations still come up short, we can return to simpler ideas like “truth” and “winning.” If you are a scientist just beginning to investigate fire, it might be a lot wiser to point to a campfire and say “Fire is that orangey-bright hot stuff over there,” rather than saying “I define fire as an alchemical transmutation of substances which releases phlogiston.” You certainly shouldn’t ignore something just because you can’t define it. I can't quote the equations of General Relativity from memory, but nonetheless if I walk off a cliff, I'll fall. And we can say the same of cognitive biases and other obstacles to truth—they won't hit any less hard if it turns out we can't define compactly what “irrationality” is.

In cases like these, it is futile to try to settle the problem by coming up with some new definition of the word “rational” and saying, “Therefore my preferred answer, by definition, is what is meant by the word ‘rational.’ ” This simply raises the question of why anyone should pay attention to your definition. I’m not interested in probability theory because it is the holy word handed down from Laplace. I’m interested in Bayesian-style belief-updating (with Occam priors) because I expect that this style of thinking gets us systematically closer to, you know, accuracy, the map that reflects the territory.

And then there are questions of how to think that seem not quite answered by either probability theory or decision theory—like the question of how to feel about the truth once you have it. Here, again, trying to define “rationality” a particular way doesn’t support an answer, but merely presumes one.

I am not here to argue the meaning of a word, not even if that word is “rationality.” The point of attaching sequences of letters to particular concepts is to let two people communicate—to help transport thoughts from one mind to another. You cannot change reality, or prove the thought, by manipulating which meanings go with which words.

So if you understand what concept I am generally getting at with this word “rationality,” and with the sub-terms “epistemic rationality” and “instrumental rationality,” we have communicated: we have accomplished everything there is to accomplish by talking about how to define “rationality.” What’s left to discuss is not what meaning to attach to the syllables “ra-tio-na-li-ty”; what’s left to discuss is what is a good way to think.

If you say, “It’s (epistemically) rational for me to believe X, but the truth is Y,” then you are probably using the word “rational” to mean something other than what I have in mind. (E.g., “rationality” should be consistent under reflection—“rationally” looking at the evidence, and “rationally” considering how your mind processes the evidence, shouldn’t lead to two different conclusions.)

Similarly, if you find yourself saying, “The (instrumentally) rational thing for me to do is X, but the right thing for me to do is Y,” then you are almost certainly using some other meaning for the word “rational” or the word “right.” I use the term “rationality” normatively, to pick out desirable patterns of thought.

In this case—or in any other case where people disagree about word meanings—you should substitute more specific language in place of “rational”: “The self-benefiting thing to do is to run away, but I hope I would at least try to drag the child off the railroad tracks,” or “Causal decision theory as usually formulated says you should two-box on Newcomb’s Problem, but I’d rather have a million dollars.”

In fact, I recommend reading back through this essay, replacing every instance of “rational” with “foozal,” and seeing if that changes the connotations of what I’m saying any. If so, I say: strive not for rationality, but for foozality.

The word “rational” has potential pitfalls, but there are plenty of non-borderline cases where “rational” works fine to communicate what I’m getting at. Likewise “irrational.” In these cases I’m not afraid to use it.

Yet one should be careful not to overuse that word. One receives no points merely for pronouncing it loudly. If you speak overmuch of the Way, you will not attain it.

¹ For a longer discussion of truth, see “The Simple Truth” at the very end of this volume.

² The idea that rationality is about strictly privileging verbal reasoning over feelings is a case in point. Bayesian rationality applies to urges, hunches, perceptions, and wordless intuitions, not just to assertions.

I gave the example of opening your eyes, looking around you, and building a mental model of a room containing a bookcase against the wall. The modern idea of rationality is general enough to include your eyes and your brain’s visual areas as things-that-map, and to include instincts and emotions in the belief-and-goal calculus.

³ For an informal statement of Newcomb’s Problem, see Jim Holt, “Thinking Inside the Boxes,” Slate, 2002, http://www.slate.com/articles/arts/egghead/2002/02/thinkinginside_the_boxes.single.html.

19 comments

Comments sorted by top scores.

comment by Raemon · 2018-10-15T21:10:18.339Z · LW(p) · GW(p)

Note: this post originally appeared in a context without comments on Overcoming Bias. Old comments on this post are over here.

comment by Valentin2026 (Just Learning) · 2021-05-14T20:11:16.550Z · LW(p) · GW(p)

How should we deal with the cases when epistemic rationality contradicts instrumental rationality? For example, we may want to use the placebo effect, because one of our values is that healthy is better than sick, and less pain is better than more pain. But the placebo effect depends on our believing that the pill is a working medicine, which is false. Is there any way to satisfy both epistemic and instrumental rationality?

Replies from: Jozdien, EniScien, George Noah Fitzgerald

↑ comment by Jozdien · 2021-05-14T20:34:25.604Z · LW(p) · GW(p)

It depends from case to case, I would think. There are instances when you're most probably benefited by trading off epistemic rationality for instrumental, but in cases where it's too chaotic to get a good estimate and the tradeoff seems close to equal, I would personally err on the side of epistemic rationality. Brains are complicated; forcing a placebo effect might, for limited short-term gain, have ripple effects across your psyche, like an increased tendency to shut down that voice in your head that talks when you know your belief is wrong on some level (very speculative example).

Replies from: Just Learning

↑ comment by Valentin2026 (Just Learning) · 2021-05-23T23:19:29.251Z · LW(p) · GW(p)

Thank you, wonderful series!

↑ comment by EniScien · 2022-06-02T16:54:41.624Z · LW(p) · GW(p)

It seems to me that this is not a contradiction of two rationalities. Rather, it is similar to the resonance of doubt. If a placebo works when you believe in it, that means that if you believe in it, it will be true. Here you need a reverse example, when if you believe that something is true, then it becomes false. (Believing that something is safe again won't work, since you just need to not act more carelessly based on the safety of something, which is just a matter of instrumental rationality)

Replies from: matheus-popst

↑ comment by MP (matheus-popst) · 2022-07-16T03:44:34.530Z · LW(p) · GW(p)

If you believe that the placebo works, it works. You're right in believing it works.
If you don't believe that the placebo works, it doesn't work. You're right in believing it doesn't work.

If you believe that the sky is blue, you're right.
If you believe that the sky is green, it's still blue, you're wrong.

Truths that involve humans have some amount of reflexivity.

↑ comment by Peter Pehlivanov (George Noah Fitzgerald) · 2022-05-18T17:06:51.735Z · LW(p) · GW(p)

I'd say you shouldn't force yourself to believe something (epistemic rationality) to achieve a goal (instrumental rationality). This is because, in my view, human minds are addicted to feeling consistent, so it'd be very difficult (i.e., resource expensive) to believe a drug works when you know it doesn't.

What does it even mean to believe something is true when you know it's false? I don't know. Whatever it means, it'd have to be a psychological thing rather than an epistemological one. My personal recommendation is to only believe things that are true. This is because the modern environment we live in generally benefits rational behavior based on knowledge anyway, so the problem doesn't need to surface.

comment by Nick Timebreak (nick-timebreak) · 2022-11-05T12:37:17.850Z · LW(p) · GW(p)

The essay reminds me of the book Language in Thought and Action by Samuel Hayakawa. The author also used the map and territory metaphor in the book.

Replies from: Richard_Kennaway

↑ comment by Richard_Kennaway · 2022-11-05T19:51:22.541Z · LW(p) · GW(p)

Eliezer has elsewhere mentioned it as having been an influence in his youth. The saying "the map is not the territory" originated with Korzybski, and Hayakawa's book is a popularisation of his work.

Replies from: nick-timebreak

↑ comment by Nick Timebreak (nick-timebreak) · 2022-11-06T05:01:17.121Z · LW(p) · GW(p)

Thank you for the reference. I just stumbled onto this website and found the essays interesting. As a Chinese reader, I find there is not much content of this kind on the Chinese web. I am really lucky to enjoy the ideas while improving my English.

Replies from: Richard_Kennaway

↑ comment by Richard_Kennaway · 2022-11-06T09:03:49.112Z · LW(p) · GW(p)

Welcome! There's a monthly open thread where newcomers are invited to introduce themselves.

comment by It is (james.seng.hpa@gmail.com) · 2024-05-18T14:26:07.423Z · LW(p) · GW(p)

You cannot change reality, or prove the thought, by manipulating which meanings go with which words.

The same word can mean many things, words that have convergent evolution in their sounding but different meanings are spelled differently for a reason. Propaganda is manipulating the meaning of things, this is often done with slogans and words. Lies are the changed meaning of things to shape reality. Reality is a perception from a particular perspective as in the anthropic problems, it is relational not necessarily objective.

Creating a definition can be done, and is at times useful to make sense of and verify the likeness of maps and territory contained in other people's heads. Such to confirm the maps of language and words are congruent.

If things can not be defined the definition is left up to the individual and open to interpretation. The utility of this experiential approach allows individuals to engender their own ideas. When reading around a philosophical work and engaging with the material you build a representation of its meaning. As you do every time you read or write a word. Even where philosophical works have definitions there is often further assumed knowledge to decode and grasp the work in its entirety. In both cases where there is a formal definition, examples and implementation of its usage, this adds meaning and information.

Where the probability of controversy is high and the ability to quell controversy is low, the probability of formal defence of ideas is reduced. There is a ceiling but unto time to which, things can be defended, defined or explained.

We need not provide and defend formal definitions, a definition is defined through usage. If the probability of a definition causing controversy is high and defining it has low utility the importance of a formal definition is decreased. Leaving things in ambiguity or with multiple degrees of interpretation limits reprisals.

If you don't have anything nice to say don't allow it to take shape, to become definitive. This is besides the point that communication can still transmit useful information.

The fact that there is no definition is the definition and is evidence for the definition. You can define things, but in the experiential sense what can you do with information that is wrong to steel-man it, to give it utility and make it useful.

If the benefit of a definition providing epistemic accuracy is lower than the instrumental utility of not defining, why define it?

Ultimately if we are to become rational the worst way to brainstorm is to have an anchoring effect around a definition of rationality that also causes controversy. As in the Stability–instability paradox, not naming something creates more names not of the thing in actuality but ideas around it. We are the Blind men yet but touching the elephant that is rationality.

comment by Bruno Vieira (bruno-vieira) · 2024-04-18T23:44:02.331Z · LW(p) · GW(p)

you should substitute more specific language in place of “rational”: “The self-benefiting thing to do is to run away, but I hope I would at least try to drag the child off the railroad tracks,”

Wouldn't it be correct to say that it would be 'instrumentally rational' to run away in this case? It sounds rational to me, as far as you 'winning' means you 'surviving'.

Replies from: AliceZ

↑ comment by ZY (AliceZ) · 2024-10-12T17:07:03.055Z · LW(p) · GW(p)

I think by winning, he meant: "art of choosing actions that lead to outcomes ranked higher in your preferences", though I don't completely agree with this word choice of "winning" which could be ambiguous/causing confusion.

A bit unrelated, but more of a general comment on this: I think people generally have unconscious preferences, and knowing and acknowledging these before weighing preferences against each other is very important, even if some preferences are short term.

comment by lafocade · 2023-08-04T09:18:55.347Z · LW(p) · GW(p)

Is the last sentence rational?

The one that says "If you speak overmuch of the Way, you will not attain it."

Replies from: eric-covert

↑ comment by Eric Covert (eric-covert) · 2023-09-18T03:05:31.990Z · LW(p) · GW(p)

This is a reference to Taoism (the tao = the Way). I believe it is a different approach to the tenet I've heard expressed as "The Tao that can be explained is not the true Tao". I believe the reference is meant to remind us that the point here is to end up performing less wrong rational thinking, not just talking about it.

comment by [deleted] · 2020-07-12T17:14:33.083Z · LW(p) · GW(p)

great post, just wanted to point out a typo here: "I cant quote the equations of General Relativity from memory, but nonetheless if I walk off a cliff, Ill fall. "

it should be "I'll fall". good work otherwise.

Replies from: habryka4

↑ comment by habryka (habryka4) · 2020-07-12T19:17:13.791Z · LW(p) · GW(p)

(Fixed, thank you!)

comment by ToddStoddard · 2019-02-19T18:08:27.940Z · LW(p) · GW(p)

Nice discussion. Thanks for putting this together. I learned something about Epistemic rationality vs Instrumental rationality.

The bit about the sky being blue or green seems to beg the question of a justification for objective truth as championed by St Augustine and Leibniz as opposed to arguments for subjective reality as championed by the Cynics and Skeptics and, more recently, the Frankfurt School. One could make the case that the sky appears green to one but blue to another.

This topic comes up in many places throughout the history of thought. I'm actually working on a post for my blog exploring that at www.SimplyUrban.Org.
