Underspecified Probabilities: A Thought Experiment

lunatic_at_large

Underspecified Probabilities: A Thought Experiment

post by lunatic_at_large · 2023-10-04T22:25:07.458Z · LW · GW · 4 comments

4 comments

(Inspired by https://stats.stackexchange.com/questions/175153/approximating-pa-b-c-using-pa-b-pa-c-pb-c-and-pa-pb-pc)

Imagine you're working at an electronics manufacturing company. There's a type of component your company needs for an upcoming production process. Your company can either produce these components in-house or it can source them from a vendor. Your job is to determine whether or not to produce the component in-house. Your team has been budgeted $2,000,000 for procuring as many working components as possible. You won't know whether a part works until the downstream production process is up and running later in the year, but you need to make a decision on part procurement now. Let's say your company agrees to pay you a bonus next year proportional to how many parts turned out to be functional and your personal utility function is linear in this bonus and doesn't depend on anything else.

Using the in-house machine, your $2,000,000 budget will get you 1,000,000 parts which might or might not work. A part will work if and only if it has dimples on the bismuth layer (event ), it has an osmium layer between 40 and 60 microns thick (event $B$ ), and the misalignment between the polonium and tungsten layers is less than 20 microns (event $C$ ). Thus the expected number of working parts is $1, 000, 000 P (A \cap B \cap C)$ . The only way to determine whether a part satisfies properties $A$ , $B$ , and $C$ is by doing destructive testing: you have to physically cut it up. Because of the way this cutting works, you can't measure $C$ if you've already measured $A$ and $B$ on a part, you can't measure $B$ if you've already measured $A$ and $C$ on a part, and you can't measure $A$ if you've already measured $B$ and $C$ on a part. The only way to get data on $A$ , $B$ , and $C$ together would be to send a part downstream into production but the production process doesn't exist yet.

Through destructive testing, you've been able to determine that $P (A) = 0.45$ , $P (B) = 0.6$ , $P (C) = 0.35$ , $P (A \cap B) = 0.3$ , $P (A \cap C) = 0.15$ , and $P (B \cap C) = 0.15$ (you've tested such a truly unfathomable number of parts that you've gotten the uncertainty on these numbers down arbitrarily small).^[1] Otherwise, the machine you're using to create these parts is a black box. The company that sold you the machine has since gone defunct and you can't find anyone else with prior experience working with this machine.

On the other hand, for $2,000,000 the external vendor is willing to give you $⌊ 1, 000, 000 x ⌋$ parts for some specified $x \in [0, 1]$ . These parts have been used in many other production processes and are known to never, ever fail.

If $x = 0$ , you should probably go with the in-house approach. If $x = 1$ , you should probably source externally. What is the value of $x$ below which you stay in-house but above which you go with the vendor?^[2]

I'm working on a project where I'm going to need to estimate probabilities of combinations of events like this with incomplete information. I asked some friends about my problem and they said it was ill-posed in the abstract, so I wanted to create a real-world thought experiment where you can't wave the problem away with "There is no proper answer." If you were in this position, you'd have to actually make a choice.

This problem reminds me of the discussion around the presumption of independence. I think that a good philosophical justification for assuming independence in the absence of evidence to the contrary should be able to be generalized to providing a method for finding the "most independent" assignment of probabilities in settings where we know that actual independence is out of the question.

^{^}
No pair of these events is independent. To see that there exist two (and by taking convex linear combinations, infinitely many) possible values of $P (A \cap B \cap C)$ from this provided information, note that there are two valid probability mass functions, $P_{1}$ and $P_{2}$ , on truth assignments to $A$ , $B$ , $C$ which differ on their value of $P (A \cap B \cap C)$ :
$A$ $B$ $C$ $P_{1}$ $P_{2}$
$⊥$ $⊥$ $⊥$ 0.15 0.14
$⊥$ $⊥$ $⊤$ 0.1 0.11
$⊥$ $⊤$ $⊥$ 0.2 0.21
$⊥$ $⊤$ $⊤$ 0.1 0.09
$⊤$ $⊥$ $⊥$ 0.05 0.06
$⊤$ $⊥$ $⊤$ 0.1 0.09
$⊤$ $⊤$ $⊥$ 0.25 0.24
$⊤$ $⊤$ $⊤$ 0.05 0.06
^{^}
Technically there could be an interval of width $\frac{1}{1, 000, 000}$ where you're indifferent.

4 comments

Comments sorted by top scores.

comment by johnswentworth · 2023-10-05T01:02:00.139Z · LW(p) · GW(p)

You want the widget problem in chapter 14 of Jaynes' Probability Theory: The Logic Of Science; it is extremely similar to the problem you present. Long story short: express each of the known probabilities as expectations of indicator variables (e.g. ), then maximize entropy subject to the constraints given by those expectations. Jaynes covers a bunch of conceptual arguments for why that's a sensible procedure to follow.

To do better than that, the next step would be to look for any structure/pattern in the known probabilities and exploit those - e.g. if the known probabilities approximately factor over a certain Bayes net, then a natural guess is that the unknown probabilities will too, which may allow backing out of the unknown probabilities.

Replies from: lunatic_at_large

↑ comment by lunatic_at_large · 2023-10-05T23:35:50.748Z · LW(p) · GW(p)

Wow, this is exactly what I was looking for! Thank you so much!

comment by AK1089 · 2023-10-05T01:01:03.215Z · LW(p) · GW(p)

These are the probabilities of each state:

State	Probability
	$x$
$A \cap B \cap \neg C$	$0.3 - x$
$A \cap \neg B \cap C$	$0.15 - x$
$A \cap \neg B \cap \neg C$	$x$
$\neg A \cap B \cap C$	$0.15 - x$
$\neg A \cap B \cap \neg C$	$0.15 + x$
$\neg A \cap \neg B \cap C$	$0.05 + x$
$\neg A \cap \neg B \cap \neg C$	$0.2 - x$

with $x$ being the probability of all three parts of a component being fine. (Obviously, $x \leq 0.15$ , because $P (\neg A \cap B \cap C) \geq 0$ .)

This is not enough information to solve for x, of course, but note that $P (A \cap B \cap C) = P (A \cap \neg B \cap \neg C)$ . Note also that $P (A) P (B) \approx P (A \cap B)$ and $P (A) P (C) \approx P (A \cap C)$ (ie A is not strongly correlated or anti-correlated with $B$ or $C$ ). However, $P (B) P (C) > P (B \cap C)$ by quite a long way: $B$ is fairly strongly anti-correlated with $C$ .

Now here's the estimation bit, I suppose: given that $A$ holds, we'd probably expect a similar distribution of probabilities across values of $B$ and $C$ , given that $A$ is not (strongly) correlated with $B$ or $C$ . So $P (B \cap C | A) \approx P (B \cap C)$ etc. This resolves to $x \approx P (A) \cdot P (B \cap C) = 0.45 \cdot 0.15 = 0.0675$ .

State	Probability
$A \cap B \cap C$	$0.0675$
$A \cap B \cap \neg C$	$0.2325$
$A \cap \neg B \cap C$	$0.0825$
$A \cap \neg B \cap \neg C$	$0.0675$
$\neg A \cap B \cap C$	$0.0825$
$\neg A \cap B \cap \neg C$	$0.2175$
$\neg A \cap \neg B \cap C$	$0.1175$
$\neg A \cap \neg B \cap \neg C$	$0.1325$

This seems... not super unreasonable? At least, it appears slightly better than going for the most basic method, which is $P (A \cap B \cap C) \leq 0.15$ , so split the difference and say it's $0.075$ or thereabouts.

The key assumption here is that "if $A$ is pretty much uncorrelated with $B$ and $C$ , it's probably uncorrelated with the conjunction $B \cap C$ . This is not strictly-always true as a matter of probability theory, but we're making assumptions on incomplete information based on a real-world scenario, so I'd say this skewing our guess by a factor of 10% from the most naive approach is probably helpful-on-net.

This means in expectation, we guess the in-house machine to produce $0.0675 \cdot 1000000 = 67500$ good widgets. I'd take that many from the Super Reliable Vendor if offered, but if they were offering less than that I'd roll the dice with the Worryingly Inconsistent In-House Machine. That is, I'm indifferent at $x = 0.0675$ .

comment by tbadalov · 2023-10-05T04:23:56.879Z · LW(p) · GW(p)

If your main intent is just to make the right decision for the company, then my answer is buy those parts unless your main goal is doing math. You don't need statistics to elaborate that, common sense is enough.

Since the manufacturers already produces more on top of those components, you already have enough to risk and test after manufacturing. Once your product works, go for optimisations and consider producing parts in-house.

$A$	$B$	$C$	$P_{1}$	$P_{2}$
$⊥$	$⊥$	$⊥$	0.15	0.14
$⊥$	$⊥$	$⊤$	0.1	0.11
$⊥$	$⊤$	$⊥$	0.2	0.21
$⊥$	$⊤$	$⊤$	0.1	0.09
$⊤$	$⊥$	$⊥$	0.05	0.06
$⊤$	$⊥$	$⊤$	0.1	0.09
$⊤$	$⊤$	$⊥$	0.25	0.24
$⊤$	$⊤$	$⊤$	0.05	0.06

Underspecified Probabilities: A Thought Experiment

Contents

4 comments