Even so, at some level of wealth you'll leave more behind by saving up the premium and having your children inherit the compound interest instead. That point is found through the Kelly criterion.
(The Kelly criterion is indeed equivalent to maximising a concave (logarithmic) utility, but the insurance company is so wealthy that individual life insurance payouts sit on the nearly linear early part of its utility curve, whereas for most individuals they do not.)
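To make the "maximising expected log money" framing concrete, here is a minimal sketch with made-up numbers (the wealth, premium, and loss probability are all assumptions for illustration, not figures from the post):

```python
from math import log

# Hypothetical numbers: a person with $60,000 of wealth can insure a
# $50,000 asset that is destroyed with probability 1 % per year, for
# a $1,000 annual premium.
wealth = 60_000
loss = 50_000
p_loss = 0.01
premium = 1_000

# Expected log wealth if we skip insurance: keep the premium but bear
# the risk of the loss ourselves.
e_log_uninsured = (1 - p_loss) * log(wealth) + p_loss * log(wealth - loss)

# Expected log wealth if we insure: the premium is gone for sure, but
# the loss can no longer touch us.
e_log_insured = log(wealth - premium)

print(f"uninsured: {e_log_uninsured:.5f}")
print(f"insured:   {e_log_insured:.5f}")
# Whichever is larger is the E-log-X recommendation.
```

With these particular numbers the insured branch wins; scale wealth up enough and the ranking flips, which is exactly the point about the premium eventually being better left to compound for one's heirs.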
I just wouldn't use the word "Kelly", I'd talk about "maximizing expected log money".
Ah, sure. Dear child has many names. Another common name for it is "the E log X strategy", but that tends not to be as recognisable to people.
you say "this is how to mathematically determine if you should buy insurance".
Ah, I see your point. That is true. I'd argue this isolated E log X approach is still better than vibes, but I'll think about ways to rephrase to not make such a strong claim.
what do you mean when you say this is what Kelly instructs?
Kelly allocations only require taking actions that maximise the expectation of the joint distribution of log-wealth. It doesn't matter how many bets are used to construct that joint distribution, nor when during the period they were entered.
If you don't know at the start of the period which bets you will enter during the period, you have to make a forecast, as with anything unknown about the future. But this is not a problem within the Kelly optimisation, which assumes the joint distribution of outcomes already exists.
This is also how correlated risk is worked into a Kelly-based decision.
Simultaneous (correlated or independent) bets are only a problem in so far as we fail to construct a joint distribution of outcomes for those simultaneous bets. Which, yeah, sure, dimensionality makes itself known, but there's no fundamental problem there that isn't solved the same way as in the unidimensional case.
Edit: In plainer terms, Kelly requires that, for each potential combination of simultaneous bets you might enter during the period, you estimate the probability distribution of wealth outcomes after the period has passed (and this distribution should account for any correlations). Given that, Kelly tells you to choose the set of bets (and the size of each) that maximises the expected log of wealth outcomes.
Kelly is a function of actions and their associated probability distributions of outcomes. The actions can be complex compound actions such as entering simultaneous bets -- Kelly does not care, as long as it gets its outcome probability distribution for each action.
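As a brute-force sketch of this (every number here is invented), here is a Kelly allocation computed directly over the joint distribution of two positively correlated, simultaneous even-money bets:

```python
from math import log
from itertools import product

# Two simultaneous, positively correlated even-money bets, described
# purely by the joint distribution of their outcomes. +1 means the
# bet pays its stake, -1 means the stake is lost.
joint = {
    (+1, +1): 0.40,  # both win
    (+1, -1): 0.15,  # only bet 1 wins
    (-1, +1): 0.15,  # only bet 2 wins
    (-1, -1): 0.30,  # both lose
}

def e_log_wealth(f1, f2):
    """Expected log of end-of-period wealth, starting from 1 and
    staking fractions f1 and f2 on the two bets."""
    return sum(p * log(1 + f1 * r1 + f2 * r2)
               for (r1, r2), p in joint.items())

# Kelly: pick the pair of fractions maximising expected log wealth.
grid = [i / 100 for i in range(50)]
best = max(((f1, f2) for f1, f2 in product(grid, grid) if f1 + f2 < 1),
           key=lambda fs: e_log_wealth(*fs))
print("best fractions:", best)
```

With these numbers the search lands on equal, small fractions for both bets – smaller than the 10 % that the naive single-bet Kelly formula (2p − 1, with marginal p = 0.55) would suggest, precisely because the correlation is baked into the joint distribution.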
I'm confused by the calculator.
The probability should be given as 0.03 -- that might reduce your confusion!
Kelly is derived under a framework that assumes bets are offered one at a time.
If I understand your point correctly, I disagree. Kelly instructs us to choose the course of action that maximises log-wealth in period t+1 assuming a particular joint distribution of outcomes. This course of action can by all means be a complicated portfolio of simultaneous bets.
Of course, the insurance calculator does not offer you the interface to enter a periodful of simultaneous bets! That takes a dedicated tool. The calculator can only tell you the ROI of insurance; it does not compare this ROI to alternative, more complex portfolios which may well outperform the insurance alone.
If you get caught in a flood your whole neighborhood probably does too
This is where reinsurance and other non-traditional instruments of risk trading enter the picture. Your insurance company can offer flood insurance because they insure their portfolio with reinsurers, or hedge with catastrophe bonds, etc.
The net effect of the current practices of the industry is that fire insurance becomes slightly more expensive to pay for flood insurance.
I have a hobby horse that I think people misunderstand the justifications for Kelly, and my sense is that you do too
I don't think I disagree strongly with much of what you say in that article, although I admit I haven't read it that thoroughly. It seems like you're making three points:
- Kelly is not dependent on log utility -- we agree.
- Simultaneous, independent bets lower the risk, and applying the Kelly criterion properly to that situation results in greater allocations than the common, naive application -- we agree.
- If one donates one's winnings, then one's bets no longer compound and the expected profit is a better guide than expected log wealth -- we agree.
In A World of Chance, Brenner, Brenner, and Brown look at this same question from a historic perspective, and (IIRC) conclude that gambling is about as damaging as alcohol, both for individuals and society. In other words, it should be legal (it gives the majority a relatively safe good time) but somewhat controlled (some cannot handle it and then it is very bad).
Do these more recent numbers corroborate that comparison to alcohol?
Oh, these are good objections. Thanks!
I'm inclined to 180 on the original statements there and instead argue that predictive modelling works because, as Pearl says, "no correlation without causation". Then an important step when basing decisions on predictive modelling is verifying that the intervention has not cut off the causal path we depended on for decision-making.
Do you think that would be closer to the truth?
The Demon King donned a mortal guise, bought shares in “The Demon King will attack the Frozen Fortress”, and then attacked the Frozen Fortress.
I'm curious: didn't the market work exactly as intended here? I mean, it helped them anticipate the Demon King’s next moves – it's not the market's fault that they couldn't convert foresight into operational superiority.
The King effectively sold good information on his battle plans; he voluntarily leaked military secrets against pay. The Citadel does not have to employ a spy network, because the King spies for them. This should be kind of a good deal, right?
However I also do frequently spend more time on close decisions. I think this can be good praxis. It is wasteful in the moment, but going into detail on close decisions is a great way to learn how to make better decisions. So in any decision where it would be great to improve your algorithm, if it is very close, you might want to overthink things for that reason.
In my experience, the more effective way to learn from close decisions is to just pick one alternative and then study the outcome and overthink the choice, rather than deliberate harder before choosing. This is related to what Cedric Chin describes in Action Produces Information: by going faster through close decisions, we both have more information about the consequences revealed to us, and we can run more experiments in parallel.
That said, I am very hardcore about coinflipping even not-so-close decisions, and made a tool for it.
Thanks for taking the time to dive into this. I've spent the past few evenings iterating on a forecasting bot while doing embarrassingly little research myself[1], and it seems like I have stumbled into the same approach as Five Thirty Nine, and my bot has the exact same sort of problems. I'll write more later about why I think some of those problems are not as big as they may seem.
But your article also gave me some ideas that might lead to improvements. Thanks!
[1]: In this case, I prioritise the two weeks in the lab over the hour in the library. I'm doing it not to make a good forecasting bot but to learn the APIs involved.
That is, confounding could go both ways here; the effect could be greater than it appears, rather than less.
Absolutely, but if we assume the null hypothesis until proven otherwise, we will prefer to think of confounding as creating an effect that is not there, rather than subduing an even stronger effect.
I'll reanalyse that way and post results, if I remember.
Yes, please do! I suspect (60 % confident maybe?) the effect will still be at least a standard error, but it would be nice to know.
I made a script run in the background on my PC, something like...
Ah, bummer! I also have this problem solved for computer time, and I was hoping you had done something for smartphone carriage.
(Note, by the way, that a uniformly random delay is not as surprising as an exponentially distributed delay. This probably does not matter for your use case, and you might already know all of that...)
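To sketch the difference (the 45-minute mean is an assumed parameter, not from the original script): an exponential inter-prompt delay is memoryless, so having already waited a while tells you nothing about how soon the next prompt comes, whereas with a uniform delay a prompt becomes "due" late in the interval.

```python
import random

rng = random.Random(42)
mean_delay_minutes = 45  # assumed target mean, purely illustrative

def next_delay_uniform():
    # Uniform on (0, 2 * mean): same mean, but predictable -- the
    # longer you have waited, the sooner the prompt must arrive.
    return rng.uniform(0, 2 * mean_delay_minutes)

def next_delay_exponential():
    # expovariate takes the rate, i.e. 1 / mean. Memoryless: the
    # remaining wait is always distributed the same, regardless of
    # how long you have already waited.
    return rng.expovariate(1 / mean_delay_minutes)

delays = [next_delay_exponential() for _ in range(100_000)]
print(sum(delays) / len(delays))  # close to the 45-minute mean
```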
Many of the existing answers seem to confuse model and reality.
In terms of practical prediction of reality, it would always be a mistake to emit a 0 or 1, because there's always that one-in-a-billion chance that our information is wrong – however vivid it seems at the time. Even if you have secretly looked at the hidden coin and seen clearly that it landed on heads, 99.999 % is a more accurate forecast than 100 %. It could have landed on aardvarks and masqueraded as heads – not very likely, but a possibility. Or you confabulated the memory of seeing the coin from a different coin you saw a week ago – also not so likely, but it happens. Or you mistook tails for heads – which presumably happens every now and then.
When it comes to models, though, probabilities of 0 and 1 show up all the time. Getting a 7 when tossing a d6 with the standard dice model simply does not happen, by construction. Adding two and three and getting five under regular field arithmetic happens every time. We can argue whether the language of probability is really the right tool for those types of questions, but taking a non-normative stance, it is reasonable for someone to ask those questions phrased in terms of probabilities, and then the answers would be 0 % and 100 % respectively.
These probabilities also show up in limits and arguments of general tendency. When a coin is tossed, the probability of getting only tails is 0 % as long as you keep tossing whenever you get tails. In a random walk, the probability of eventually crossing the origin is 100 %. When throwing a d6 for long enough, the mean value will end up within the range 3-4 with probability 100 %.
These latter two paragraphs describe things that apply only to our models, not to reality, but they can serve as a useful mental shortcut as long as one is careful about applying them blindly.
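The d6 claim lends itself to a quick simulation – a sanity check rather than a proof, since probability-1 convergence is an asymptotic statement about the model:

```python
import random

# Roll a fair d6 many times and check that the running mean has
# settled into the 3-4 range (it converges to 3.5 in the model).
rng = random.Random(0)
rolls = [rng.randint(1, 6) for _ in range(100_000)]
mean = sum(rolls) / len(rolls)
print(mean)
```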
This analysis suffers from a fairly clear confounder: since you are basing the data on which days you actually listened to music, there might be a common antecedent that both improves your mood and causes you to listen to music. As a silly example, maybe you love shopping for jeans, and clothing stores tend to play music, so your mood will, on average, be better on the days you hear music for this reason alone.
An intention-to-treat approach, where you make the random booleans the explanatory variable, would be better – less biased and less subject to confounding. It would also give you less statistical power, but such is the cost of avoiding false conclusions. You may need to run the experiment for longer to compensate.
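Here is a sketch of that analysis on synthetic data (every number – effect size, compliance rate, noise – is invented for illustration): mood is compared by random *assignment*, not by what actually happened.

```python
import random

rng = random.Random(1)

# Simulate 1,000 days. Each day gets a random assignment; compliance
# is imperfect; an assumed true effect of music lifts mood by 0.5.
assigned, mood = [], []
for _ in range(1_000):
    assign_music = rng.random() < 0.5               # the random boolean
    listened = assign_music and rng.random() < 0.8  # imperfect compliance
    mood_today = 5.0 + (0.5 if listened else 0.0) + rng.gauss(0, 1)
    assigned.append(assign_music)
    mood.append(mood_today)

# Intention-to-treat estimate: difference in mean mood by assignment.
treat = [m for a, m in zip(assigned, mood) if a]
ctrl = [m for a, m in zip(assigned, mood) if not a]
itt_effect = sum(treat) / len(treat) - sum(ctrl) / len(ctrl)
print(itt_effect)
```

The assignment-based estimate is diluted toward 0.8 × 0.5 = 0.4 by non-compliance – that is the bias-for-power trade the comment describes.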
It appears that listening to music, in the short-term: [...] makes earworms play in my mind for slightly less of the time
Whenever I suffer from an earworm, my solution has for a long time been to just play and listen to that song once, sometimes twice. For some reason, this satisfies my brain and it drops it. Still counter-intuitive, but you might want to try it.
On a completely separate note:
Both response variables were queried by surprise, 0 to 23 times per day (median 6), constrained by convenience.
How was this accomplished, technically? I've long wanted to do similar things but never bothered to look up a good way of doing it.
If Q, then anything follows. (By the Principle of Explosion, a false statement implies anything.) For example, Q implies that I will win $1 billion.
I'm not sure even this is the case.
Maybe there's a more sophisticated version of this argument, but at this level we only know that the implication Q ⇒ ($1 billion) is true, not that the consequent is true. If Q is false, the implication being true says nothing about the $1 billion.
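The point is visible in the full truth table for material implication:

```python
# Truth table for material implication, Q => P, defined as (not Q) or P.
# The rows with Q false show the point: the implication is true there
# regardless of whether P ("I win the money") is true.
for q in (True, False):
    for p in (True, False):
        implies = (not q) or p
        print(f"Q={q!s:5} P={p!s:5} Q=>P={implies}")
```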
But more generally, I agree there's no meaningful difference. I'm in the de Finetti school of probability in that I think it only and always expresses our personal lack of knowledge of facts.
Thanks everyone. I had a great time!
The AI forecaster is able to consistently outperform the crowd forecast on a sufficiently large number of randomly selected questions on a high-quality forecasting platform
Seeing how the crowd forecast routinely performs at a superhuman level itself, isn't it an unfairly high bar to clear? Not invalidating the rest of your arguments – the methodological problems you point out are really bad – but before asking the question about superhuman performance it makes a lot of sense to fully agree on what superhuman performance really is.
(I also note that a high-quality forecasting platform suffers from self-selection by unusually enthusiastic forecasters, raising the bar further. However, I don't believe this is an actual problem, because if someone claims "performance on par with humans" I would expect that to mean "enthusiastic humans".)