What is a decision theory as a mathematical object?

post by Erik Jenner (ejenner) · 2020-05-25T13:44:54.284Z · LW · GW · 1 comment

This is a question post.

Contents

  Answers
    5 Alexander Gietelink Oldenziel
    2 Charlie Steiner
None
1 comment

If we want a space of all decision theories, what mathematical objects does it contain? For example, if a decision theory is a function, what are its domain and codomain?

The only approach I'm familiar with is to view expected utility maximizing decision theories as ways of building counterfactuals (section 5 in the FDT paper). A decision theory could then be described as a function that takes in a state and an action and spits out a distribution over world states that result from counterfactually taking action in state .

But EDT, CDT and FDT require different amounts and kinds of structure in the description of the state they take as input (pure probability distributions, causal models and logical models respectively), so this approach only works if there is some kind of structure that is sufficient for all decision theories we might come up with at some point.

Answers

answer by Alexander Gietelink Oldenziel · 2020-05-25T17:34:54.628Z · LW(p) · GW(p)

Fundamentally, finding a good mathematical definition of decision theory that encompasses all the phenomena people care about is a big open problem.

answer by Charlie Steiner · 2020-05-27T11:03:22.401Z · LW(p) · GW(p)

I think the most fundamental thing might be taking in a sequences of bits (or distribution over sequences if you think it's important to be analog) and outputting bits (or, again, distributions) that happen to control actions.

All this talk about taking causal models as an input is merely a useful abstraction of what happens when we do sequence prediction in our causal universe, and it might always be possible to find some plausible excuse to violate this abstraction.

1 comment

Comments sorted by top scores.

comment by Pattern · 2020-05-25T17:14:56.317Z · LW(p) · GW(p)

The most structure might be a lookup table*, but moving away from that requires information about how the different situations relate to each other.

*RL seems related, and a missing aspect from most.