[Aspiration-based designs] Outlook: dealing with complexity

post by Jobst Heitzig, jossoliver, thomasfinn, Simon Dima (simon-dima) · 2024-04-28T13:06:35.841Z · LW · GW · 3 comments

Contents

  Multi-dimensional aspirations
  Hierarchical decision making
None
3 comments

Summary. This teaser post sketches our current ideas for dealing with more complex environments. It will ultimately be replaced by one or more longer posts describing these in more detail. Reach out if you would like to collaborate on these issues.

Multi-dimensional aspirations

For real-world tasks that are specified in terms of more than a single evaluation metric, e.g., how much apples to buy and how much money to spend at most, we can generalize Algorithm 2 as follows from aspiration intervals to convex aspiration sets:

(We also consider other variants of this general idea) 

Fig. 1: Admissibility simplices, and construction of action-aspirations by shifting towards corners and intersecting with action admissibility simplices (see text for details).
Fig. 2: An action admissibility simplex  is the convex combination of the successor states' admissibility simplices , mixed in proportion to the respective transition probabilities . An action aspiration  can be rescaled to a successor state aspiration  by first mapping the corners of the action admissibility sets onto each other (dashed lines) and extending this map linearly.

Hierarchical decision making

A common way of planning complex tasks is to decompose them into a hierarchy of two or more levels of subtasks. Similar to existing approaches from hierarchical reinforcement learning, we imagine that an AI system can make such hierarchical decisions as depicted in the following diagram (shown for only two hierarchical levels, but obviously generalizable to more levels):

Fig. 3: Hierarchical world model in the case of two hierarchical levels of decision making.

3 comments

Comments sorted by top scores.

comment by Roman Malov · 2024-05-02T20:33:22.700Z · LW(p) · GW(p)

Pick  many linearly independent linear combinations  
isn't there at most  linearly independent linear combinations of ?

Replies from: Roman Malov
comment by Roman Malov · 2024-05-02T20:37:34.741Z · LW(p) · GW(p)

maybe you meant pairwise linearly independent (by looking at the graph)

Replies from: Jobst Heitzig
comment by Jobst Heitzig · 2024-05-02T20:49:17.902Z · LW(p) · GW(p)

You are of course perfectly right. What I meant was: so that their convex hull is full-dimensional and contains the origin. I fixed it. Thanks for spotting this!