Posts
Comments
DreamerV3 is not a great example, as they use so many hacks to make the task easier that it barely counts as getting a diamond or Minecraft anymore. Action shaping, macro actions, instant block breaking, fake "bug fixing", all to get a diamond in 0.4% of episodes.
More info here: https://x.com/Karolis_Ram/status/1785750372394348632
Would "delta hedging" be useful here? It helps hedge long option exposure by shorting some amount of a stock.
For example, at the money calls generally have a delta of 0.5, so holding 100 at the money calls and shorting 50 shares makes you roughly neutral for small moves in the underlying asset.
Would probably require monthly rebalancing based on how many options you effectively hold and market moves. It also wouldn't work well if AGI happens at GDM and Google stock goes exponential ("volatility smile" problem).
non pdf arxiv link: https://arxiv.org/abs/2305.15324
For the four examples of
24-16=12, 53-25=25, 34-16=13, 63-17=16
is this the pattern?
ab-cd=ca