Posts
Ambiguous out-of-distribution generalization on an algorithmic task
2025-02-13T18:24:36.160Z
The slingshot helps with learning
2024-10-31T23:18:16.762Z
Comments
Comment by
Wilson Wu (wilson-wu) on
Alexander Gietelink Oldenziel's Shortform ·
2025-03-03T06:11:12.690Z ·
LW ·
GW
"Utter elitism" is a nice article about this phenomenon
Comment by
Wilson Wu (wilson-wu) on
Ambiguous out-of-distribution generalization on an algorithmic task ·
2025-02-16T17:32:48.398Z ·
LW ·
GW
For that earlier section, we used smaller models trained on intersect (4,000 parameters) instead of intersect (80,000 parameters) -- the only reason for this was to allow for a larger sample size of 10,000 models with our compute budget. All subsequent sections use the models.