Do humans really learn from "little" data?

post by Alice Wanderland (alice-wanderland) · 2025-01-14T10:46:09.179Z · LW · GW · 5 comments

This is a link post for https://aliceandbobinwanderland.substack.com/p/do-humans-really-learn-with-little

Contents

  How much data does it take to pretrain a (human) brain? I conducted a (fairer) Fermi estimate.
None
5 comments

How much data does it take to pretrain a (human) brain? I conducted a (fairer) Fermi estimate.

The post goes through the following questions:

To get to this conclusion table:

5 comments

Comments sorted by top scores.

comment by FL33TW00D · 2025-01-14T11:00:27.616Z · LW(p) · GW(p)

I think you're missing a trick here purely focusing on waking seconds.

Demis explains this beautifully in this talk here:

TLDW:

The brain replays memories stochastically during sleep, and replays them OOMs faster than they were experienced. This "multi epoch training" allows the brain to learn much more from the environment, and it can prioritise salient experiences.

Replies from: alice-wanderland, PeterMcCluskey
comment by Alice Wanderland (alice-wanderland) · 2025-01-18T07:08:20.719Z · LW(p) · GW(p)

Perhaps? I'm not fully understanding your point, could you explain a bit more what I'm missing - how does accounting for sleep and memory replay add to the point of comparing the pretraining dataset sizes between human brains and LLMs? At first glance, my understanding of your point is that adding in sleep seconds would increase the training set size for humans by a third or more. I wanted to make my estimate conservative so I didn't add in sleep seconds, but I'm sure there's a case for an adjustment adding it in.

Replies from: nathan-helm-burger
comment by Nathan Helm-Burger (nathan-helm-burger) · 2025-01-18T07:27:22.325Z · LW(p) · GW(p)

Yeah, I don't think it makes sense to add sleep if you are estimating "data points", since it's rehearsing remixes of the data from awake times.

On the other hand, if you are estimating "training steps", then it does make sense to count sleep. Just as you'd count additional passes over the same data.

comment by PeterMcCluskey · 2025-01-14T22:34:31.074Z · LW(p) · GW(p)

"OOMs faster "? Where do you get that idea?

Dreams indicate a need for more processing than what happens when we're awake, but likely less than 2x waking time.

Replies from: FL33TW00D
comment by FL33TW00D · 2025-01-15T08:29:10.878Z · LW(p) · GW(p)

The linked video says so at 30:45