Total compute available to evolution

post by redbird · 2022-01-09T03:25:54.149Z · LW · GW · 12 comments

This is a question post.

Contents

  Answers
    4 tailcalled
None
12 comments

I’m looking for estimates of the total compute available to evolution:

Total number of cell divisions

Total number of times a DNA nucleotide was transcribed to RNA

Total number of times an RNA codon was translated to an amino acid

Total number of base pairs in all genomes

Total entropy of the set of all genomes

I’d like to identify other significant sources of compute available to evolution (epigenome? lateral gene transfer? interspecies communication?) and how to quantify them.

I’m looking for estimates of total compute over all 4 billion years of Earth life, and also how compute varied with time.

Grateful for any leads!

Answers

answer by tailcalled · 2022-01-10T11:24:41.386Z · LW(p) · GW(p)

I always feel like this question is missing something, because it forgets the biggest cost for evolution: deploying the algorithms in the real world and seeing what effects they have.

That is, even if you had all the compute necessary to simulate human brains for all of evolutionary time, you would also need to figure out the effects of the brains on the real world in order to know their usefulness in practice. Doing this directly requires compute on the order of magnitude of the real world, though obviously in practice a lot of this can be optimized away.

comment by redbird · 2022-01-10T13:12:15.009Z · LW(p) · GW(p)

Interesting, my first reaction was that evolution doesn't need to "figure out" the extended phenotype (= "effects on the real world") It just blindly deploys its algorithms, and natural selection does the optimization.

But I think what you're saying is, the real world is "computing" which individuals die and which ones reproduce, and we need a way to quantify that computational work. You're right!

Replies from: tailcalled, tailcalled
comment by tailcalled · 2022-01-11T09:19:48.422Z · LW(p) · GW(p)

I should add: I think it is the hardest-to-compute aspects of this that are the most important to the evolution of general intelligence. With a "reasonable" compute budget, you could set up a gauntlet of small tasks that challenge your skills in various ways. However, this could probably be Goodharted relatively "easily". But the real-world isn't some closed short-term system; it also tests you against effects that take years or even decades to become relevant, just as hard as it tests you for effects that immediately become relevant. And that is something I think we will have a hard time with optimizing our AIs against.

Replies from: redbird
comment by redbird · 2022-01-11T14:09:02.396Z · LW(p) · GW(p)

You're saying AI will be much better than us at long-term planning?

It's hard to train for tasks where the reward is only known after a long time (e.g. how would you train for climate prediction?)

Replies from: tailcalled
comment by tailcalled · 2022-01-11T14:30:53.966Z · LW(p) · GW(p)

No, I'm saying that AI might be much worse than us at long-term planning, because evolution has selected us on the basis of very long chains of causal effects, whereas we can't train AIs on the basis of such long chains of causal effects.

comment by tailcalled · 2022-01-10T14:49:49.154Z · LW(p) · GW(p)

But I think what you're saying is, the real world is "computing" which individuals die and which ones reproduce, and we need a way to quantify that computational work.

Yep.

12 comments

Comments sorted by top scores.

comment by jessicata (jessica.liu.taylor) · 2022-01-09T22:07:23.092Z · LW(p) · GW(p)

Here's a model made by Median Group estimating total brain compute in evolutionary history. Sorry it's not better documented, this was put together quickly on-the-fly.

Replies from: redbird, redbird
comment by redbird · 2022-01-10T02:23:58.573Z · LW(p) · GW(p)

What are FLOPz and FLOPs ?

What sources did you draw from to estimate the distributions?

Replies from: jessica.liu.taylor
comment by jessicata (jessica.liu.taylor) · 2022-01-10T07:57:34.026Z · LW(p) · GW(p)

FLOPs = floating point operations per second.

FLOPz = the same thing, I think (it's used as if it's in the FLOPs unit).

I don't remember the sources for everything. If you want to get a more accurate estimate I recommend re-running with your own numbers.

Here are some estimates of brain compute.

Here's an estimate of the mass of a human brain.

Here's an estimate of current animal biomass.

Here are brain to body mass ratios for different species. Here's an estimate of the composition of animal biomass which should help figure out which brain-to-body-mass numbers to use.

Here's a Quora question about changes in Earth biomass over time.

(I think if you spent some time on these estimates they'd turn out different from the numbers in the model; we did this mostly to check rough order of magnitude over the course of a couple hours, finding that evolution will not be simulable with foreseeable compute.)

Replies from: redbird
comment by redbird · 2022-01-10T13:49:02.183Z · LW(p) · GW(p)

Great links, thank you!!

So your focus was specifically on the compute performed by animal brains.

I expect total brain compute is dwarfed by the computation inside cells (transcription & translation). Which in turn is dwarfed by the computation done by non-organic matter [LW(p) · GW(p)] to implement natural selection. I had totally overlooked this last part!

Replies from: jessica.liu.taylor
comment by jessicata (jessica.liu.taylor) · 2022-01-11T04:40:49.477Z · LW(p) · GW(p)

Non-brain matter is most of the compute for a naive physics simulation, however it's plausible that it could be sped up a lot, e.g. the interiors of rocks are pretty static and similar to each other so maybe they can share a lot of computation. For brains it would be harder to speed up the simulation without changing the result a lot.

comment by redbird · 2022-01-10T00:48:52.840Z · LW(p) · GW(p)

Awesome!!! Exactly the kind of thing I was looking for

comment by malin · 2022-01-11T11:19:15.654Z · LW(p) · GW(p)

A common misconception is to envision genomic evolution as a matter of single base substitutions (i.e. an adenine becoming a guanine, etc).

Truth is, most of real genome evolution leaps are created by structural variations, i.e. from whole genome/chromosome duplications to microdeletions/microduplications fostered by the existence of repeated sequences interspersed in the genome (mostly due to [retro-]transposons), which act as hotspots for copy/paste/delete events.

And these repeated sequences aren't equal among time and species. 

comment by Derek M. Jones (Derek-Jones) · 2022-01-09T17:14:09.180Z · LW(p) · GW(p)

This is a poorly thought out question.

 

Evolution implies a direction of travel driven by selection pressure, e.g., comparative fitness within an environment.

 

A sequence of random processes that are not driven by some selection pressure is just, well, random.

 

What is the metric for computational effort?

 

Are you actually interested in computational resources consumed, or percentage of possibilities explored?

Replies from: gwern, redbird
comment by gwern · 2022-01-09T17:34:46.587Z · LW(p) · GW(p)

The main reason for past discussions of this question has been to upperbound the amount of compute necessary to create AGI: "if evolution could create humans with X yottaflops total, then we can certainly create an AGI with <=X yottaflops - if only by literally simulating molecule by molecule the evolution of humanity". Basically, the worst possible biological anchor estimate. (Personally, I think it's so vacuous an upper bound as to not have been worth the energy which has already been put into thinking about it.)

Replies from: redbird
comment by redbird · 2022-01-09T18:41:42.448Z · LW(p) · GW(p)

AGI timeline is not my motivation, but the links look helpful, thanks!

comment by redbird · 2022-01-09T18:49:36.648Z · LW(p) · GW(p)

Hmm how would you define "percentage of possibilities explored"? 

I suggested several metrics, but I am actively looking for additional ones, especially for the epigenome and for communication at the individual level (e.g. chemical signals between fungi and plants, animal calls, human language).

Replies from: Derek-Jones
comment by Derek M. Jones (Derek-Jones) · 2022-01-09T20:09:15.866Z · LW(p) · GW(p)

Chemical space, https://en.wikipedia.org/wiki/Chemical_space, is one candidate for a metric of the possibilities.

 

The book "Chemical Evolution: Origins of the Elements, Molecules and Living Systems" by  Stephen F. Mason might well contain the kinds of calculations you are looking for.