Some real examples of gradient hacking

post by Oliver Sourbut · 2021-11-22T00:11:35.047Z · LW · GW · 4 comments


  In biology
    Sexual selection
  In society

Speculation and an invitation to make further suggestions or clarifications

I've been wondering what might count as an analogous example of gradient hacking [LW · GW] if any has ever occurred, and if it could be useful to collect such analogies (while mindful not to overfit our expectations). Presumably so far the only examples will be for genetic natural selection (perhaps particularly in humans), and perhaps cultural/memetic selection (leaving aside the technicality of whether natural selection has a gradient per se).

I'm looking for deliberate behaviours which are intended to affect the fitness landscape of the outer process.

In biology

Sexual selection

I think examples of sexual selection are adjacent but don't qualify as gradient hacking. While they constitute an execution of a policy [LW · GW] whose main legible effect is indeed on natural selection, presumably in nearly all cases there is not a deliberate process of influence on the natural selection process as such. The policies themselves are encoded by genetic natural selection for the most part, so I'd say this qualifies more as one kind of divergent training trajectory of the outer training process. There remains a plausible analogy that at least some animals and humans engaging in sexual selection are doing so 'because' they are inner misaligned [? · GW] (they are agents with preferences which are mere proxies for the 'true goals' of genetic natural selection and sexual selection is just one such proxy preference).

Examples of sexual selection where the preferences are socially learned seem closer, and there are examples in animals and humans. Even in this case, even though the policy is adapted 'at runtime' (the genetic policy might be something like 'choose a mate which my society codes as attractive', which incidentally presumably encourages other genetic policies like 'try to conform to my society's attractiveness code' and maybe even 'try to nudge my society's attractiveness code towards me and my kin') it is still not carried out with the runtime deliberate intention of affecting the outer process of natural selection (even though it is a deliberate action which does in fact have this side effect, the intended goal is the proxy).


Perhaps the first and only qualifying attempts at actual gradient hacking are efforts towards eugenics practiced by humans, up to and including some instances of (attempted or successful) genocide. Probably since prehistory and certainly since antiquity we've had some 'mesa'/'runtime' understanding of heritability, in contrast to (presumably) all other animals. Most such behaviours are deliberative and involve explicit modelling of the process of heritability, with the stated intention (at least nominally) being to affect the trajectory of the natural selection process (stated in antiquity in very crude but still essentially-correct terms).

What if (I think this is not very credible but at least plausible) a sufficient cause of all eugenics attempts is in fact a genetically-naturally-selected policy schema of 'come up with whatever excuses you can to favour (the success/reproduction/existence of) your kin'? (And the rest of the fluff is just instrumental persuasion and implementation attempts.) Would eugenics be downgraded all the way from being gradient hacking to 'mere' proxy alignment?

In society

This gets more speculative.


What if we adapt the previous hypothesis about genetic eugenics to... eumemics? OK wow, I thought I coined that but Wikipedia even has a tiny bit on this. It is harder to locate the object of memetic selection, and I'm not sure if it's right to identify it with individual organisms the way we can often sloppily get away with doing for biological genetic natural selection. Maybe it is hard to call humans 'inner misaligned with respect to memetic natural selection' with a straight face. But if so, do 'eumemic' attempts qualify as instances of gradient hacking?

If a meme or meme-complex encodes a behaviour of deliberately affecting the meme fitness landscape in ways unrelated to the particular meme(plex), does that qualify as gradient hacking? If so, could boycotts, cancellation, some reading of basically every ethical theory, and many other ideologies besides qualify?

Same question as above - what if such hackery is in fact just the side-effect of memetic natural selection acting on memeplexes which tend to encode behaviours promoting themselves (with a side-effect of also affecting the meme fitness lanscape in other ways)? If so, is this simply proxy alignment rather than gradient hacking?


Comments sorted by top scores.

comment by Oliver Sourbut · 2021-11-23T09:52:02.081Z · LW(p) · GW(p)

Affecting 'someone else's gradient'

A case which didn't make the shortlist, but perhaps domestication counts?

It's a deliberate attempt at affecting the (best understanding of the) outer adaptation process. But in the case of domestication, it's targeted primarily at the outer natural selection process of a different lineage. Of course the lineages interact, meaning it does affect the outer natural selection process of the self lineage, but that's not the main legible effect, nor presumably the intended one.

A more modern and 'competent' example might be the (proposed) use of artificial gene drives to perturb an existing genetic population. Again this acts on a different lineage primarily.

comment by Zac Hatfield Dodds (zac-hatfield-dodds) · 2021-11-22T02:58:45.478Z · LW(p) · GW(p)

Probably since prehistory and certainly since antiquity we've had some 'mesa'/'runtime' understanding of heritability, in contrast to (presumably) all other animals.

No, not so much. See e.g.

Like anything else, the idea of “breeding” had to be invented. That traits are genetically-influenced broadly equally by both parents subject to considerable randomness and can be selected for over many generations to create large average population-wide increases had to be discovered the hard way, with many wildly wrong theories discarded along the way. Animal breeding is a case in point, as reviewed by an intellectual history of animal breeding, Like Engend’ring Like, which covers mistaken theories of conception & inheritance from the ancient Greeks to perhaps the first truly successful modern animal breeder, Robert Bakewell (1725–1795).

Why did it take thousands of years to begin developing useful animal breeding techniques, a topic of interest to almost all farmers everywhere, a field which has no prerequisites such as advanced mathematics or special chemicals or mechanical tools, and seemingly requires only close observation and patience? ... What is most interesting is the intellectual history we can extract from it in terms of inventing heritability and as important, one of the inventions of progress in the gradual realization that selective breeding was even possible.

Replies from: Oliver Sourbut
comment by Oliver Sourbut · 2021-11-23T09:38:58.252Z · LW(p) · GW(p)

That's a very interesting link, thank you! I suppose my reply would be that I don't claim that any of these attempts are particularly competent, merely that they qualify as (incomplete) recognition of an outer adaptation process and deliberate attempts at hacking it.

comment by Slider · 2021-11-22T01:42:33.602Z · LW(p) · GW(p)

It is more fiction but in Man in the High Castle there is the character of Thomas Smith.

I think you are handling the case of pressing other groups down with massmurders but that would be "just" "might makes right". Some of the more frightening aspects would be to pinpoint that pruning inwards to do self-eugenics which would be conceptualised as a favour to your in-group. If all eugenics would be "mere" proxys then one would expect to abandon it if it suggested things strongly in the negatives for the more actual goals. Ie the moment eugenics would call your kin to hinder you would drop it. A large part of the drama in the tv series is gotten by different moral intuitions pulling in different directions and I guess repeatedly exploring how that states ideology is f up in more and more detail.

For the big plot points relevant for this (spoilers for Man In the High Castle):

Thomas Smith becomes a saint for having a terminal case of internalised ablism. Atleast for subkin promotion it did not get dropped. There can be a case made that it makes sense for promoting his siblings. The issue for conceptual analysis is whether the motivations of people involved were grounded.