What long-term good futures are possible (other than FAI)?
post by Donald Hobson (donald-hobson) · 2020-01-12T18:04:52.803Z · LW · GW · 3 comments
This is a question post.
Contents
Answers: 3 Wei_Dai · 3 Mark_Friedenbach · -8 maximkazhenkov · None
3 comments
Does anyone know of any potential long-term futures that are good and do not involve the creation of a friendly superintelligence?
To be clear, "long-term future" means a billion years or more. In most of my world models, we settle within the next few hundred years into a state from which it is much easier to predict the future (i.e. a state where it seems unlikely that anything much will change).
By "good", I mean any future that you would prefer to be in if you cared only about yourself, or one in which you would be replaced with a robot that would do just as much good. A weaker condition would be any future in which you would still prefer not to be erased from existence.
A superintelligent agent running around doing whatever is friendly or moral would meet these criteria; I am excluding it because I already know about that possibility. Your futures may contain superintelligences that aren't fully friendly. A superintelligence that acts as a ZFC oracle is fine.
Your potential future doesn't have to be particularly likely, just remotely plausible. You may assume that a random 1% of humanity reads your reply and goes out of their way to make that future happen. That is, people optimizing for this goal can use strategies of the form "someone does X" but not "everyone does X". You can get "a majority of humans do X" if X is easy to do and explain and most people have no strong reason not to do X.
You should make clear what stops somebody from making a UFAI that goes on to destroy the world (e.g. a paperclip maximizer).
What stops Moloch? What stops us from throwing away everything of value in order to win competitions? (Hanson's hardscrabble frontier replicators.)
Answers
We build what I called a modest superintelligence [LW · GW], consisting of one or more humans who are naturally extremely intelligent or who underwent intelligence amplification. They figure out how to build a stable world government and decide that it's safer to do WBE and gradually increase human (em) intelligence than to build an FAI.
↑ comment by Donald Hobson (donald-hobson) · 2020-01-22T00:18:17.018Z · LW(p) · GW(p)
Safely and gradually enhancing human intelligence is hard. I agree that a team of human geniuses with unlimited time and resources could probably do it. But you need orders of magnitude more resources and thinking time than the fools "trying" to make UFAI.
Suppose a genetics project makes a lot of very smart babies. The project will find it hard to indoctrinate them while also educating them well and preserving diversity. A militaristic bootcamp will get them all marching in line, but it will squash out most curiosity and leave little room for skill. Handing them off to foster parents with STEM backgrounds gets you a bunch of smart people with no organizing control; that is just a shift in demographics, and you have no hope of capturing all the value. Some will work on AI safety, intelligence enhancement or whatever, and some will work in all sorts of other jobs.
Whole brain emulation seems possible; I question how we get it before someone makes UFAI, but it's plausible that we do. If a group of smart, coordinated people ends up with the first functioning mind uploading and the first nanomachines, is fine with duplicating themselves a lot, and has computers fast enough to let them think really fast, then that is enough for a decisive strategic advantage. If they upload everyone else into a simulation that doesn't contain access to anything Turing-complete (so no one can make UFAI within the simulation), then they could guide humanity towards a long-term future without any superintelligence. They will probably figure out FAI eventually.
Yes, the most likely outcome: people develop increasingly capable intelligence as extensions of themselves in a pluralistic society, without any one entity obtaining monopolistic control. Society continues as-is, but with posthuman capabilities.
↑ comment by Donald Hobson (donald-hobson) · 2020-01-14T09:59:43.418Z · LW(p) · GW(p)
as extensions of themselves
Let's assume that AI doubling time is fairly slow (e.g. 20 years) and that AI development is very widely distributed. Huge numbers of people throw together AI systems in garages. If the basic problems of FAI haven't been solved, you are going to get millions of paperclip maximizers (well, most of them will be optimising for different things). A hundred years later, humanity, if it still exists at that point, consists of pawns on a gameboard that contains many superintelligences. What happens depends on how different the superintelligences' goals are, and how hard it is for superintelligences to cooperate. Either they fight, killing humanity in the crossfire, or they work together to fill the universe with a mixture of all the things they value. The latter looks like 1% paperclips, 1% staples, 1%...
Alternately, many people could understand friendliness and make various FAIs. The FAIs work together to make the world a nice place. In this scenario the FAIs aren't identical, but they are close enough that any one of them would make the world nice. I also agree that a world with FAIs and paperclip maximisers could be nice if the FAIs have a significant portion of total power.
Society continues as-is, but with posthuman capabilities.
"Exactly like ancient Egypt, except that like electromagnetic charges attract and unlike charges repel." I posit that this sentence doesn't make sense: if matter behaved that way, then atoms couldn't exist. When we say "like X but with change Y", we are considering the set of all possible worlds that meet criterion Y and finding the one nearest to X. But here there is no world where like charges attract that looks anything like ancient Egypt. We can say "like ancient Egypt, but gold is 10x more abundant". That ends up as a Bronze Age society that builds pyramids and makes a lot more gold jewelry than the real Egyptians did. I think that "society as-is, but with posthuman capabilities" is the first kind of sentence. There is no way of making a change like that and getting anything resembling society as-is.
↑ comment by [deleted] · 2020-01-16T13:38:14.928Z · LW(p) · GW(p)
I don't understand why you keep humans and AI as ontologically separate entities in your prediction model. Increasing AI capabilities support the humans that operate them. This is the case today, and will be for the foreseeable future. Right now the human-computer interface is little mobile devices that we interact with using our voices and hands, so AI assistance more resembles a personal super-secretary. But in the coming decades we will develop and deploy direct neural human-computer interfaces that will allow advances in AI to advance us directly. I don't see that pathway as resembling your "thousands of paperclip maximizers" scenario.
I really don't understand your Egypt analogy, sorry.
↑ comment by Donald Hobson (donald-hobson) · 2020-01-16T21:05:52.099Z · LW(p) · GW(p)
At the moment, human brains are a cohesive whole that optimizes for human values. We haven't yet succeeded in making machines share our values, and the human brain is not designed for upgrading. The human brain can take knowledge from an external source and use it. External tools follow the calculator model: the human thinks about the big picture and realizes that, as a mental subgoal of designing a bridge, they need to do some arithmetic. Instead of doing the arithmetic themselves, they pass the task on to the machine. In this circumstance the human controls the big picture; the human understands what cognitive labor has been externalized and knows that it will help the human's goals.
If we have a system to which a human can say "go and do whatever is most moral", that's FAI. If we have a calculator-style system where humans specify the power output, weight, material use, radiation output etc. of a fusion plant, and the AI tries to design a fusion plant meeting those specs, that's useful but not nearly as powerful as full ASI. Humans with calculator-style AI could invent molecular nanotech without working out all the details, but they would still need an Eric Drexler to spot the possibility.
In my model you can make a relativistic rocket, but you can't take a sparrow and upgrade it into something that flies through space at 10% of light speed and is still a sparrow. If you're worried that relativistic rockets might spew dangerous levels of radiation, you can't make a safe spacecraft by taking a sparrow and upgrading it to fly at 10% c. (Well, with enough R&D you could make a rocket that superficially resembles a sparrow. Deciding to upgrade a sparrow doesn't make the safety engineering any easier.)
Making something vastly smarter than a human is like making something far faster than a sparrow. Strap really powerful turbojets to the sparrow and it crashes and burns. Attach a human brain to 100x-human-brain gradient descent and you get an out-of-control AI system with nonhuman goals. Human values are delicate. I agree that it is possible to carefully unravel what a human mind is thinking and what its goals are, and then upgrade it in a way that preserves those goals, but this requires a deep understanding of how the human mind works. Even granted mind uploading, it would still be easier to create a new mind largely from first principles. You might look at the human brain to figure out what those principles are, in the same way a plane designer looks at birds.
I see a vast space of all possible minds, some friendly, most not. Humans are a small dot in this space. We know that humans are usually friendly. We have no guarantees about what happens as you move away from humans; in fact we know that one small error can sometimes send a human totally mad. If we want to make something that we know is safe, we either need to copy that dot exactly (i.e. normal biological reproduction or mind uploading), or we need something we can show to be safe for some other reason.
My point with the Egypt metaphor was that the sentence
Society continues as-is, but with posthuman capabilities.
is incoherent.
Try "the stock market continues as is, except with all life extinct"
Describing the modern world as "like a tribe of monkeys, except with post monkey capabilities" is either wrong or so vague to not tell you much.
Suppose that at the point when the system (upgraded human, AI, whatever you want to call it) is 99% silicon, a stray meteor hits the biological part. If the remaining 99% stays friendly, then somewhere in this process you have solved FAI. I see no reason why aligning a 99%-silicon being is easier than aligning a 100%-silicon being.
3 comments
comment by AprilSR · 2020-01-13T11:34:00.771Z · LW(p) · GW(p)
Given that SAI is possible, regulation of AI is necessary to prevent people from making a UFAI. Alternatively, an SAI which is not fully aligned but has no goals directly conflicting with ours might be used to prevent the creation of UFAI.
↑ comment by Donald Hobson (donald-hobson) · 2020-01-13T13:25:24.069Z · LW(p) · GW(p)
This seems like one potential path, but for it to work you would need a government structure that can survive for a billion years without any successful pro-AI revolutionaries. You also need law enforcement good enough to stop anyone trying to make UFAI, with not a single failure in a billion years. As for an SAI that would help us stop UFAI, can you explain 1) how it would help and 2) how it would be easier to build than FAI?
You also need to say what happens with evolution: given this kind of time and non-ancestral selection pressures, evolution will produce beings not remotely human in mind or body. Either argue that the evolution is in a morally okay direction and that your government structure works with these beings, or stop evolution by selective breeding, frozen samples, or genetic modification towards some baseline. Then you just need to say how all human populations get this, or how any population that doesn't will be prevented from building UFAI.
↑ comment by AprilSR · 2020-01-15T00:18:51.731Z · LW(p) · GW(p)
I have no comment on how plausible either of these scenarios is. I'm only making the observation that long-term good futures not featuring friendly AI require some other mechanism preventing UFAI from happening. Either SAI in general would have to be implausible to create at all, or some powerful actor such as a government or a limited AI would have to prevent it.