Posts

Uploads are Impossible 2023-05-12T08:03:22.512Z
Value Learning – Towards Resolving Confusion 2023-04-24T06:43:29.360Z
An AI Realist Manifesto: Neither Doomer nor Foomer, but a third more reasonable thing 2023-04-10T00:11:53.802Z
What does it mean for an LLM such as GPT to be aligned / good / positive impact? 2023-03-20T09:21:42.256Z
AI as a Civilizational Risk Part 6/6: What can be done 2022-11-03T19:48:52.376Z
AI as a Civilizational Risk Part 5/6: Relationship between C-risk and X-risk 2022-11-03T02:19:46.847Z
AI as a Civilizational Risk Part 4/6: Bioweapons and Philosophy of Modification 2022-11-01T20:50:54.078Z
AI as a Civilizational Risk Part 3/6: Anti-economy and Signal Pollution 2022-10-31T17:03:00.179Z
AI as a Civilizational Risk Part 2/6: Behavioral Modification 2022-10-30T16:57:51.046Z
AI as a Civilizational Risk Part 1/6: Historical Priors 2022-10-29T21:59:26.858Z

Comments

Comment by PashaKamyshev on Uploads are Impossible · 2023-05-14T23:45:21.575Z · LW · GW

Robin is a libertarian; Nate used to be, but after that camp's calls to "bomb datacenters" and vague calls for "regulation," I don't buy the libertarian credentials.

The specific phrase "cradle bags of meat" is dehumanization. Many people view dehumanization as evidence of violent intentions. I understand you do not, but can you step back and recognize that some people are quite sensitive to that phrasing?

Moreover, when I say "forcefully do stuff to people they don't like", this is a general problem. You seem to interpret this as only talking about "forcing people to be uploaded", which is a specific sub-problem. There are many other instances of the general problem I refer to, such as:

a) forcing people to take care of economically unproductive digital minds. 

It's clear that Nate views "American tax-payers" with contempt. However, depending on the specific economics of digital minds, people are wary of being forced to give up resources to something they don't care about.

or 

b) ignoring existing people's will with regard to dividing the cosmic endowment.

In your analogy, if a small group of people wants to go to Mars and take a small piece of it, that's ok. However, if they wish to go to Mars and then forcefully prevent other people from going there at all because they claim they need all of Mars to run some computation, that is not ok.

It's clear from the interstellar probes comment that Nate doesn't see any problem with that.


Again, the general issue here is that I am AWARE of the disagreement about whether or not digital minds (I am not talking about uploads, but about other, simpler-to-make categories of digital minds) are ok to create. Despite being in the majority position of "only create tool-AIs", I acknowledge that people might disagree. There are ways to resolve this peacefully (divide the star systems between groups). However, despite being in the minority position, LW seems to wish to IGNORE everyone else's vision of the future and to call everyone else insults like "bags of meat" and "carbon-chauvinists".

Comment by PashaKamyshev on Humans aren't fitness maximizers · 2023-05-14T09:17:22.540Z · LW · GW

I think this post suffers pretty badly from the Typical Mind Fallacy. This thinking isn't alien to me; I used to think exactly like this 8 years ago, but since getting married and having a kid I now disagree with basically every point.

One claim that is hopefully uncontroversial: Humans are not literally optimizing for IGF,

I think this is controversial because it's basically wrong :)

First, it's not actually obvious what "definition" of IGF you are using. If you talk about animals, the definition that might fit is "number of genes in the next generation". However, if you talk about humans, we care about both "number of genes in the next generation" and "resources given to the children". Humans can see "one step ahead" and know the rough prospects their children have in the dating market. "Resources" is not just money; it is also knowledge, beauty, etc.

Given this, if someone decides to have two children instead of four, this might just mean they simply don't trust their ability to equip the kids with the necessary tools to succeed. 

Now, different people ALSO have different weights for the quantity vs. quality of offspring. See Shoshannah Tekofsky's comment (unfortunately disagreed with) for the female perspective on this. Evolutionary theory might predict that males are more prone to maximize quantity and satisfice quality, while females are more prone to satisfice quantity and maximize quality. That is, "optimization" is not the same as "maximization". There can also be satisficing / maximization mixes where each additional unit of quality or quantity still has value, but the value falls off.
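To make that last point concrete, here is a toy sketch (purely illustrative; the functional forms, weights, and thresholds are made-up assumptions, not a model of anyone's actual preferences) of a "satisfice quantity, maximize quality with diminishing returns" utility next to a pure maximizer:

```python
import math

def maximizer_utility(quantity, quality):
    # Pure maximizer: every extra unit of either dimension counts the same.
    return quantity + quality

def satisficer_mix_utility(quantity, quality, quantity_target=2.0):
    # Quantity is satisficed: full credit up to the target,
    # sharply diminishing credit beyond it.
    quantity_value = min(quantity, quantity_target) + 0.1 * max(quantity - quantity_target, 0.0)
    # Quality is still "maximized", but with log-style diminishing returns:
    # each additional unit adds value, and that value falls off.
    quality_value = math.log1p(quality)
    return quantity_value + quality_value

for kids in (1, 2, 4, 8):
    print(kids, round(satisficer_mix_utility(kids, quality=5.0), 3))
```

The point of the toy is just that behavior "optimized" under such a mixed utility looks very different from maximizing raw offspring count.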


Would you give up your enjoyment of visual stimuli then, like an actual IGF optimizer would?

If given a choice between having 10 extra kids with my current wife painlessly, plus sufficient resources for a good head start for them, I would consider giving up my enjoyment of visual stimuli. The only hesitation is that I don't like "weird hypotheticals" in general, and I potentially expect "human preference architectures" not to be as easily "modularizable" as computer architectures. This giving-up could also have all sorts of negative effects beyond losing the "qualia" of visualness, like losing the capacity for spatial reasoning. However, if the "only" thing I lose is qualia and not any cognitive capacities, then this is an easy choice.

But, do you really fundamentally care that your kids have genomes?

Yes, obviously I do. I don't consider "genomeless people" to be a thing, I dislike genetic engineering and over-cyborgization, and I don't think uploads are even possible.

Or, an even sharper proposal: how would you like to be killed right now, and in exchange you'll be replaced by an entity that uses the same atoms to optimize as hard as those atoms can optimize, for the inclusive genetic fitness of your particular genes. Does this sound like practically the best offer that anyone could ever make you? Or does it sound abhorrent?

This hypothetical is too abstract to be answerable, but if I were to offer an answer to a hypothetical with a similar vibe: many people do in fact die for potential benefits to their families' inclusive fitness; we call them soldiers / warriors / heroes. Now, sometimes their government deceives them about whether or not their sacrifice is in fact helpful to their nation, but the underlying psychology seems easily consistent with "IGF-optimization".

My point today is that the observation “humans care about their kids” is not in tension with the observation “we aren't IGF maximizers”,

I think this is where the difference between the terms "optimizer" and "maximizer" is important. It's also important to understand what sort of constraints most people in fact operate under. Most people seem to act AS IF they are IGF satisficers - they get up to a certain level of quantity / quality and seem to slow down after that. However, it's hard to infer the exact values because very specific subconscious / conscious beliefs could be influencing the strategy.

For example, I could argue that secretly many people want to be maximizers; however, this thing we call civilization is effectively an agreement between maximizers to forgo certain maximization tactics and stick to being satisficers. So people might avoid "overly aggressive" maximization because they are correctly worried it will be perceived as "defection" and end up backfiring. Given that the current environment is very different from the ancestral environment, this particular machinery might be malfunctioning, leading people to subconsciously perceive having any children as defection. However, I suspect humanity will adapt within a small number of generations.

Humans are not literally optimizing for IGF, and regularly trade other values off against IGF.

Sort of true. The main value people seem to trade off is "physical pain." Humans are also resource- and computation-constrained, and implementing "proper maximization" in a heavily resource-constrained computation may not even be possible.


Introspecting on my thinking before and after kids, I have a theory that the process of finding a mate prior to "settling down" tends to block certain introspection into one's motivations. It's easier to appreciate art if you are not thinking "oh, I am looking at art I like because art provides Bayesian evidence on lifestyle choices to potential mates". Thinking this way can appear low-status, which is itself a bad sign. So the brain is more prone to lying to itself that "there is enjoyment for its own sake." After having a kid, the mental "block" is lifted, and it is sort of obvious that this is what I was doing and why.

Comment by PashaKamyshev on Uploads are Impossible · 2023-05-14T06:15:15.793Z · LW · GW

Robin's whole Age of Em is basically pronouncing "biology is over" in a cheerful way. 

Some posts from Nate:

here

I want our values to be able to mature! I want us to figure out how to build sentient minds in silicon, who have different types of wants and desires and joys 

I don't want that; instead, I want a tool intelligence that augments me by looking at my words and actions. Digital minds (not including "uploads") are certainly possible and highly undesirable for most people, simply due to competition for resources and higher potential for conflict. I don't buy the lack-of-resource-scarcity argument for a second.

here

uploading minds; copying humans; interstellar probes that aren't slowed down by needing to cradle bags of meat, ability to run civilizations on computers in the cold of space 
...

in the long term, i think you're looking at stuff at least as crazy as people running thousands of copies of their own brain at 1000x speedup and i think it would be dystopian to try to yolk them to, like, the will of the flesh-bodied American taxpayers (or whatever). 

“cradle bags of meat” is a pretty revealing phrase about what he thinks of actual humans and biology.

In general, the idea of having regular people, now and in the future, have any say about the future of digital minds seems to be anathema here. There is no acknowledgement that this is the MINORITY position and that there is NO REASON other people would go along with it. I don't know how to interpret these pronouncements that go against the will of the people other than as a full-blown intention to use state violence against people who disagree. Even if you can convince one nation to brutally suppress protests against digital minds, that doesn't mean others will follow suit.

here

This is a set of researchers that generally takes egalitarianism, non-nationalism, concern for future minds, non-carbon-chauvinism, and moral humility for granted, as obvious points of background agreement; the debates are held at a higher level than that. 

“non-carbon-chauvinism" is such a funny attempt at an insult. You have already made up an insult for not believing in something that doesn’t exist. Typical atheism :)

The whole phrase comes off as saying that "people without my exact brand of really weird ideas" are wrong and not invited to the club. You can exclude people all you want, just don't claim that anything like this represents actual human values. I take this with the same level of seriousness as if I pronounced "atheism has no place in technology because it does not have the favor of the Machine God".


These are only public pronouncements...  

Comment by PashaKamyshev on Uploads are Impossible · 2023-05-13T06:01:18.660Z · LW · GW

I generally don't think LLMs today are conscious; as far as I can tell, neither does Sam Altman, but there is some disagreement. They could acquire some characteristics that could be considered conscious as scale increases. However, merely having "qualia" and being conscious is not the same thing as being functionally equivalent to a new human, let alone to a specific human. The term "upload" as commonly understood means the creation of a software construct functionally and qualia-equivalent to a specific human.

  • a human brain in a vat wouldn't be so far from the experience of language models.

Please don't try to generalize over all human minds based on your own experience. Human experience is more than just reading and writing language. Some people have differing levels of identification with their "language center": for some it might seem like the "seat of the self," for others it is just another module, and some people have next to no internal dialogue at all. I suspect that these differences, plus cultural differences around "self-identification with linguistic experience," are actually quite large.

  • I personally want to maintain my human form as a whole but expect to drastically upgrade the micro-substrate beyond biology at some point

I suspect a lot of the problems described in this post also occur at the micro-scale with that strategy.

Comment by PashaKamyshev on Uploads are Impossible · 2023-05-13T05:42:31.770Z · LW · GW

Thanks for the first part of the comment. 

As mentioned in my comment above, the reason for mixing "can" and "should" problems is that they form a "stack" of sorts, where attempting to approximately solve the bottom problems makes the problems above them harder, and verification is important. How many people would care about the vision if one could never be certain the process succeeded?

Comment by PashaKamyshev on Uploads are Impossible · 2023-05-13T05:22:36.848Z · LW · GW

Fixed the link formatting and added a couple more sources, thanks for the heads-up. The temperature claim does not seem unusual to me in the slightest. I have personally tried a relatively cold bath and noticed my "perception" change pretty significantly.

The organ claim does seem more unusual, but I have heard various forms of it from many sources at this point. It does not, however, seem in any way implausible. Even if you maintain that the brain is the "sole" source of cognition, the brain is still an organ and is heavily affected by the operation of other organs.

Comment by PashaKamyshev on Uploads are Impossible · 2023-05-13T05:06:38.002Z · LW · GW

There is a lot to unpack. I have definitely heard claims from leaders of the community to the tune of "biology is over," without further explanation of what exactly that means or what specific steps are expected to happen when the majority of people disagree with it. The lack of clarity here makes it hard to find a specific claim of "I will forcefully do stuff to people they don't like," but my simply saying "I and others want to actually have what we think of as 'humans' keep on living" is met with some pushback.

You seem to be saying that the "I" or "Self" of a person is somehow static through large possible changes to the body. On a social and legal level (family and friends recognize them), we need a simple shorthand for what constitutes the same person, but the social level is not the same as the "molecular level."

On a molecular level, everything impacts cognition. Good vs. bad food impacts cognition; taking a cold vs. warm shower impacts cognition. If you read Impro, even putting on a complicated mask during a theater performance impacts cognition.

Whatever you think of as "yourself," the "I am me," is a product of your quantum-mechanical state. The body fights really hard to keep some aspects of that state invariant. If the temperature of the room increases by 1C, nothing much might change; however, if the body loses the battle and your core temperature increases by 1C, you likely have either a fever or heat-related problems, with the corresponding impact on cognition. And if the room is dusty enough, people can become distressed from the sight of it or from lack of oxygen.

So if you claim that only a small portion of the molecular information is relevant to the construction of the self, you will fail to capture all the factors that affect cognition and behavior. Also, considering only a portion of the body's molecules doesn't solve the physics problem of needing to obtain molecular-level information without destroying the body. You would also need to hope that the relevant information is more "macro-scale" than molecules to get around the thermodynamics issues. However, every approximation one makes away from a perfect simulation is likely to make the cognition and behavior drift further from the person, which makes the verification problem (did it actually succeed?) harder.

This is also why it's a single post. The problems form a "stack" in which fuzzy or approximate solutions at the bottom of the stack make the problems in the layers above harder.

Now, there is one particular piece of molecular-level information worth mentioning. A person's DNA is the most stable molecular construct in the body; it is preserved by the body with far more care than whatever we think of as cognition. How much cognition is shared between a newborn and the same person at 80 years old? DNA is also built with redundancies, which means a piece of it can be collected while the majority of the body remains intact. However, I don't think that "write one's DNA to the blockchain" is what people mean when they say uploads.

Comment by PashaKamyshev on Evolution provides no evidence for the sharp left turn · 2023-04-11T21:05:16.400Z · LW · GW

The general vibe of the first two parts seems correct to me. Also, an additional point is that evolution's utility function of inclusive genetic fitness didn't completely disappear and is likely still a sub-portion of the human utility function. I suspect there is going to be disagreement on this, but it would also be interesting to do a poll on this question and break it down by people who do or do not have kids.

Comment by PashaKamyshev on An AI Realist Manifesto: Neither Doomer nor Foomer, but a third more reasonable thing · 2023-04-10T08:01:07.554Z · LW · GW

Yes, I think we understand each other. One thing to keep in mind is that different stakeholders in AI are NOT utilitarians; they have local incentives they individually care about. Given the fact that COVID didn't stop gain-of-function research, getting EVERYONE to care would require a death toll larger than COVID's. However, getting someone like the CEO of Google to care would "only" require a half-a-trillion-dollar lawsuit against Microsoft over some issue relating to their AIs.

And I generally expect those types of warning shots to be pretty likely, given how gung-ho the current approach is.

Comment by PashaKamyshev on An AI Realist Manifesto: Neither Doomer nor Foomer, but a third more reasonable thing · 2023-04-10T06:57:03.373Z · LW · GW

I am mostly agreeing with you here, so I am not sure you understood my original point. Yes, reality is giving us things that, for a set of reasonable people such as you and me, should be warning shots.

Since a lot of other people don't react to them, you might become pessimistic and extrapolate that NO warning shot is going to be good enough. However, I posit that SOME warning shots are going to be good enough. An AI-driven bank run followed by an economic collapse is one example, but there could be others. Generally, I expect that when warning shots reach the level of "nation-level" socio-economic problems, people will pay attention.

However, this will happen before doom.

Comment by PashaKamyshev on An AI Realist Manifesto: Neither Doomer nor Foomer, but a third more reasonable thing · 2023-04-10T04:33:49.340Z · LW · GW

Yes, I agree. I have ideas for how to fix it as well, but I seriously doubt they will gain much traction.

Comment by PashaKamyshev on An AI Realist Manifesto: Neither Doomer nor Foomer, but a third more reasonable thing · 2023-04-10T04:19:49.088Z · LW · GW

I am also familiar with Paul Christiano; I think his arguments for a slower, more continuous takeoff are broadly on the right track as well.

Given that the extreme positions have strong stake-outs on Twitter, I am once again claiming that there needs to be a strong stake-out of the more reasonable centrism. This isn't the first post in this direction; there were ones before and there will be ones after.

Just trying to keep this particular ball rolling. 

Comment by PashaKamyshev on An AI Realist Manifesto: Neither Doomer nor Foomer, but a third more reasonable thing · 2023-04-10T03:51:42.171Z · LW · GW

I have skimmed the Alignment Forum side and read most of MIRI's work before 2015. While it's hard to know about the "majority of people," it does seem that the public reporting centers on two polarized camps. However, in this particular case, I don't think it's just the media. The public figures for both sides (EY and Yann LeCun) seem pretty consistent in their messaging and in talking past each other.

Also, if the majority of people in the field agree with the above, that's great news, and it also means that reasonable centrism needs to be more prominently signal-boosted.

On a more object level, as I linked in the post, I think the Alignment Forum is pretty confused about value learning and about the general promise of IRL to solve it.

Comment by PashaKamyshev on What does it mean for an LLM such as GPT to be aligned / good / positive impact? · 2023-03-21T20:16:19.626Z · LW · GW

Your comment is a little hard to understand. You seem to be saying that "scaling" is going to make it harder to align, which I agree with. I am not sure what "deliberate reasoning" means in this context. I also agree that having a new kind of training process is definitely required to keep GPT aligned either vis-a-vis OpenAI's rules or actually good rules. 

I agree that the current model breaks down into "shoggoth" and "mask." I suspect future training, if it's any good, would need to either train both simultaneously with similar levels of data complexity OR not really have a breakdown into these components at all.

If we are going to have both a "mask" and a "shoggoth", my theory is that the complexity of the mask needs to be higher / the mask needs to be bigger than the shoggoth, and right now that's nowhere near the case.

Comment by PashaKamyshev on AI as a Civilizational Risk Part 6/6: What can be done · 2022-11-06T20:12:41.845Z · LW · GW

While I agree that Twitter is a bad site, I expect some of Musk's actions to make it better (but not fully fix it). Your attempt to tie this to personality-based critiques (STEM / white / male) isn't helpful. Addiction to social platforms is a general issue and needs to be solved in a general way.


However, the solutions you outline are in fact some of the ways the situation will proceed. I don't think 1 [government] is likely, or that it will sit well either.

However, 2 [fix] is plausible. Companies would not "admit" to problems, but they could fix them without "admitting" them. Again, this requires thinking they are not used to, but it is plausible.

4 [new media] is plausible. The field is becoming more open, but there are barriers to entry. 

The piece in general is aimed toward people who have the ability to make 2 and/or 4 happen.

In the absence of 2 and 4, the West eventually collapses and other nations learn from its mistakes.

Comment by PashaKamyshev on AI as a Civilizational Risk Part 3/6: Anti-economy and Signal Pollution · 2022-11-01T20:49:30.824Z · LW · GW

Yeah, defining "harm", or more formally a "non-beneficial modification to a human being", is a hard task and is in many ways the core problem I am pointing at. Allowing people to take part in defining what is "harmful" to themselves is both potentially helpful, since it brings in local information, and tricky, because people may have already been ensnared by a hostile narrow AI into misunderstanding "harm."

Comment by PashaKamyshev on AI as a Civilizational Risk Part 1/6: Historical Priors · 2022-11-01T20:44:34.257Z · LW · GW

Thanks for the comment! I look into some of the philosophy required in part 4, x-risk relationships in part 5, and to-dos in part 6. Understanding consciousness is an important sub-component and might be needed sooner than we think. I think an important piece is understanding which modifications to consciousness are harmful or beneficial. This has a sub-problem of which chemicals or organisms alter it and in what ways, as well as which ideas and experiences seem to have a lasting effect on people. It's possible this is more doable than understanding consciousness as a whole, but it certainly touches a lot of the core problem.

Comment by PashaKamyshev on AI as a Civilizational Risk Part 3/6: Anti-economy and Signal Pollution · 2022-10-31T20:34:55.720Z · LW · GW

Mostly agree. A sophisticated user might have some great feedback on how a website ranks its products, but shouldn't and doesn't want to have access to the internals of the algorithm. So giving some users a slightly better surface area for interaction, aside from "being part of the training data", seems like an important problem to solve. It could be useful all the way toward AGI, as we would need a story for how a particular person still has some capacity for future influence.