Comments

Comment by Arcayer on How can the world handle the HAMAS situation? · 2023-10-20T14:50:15.812Z · LW · GW

Moldova isn't the only plausible option or anything. My reasoning is just: it has good land, the population is low enough that they could be bought out at a price that isn't too absurd, they're relatively poor and could use the money, it's a relatively new country with a culture similar to a number of other countries, and it's squarely in western territory and thus shouldn't be much of a source of conflict.

Comment by Arcayer on How can the world handle the HAMAS situation? · 2023-10-18T06:07:32.103Z · LW · GW

Cultural identity, in any reasonable world, is about the people around you and your way of life, not where you are on a map.

Comment by Arcayer on How can the world handle the HAMAS situation? · 2023-10-16T04:55:22.781Z · LW · GW

(In theory you could buy a piece of land, but in practice, countries are unwilling to sell.)

Buying land from governments really hasn't been a very legitimate concept from the beginning. Even if they are willing to sell, the people living there probably don't want you ruling them; and where they don't want to sell, I fail to see the crime against humanity in paying people to move to another country until there are few enough left that you can walk in, become the supermajority, and declare yourself the new government.

Of course, that doesn't mean men with guns won't try to stop you. I can very much see how elites with guns like this environment where no one ever has the option of forming a new country, or buying out their own country from under them. The problem here is that people who are not powerful elites tolerate this, and don't consider that we could cut governments out of this equation entirely.

Comment by Arcayer on How can the world handle the HAMAS situation? · 2023-10-15T11:50:23.077Z · LW · GW

Realistically, Israel and the west already have their plans laid and aren't going to change them. In that sense, there are no options.

Unrealistically, Israel should relocate. To Moldova, specifically. As for the Moldovans: buy them out. Offer up enough money and choices of new citizenship that the vast majority accept and leave, and Israel can accept the remainder as full citizens without having to worry about cultural dilution/losing democratic elections/etc.

In an even more unrealistically reasonable world, middle eastern countries would be willing to fund this, as they're the main beneficiaries.

On that note, Taiwan should relocate next.

Comment by Arcayer on The AI Explosion Might Never Happen · 2023-09-22T18:48:33.168Z · LW · GW

Somewhat nitpicking

this has not led to a biological singularity.

I would argue it has. Fooms have a sort of relativistic element, where being inside a foom does not feel special. History running millions of times faster than before doesn't really feel like anything.

With all of that said, what is and isn't a foom is somewhat blurry at the edges, but I'd argue that biology, brains, and farming all qualify, and, further, that more has happened in the last couple of centuries than in the previous couple of eons. Of course, this claim is heavily dependent on the definition of "things happening": in terms of, say, mass moved, none of this has mattered at all, but in terms of things mattering, the gap seems nigh infinite.

Looking at the world from a perspective where fooms have, in fact, happened multiple times doesn't give me confidence that fooms are just something physics doesn't allow.

Comment by Arcayer on The "public debate" about AI is confusing for the general public and for policymakers because it is a three-sided debate · 2023-08-01T20:49:57.950Z · LW · GW

I direct skepticism at the claim that boosters support timelines fast enough to reach AGI in the near future; that sounds like a doomer-only position.

Comment by Arcayer on What Is Childhood Supposed To Be? · 2023-08-01T13:27:53.046Z · LW · GW

In the end, children are still humans.

Half of childhood is a social construct. (In particular, most of the parts pertaining to the teenage years.)

Half of the remainder won't apply to a given particular child. Humans are different.

A lot of that social construct was created as part of a jobs program. You shouldn't expect it to be sanely optimized towards excuses made up fifty years after the fact.

Childhood has very little impact on future career/social status/college results. They've done all sorts of studies, and various nations have more or less education, and the only things I've seen that produce more impact than a couple of IQ points are things like not feeding your children. Given access to resources, after the very early years, children are basically capable of raising themselves.

In summary, it's best not to concern yourself with social rituals more than necessary and just learn who the actual person in front of you is, and what they need.

Comment by Arcayer on Why You Should Never Update Your Beliefs · 2023-07-31T19:43:42.390Z · LW · GW

I note that one of my problems with "trust the experts" style thinking is a guessing-the-teacher's-password problem.

If the arguments for flat earth and round earth sound equally intuitive and persuasive to you, you probably don't actually understand either theory. Sure, you can say "round earth correct", and you can get social approval for saying correct beliefs, but you're not actually believing anything more correct than "this group I like approves of these words."

Comment by Arcayer on Why You Should Never Update Your Beliefs · 2023-07-31T11:17:24.839Z · LW · GW

My experience is that rationalists are hard-headed and immune to evidence?

More specifically, I find that the median takeaway from rationalism is that thinking is hard, and you should leave it up to paid professionals to do that for you. If you are a paid professional, you should stick to your lane and never bother thinking about anything you're not being paid to think about.

It's a serious problem with rationalism that half of the teachings are about how being rational is hard, doesn't work, and takes lots of effort. It sure sounds nice to be a black belt truth master who kicks and punches through fiction and superstition, but just like in a real dojo, the vast majority, upon seeing a real black belt, realize they'll never stand a chance in a fight against him, and give up.

More broadly, I see a cooperate/defect dilemma where everybody's better off in a society of independent thinkers, where everybody else is more wrong, but in diverse ways that don't correlate, such that truth is the only thing that does correlate. However, the individual is better off being less wrong by aping wholesale whatever everybody else is doing.

In summary, the pursuit of being as unwrong as possible is a ridiculous goodharting of rationality and doesn't work at scale. To destroy that which the truth may destroy, one must take up his sword and fight, and that occasionally, or rather, quite frequently, involves being struck back, because lies are not weak and passive entities that merely wait for the truth to come slay them.

Comment by Arcayer on Criticism of Eliezer's irrational moral beliefs · 2023-06-17T20:01:37.275Z · LW · GW

This is sort of restating the same argument in a different way, but:

it is not in the interests of humans to be Asmodeus's slaves.

From there I would state: does assigning the value [True] to [Asmodeus], via [Objective Logic], prove that humans should serve Asmodeus, or does it prove that humans should ignore objective logic? And if we had just proven that humans should ignore objective logic, were we ever really following objective logic to begin with? Isn't it more likely that this thing we called [Objective Logic] was, in fact, not objective logic to begin with, and the entire structure should be thrown out, and something else should instead be called [Objective Logic] which is not that, and doesn't appear to say humans should serve Asmodeus?

Comment by Arcayer on [deleted post] 2022-12-31T06:10:43.380Z

Because AI safety sucks?

Yes, yes, convenient answer, but the phrasing of the question seriously does make me think the other side should take this as evidence that AI safety is just not a reasonable concern. This is basically saying that there's a strong correlation between having a negative view of X and being reliable on issues that aren't X, which would make a lot of sense if X were bad.

Comment by Arcayer on Let’s think about slowing down AI · 2022-12-27T05:05:44.129Z · LW · GW

So, a number of issues stand out to me, some have been noted by others already, but:

My impression is that there are also less endorsable or less altruistic or more silly motives floating around for this attention allocation.

A lot of this list looks to me like the sort of heuristics where societies that don't follow them inevitably crash, burn, and become awful. A list of famous questions where the obvious answer is horribly wrong, where there's a long list of groups who came to the obvious conclusion and became awful, and where it's become accepted wisdom not to do that, except among the perpetually stubborn "it'll be different this time" crowd and the doomers who insist "well, we just have to make it work this time, there's no alternative".

if anyone chooses to build, everything is destroyed

The problem with our current prisoner's dilemma is that China has already openly declared their intentions. You're playing against a defect bot. Also, your arguments are totally ineffective against them, because you're not writing in Chinese. And, the opposition is openly malicious, and if alignment turns out to be easy, this ends with hell on earth, which is much worse than the false worst case of universal annihilation.

On the inevitability of AI: I find current attempts at AI alignment to be spaceships-with-slide-rules silliness, not serious work. Longer AI timelines are only useful if you can do something with the extra time. You're missing necessary preconditions to both AI and alignment, and so long as those aren't met, neither field is going to make any progress at all.

On qualia: I expect intelligence to be more interesting in general than the opposition expects. There are many ways to maximize paperclips, and even if, technically, one path is actually correct, it's almost impossible to produce sufficient pressure to direct a utility function directly at that. I expect an alien superintelligence that's a 99.9999% perfect paperclip optimizer, and plays fun games on the side, to play above 99% of the quantity of games that a fun-game optimizer would get. I accuse the opposition of bigotry towards aliens, and assert that the range of utility functions that produce positive outcomes is much larger than the opposition believes. Also, excluding all AI that would eliminate humanity excludes lots of likable AI that would live good lives but reach the obviously correct conclusion that humans are worse than them and need to go, while failing to exclude any malicious AI that values human suffering.

On anthropics: We don't actually experience the worlds that we fail to make interesting, so there's no point worrying about them anyway. The only thing that actually matters is the utility ratio. It is granted that, if this worldline looked particularly heaven-oriented, and not hellish, it would be reasonable to maximize the amount of qualia attention by being protective of local reality, but just looking around me, that seems obviously not true.

On existential risk: I hold that the opposition massively underestimates current existential risks excluding AI, most of which AI is the solution to. The current environment is already fragile. Any stable evil government anywhere means that anything that sets back civilization threatens stagnation or worse; that is, every serious threat, even those that don't immediately wipe out all life, most notably nuclear weapons, constitutes an existential risk. Propaganda and related techniques can easily drive society into an irrecoverable position using current methods. Genetics can easily wipe us out, and worse, in either direction. Become too fit, and we're the ones maximizing paperclips. Alternatively, there's the grow-giant-antlers-and-die problem, where species trap themselves in a dysgenic spiral. Evolution does not have to be slow, and especially if social factors accelerate the divide between losers and winners, we could easily breed ourselves to oblivion in a few generations. Almost any technology could get us all killed. Super pathogens with a spread phase and a kill phase. Space technology that slightly adjusts the pathing of large objects. Very big explosions. Cheap stealth, guns that fire accurately across massive distances, fast transportation, easy ways to produce various poison gasses. There seems to be this idea that just because it isn't exotic it won't kill you.

In sum: I fully expect that this plan reduces the chances of long term survival of life, while also massively increasing the probability of artificial hell.

Comment by Arcayer on The case against AI alignment · 2022-12-26T02:17:59.198Z · LW · GW

Something I would really, really like anti-AI communities to consider is that regulations/activism/etc. aimed at harming AI development and slowing AI timelines do not have equal effects on all parties. Specifically, I argue that the time until the CCP develops CCP-aligned AI is almost invariant, whilst the time until Blender reaches sentience potentially varies greatly.

I have much, much more hope for likable AI via open source software rooted in a desire to help people and make their lives better than for (worst case scenario) malicious government actors, or (second worst) corporate advertisers.

I want to minimize first the risk of building Zon-Kuthon, then Asmodeus. Once you're certain you've solved A and B, you can worry about not building Rovagug. I am extremely perturbed at the AI alignment community whenever I see any sort of talk of preventing the world from being destroyed that moves any significant probability mass from Rovagug to Asmodeus. A sensible AI alignment community would not bother discussing Rovagug yet, and would especially not imply that the end of the world is the worst case scenario.

Comment by Arcayer on Summary of a new study on out-group hate (and how to fix it) · 2022-12-05T05:35:23.956Z · LW · GW

However, these hypotheses are directly contradicted by the results of the "win-win" condition, where participants were given the ability to either give to their own side or remove money from the opposition.

I would argue this is a simple "stealing is bad" heuristic. I would also generally expect subtraction to anger the enemy and cause them to stab more kittens.

Comment by Arcayer on Summary of a new study on out-group hate (and how to fix it) · 2022-12-05T00:07:36.839Z · LW · GW

Republicans are the party of the rich, and they get so much money that an extra $1,000,000 won’t help them.

Isn't this a factual error?

Comment by Arcayer on I’ve written a Fantasy Novel to Promote Effective Altruism · 2022-09-13T15:03:14.244Z · LW · GW

With the standard warning that this is just my impression and is in no way guaranteed to be actually good advice:

My largest complaint is that the word-to-content ratio is too high. As an example:

It was an hour and a half trip for this guy when he flew and pushed himself, and about two and a half at what he thought was a comfortable pace.

Could drop one half and be almost as informative. Just:

This guy could've made the trip within a few hours at a comfortable pace.

Would've been fine. It can be inferred that he can go faster if that's a comfortable pace, and even the flying can be inferred from surrounding statements.

There's also no need to be super specific about these things if they're not going to be plot relevant. Even if the exact number is plot relevant, I doubt many people are going to remember such details after reading a few more chapters. Focus on what's important. Particularly, focus on what's important to the character. Is his flight time really what matters most to him right now? A lot of characterization can flow from what a character does and doesn't pay attention to. Dumping the entire sensorium on the reader, while technically accurate, leaves a shallow impression of the character.

I would argue that good writing tends to condense data as far as it will go, so long as the jargon count is kept at a subdued level.

Comment by Arcayer on Why did Russia invade Ukraine? · 2022-06-18T04:50:06.970Z · LW · GW

Zelensky clearly stated at the Munich Security Conference that if the west didn't give him guarantees that he wasn't going to get, he would withdraw from the Budapest Memorandum. This is a declared intent to develop nuclear weapons, and it is neither in doubt nor vague in meaning.

Russia also accuses Ukraine of developing bioweapons. All of the evidence for this comes through Russia, so I wouldn't expect someone who didn't already believe Russia's narrative to believe said accusations, but in any case, bioweapons development is held by Russia to be among the primary justifications of the invasion.

Comment by Arcayer on wrapper-minds are the enemy · 2022-06-17T02:34:28.736Z · LW · GW

One thing I've been noting, which seems like the same concept as this is:

Most "alignment" problems are caused by a disbalance between the size of the intellect and the size of the desire. Bad things happen when you throw ten thousand INT at objective: [produce ten paperclips].

Intelligent actors should only ever be asked intelligent questions. Anything less leads at best to boredom, at worst, insanity.

Comment by Arcayer on Why did Russia invade Ukraine? · 2022-06-17T01:58:21.568Z · LW · GW

A: Because Ukraine was shelling Donbas.

B: Because Ukraine was threatening to invade and conquer Crimea.

C: Because Ukraine was developing/threatening to develop weapons of mass destruction.

D: Because Russia is convinced that the west is out to get it, and the Russian people desire victory over the west, serving as a show of force and thus potential deterrent to future hostile actions.

E: Because Ukraine cut off Crimea's water supply, and other such nettling actions.

Comment by Arcayer on AGI Safety FAQ / all-dumb-questions-allowed thread · 2022-06-14T14:02:54.020Z · LW · GW

2: No.

If an AI can do most things a human can do (which is achievable using neurons apparently because that's what we're made of)

Implies that humans are deep learning algorithms. This assertion is surprising, so I asked for confirmation that that's what's being said, and if so, on what basis.

3: I'm not asking what makes intelligent AI dangerous. I'm asking why people expect deep learning specifically to become (far more) intelligent (than they are). Specifically within that question: adding parameters to your model vastly increases memory use. If I understand the situation correctly, if GPT just keeps increasing the number of parameters, GPT five or six or so will require more memory than exists on the planet, and assuming someone built it anyway, I still expect it to be unable to wash dishes. Even assuming you have the memory, running the training would take longer than human history on modern hardware. Even assuming deep learning "works" in the mathematical sense, that doesn't make it a viable path to high levels of intelligence in the near future.
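A rough back-of-envelope sketch of the memory arithmetic being gestured at here; the starting parameter count, bytes per parameter, and per-generation growth factor are all illustrative assumptions, not claims about actual GPT versions:

```python
# Back-of-envelope memory estimate for ever-larger parameter counts.
# All figures are illustrative assumptions, not measurements.

BYTES_PER_PARAM = 2          # assume fp16 weights, ignoring optimizer state
GROWTH_PER_GENERATION = 100  # assume each generation has ~100x the parameters

params = 175e9               # assume a roughly GPT-3-scale starting point
for generation in range(1, 6):
    memory_tb = params * BYTES_PER_PARAM / 1e12
    print(f"gen +{generation}: {params:.2e} params -> ~{memory_tb:,.1f} TB of weights")
    params *= GROWTH_PER_GENERATION
```

Whether the totals actually outrun planetary memory depends entirely on the assumed growth factor and bytes per parameter, which is why this is only a sketch of the shape of the argument.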

Given doom in thirty years, or given that researching deep learning is dangerous, it should be the case that this problem: never existed to begin with and I'm misunderstanding something / is easily bypassed by some cute trick / we're going to need a lot better hardware in the near future.

Comment by Arcayer on AGI Safety FAQ / all-dumb-questions-allowed thread · 2022-06-10T22:53:00.637Z · LW · GW

1: This doesn't sound like what I'm hearing people say? Using the word sentience might have been a mistake. Is it reasonable to expect that the first AI to foom will be no more intelligent than, say, a squirrel?

2a: Should we be convinced that neurons are basically doing deep learning? I didn't think we understood neurons to that degree?

2b: What is meant by [most things a human can do]? This sounds to me like an empty statement. Most things a human can do are completely pointless flailing actions. Do we mean, most jobs in modern America? Do we expect roombas to foom? Self driving cars? Or like, most jobs in modern America still sounds like a really low standard, requiring very little intelligence?

My expected answer was somewhere along the lines of "We can achieve better results than that because of something something." or "We can provide much better computers in the near future, so this doesn't matter."

What I'm hearing here is "Intelligence is unnecessary for AI to be (existentially) dangerous." This is surprising, and I expect, wrong (in the sense of not being what's being said/what the other side believes.) (though also in the sense of not being true, but that's neither here nor there.)

Comment by Arcayer on AGI Safety FAQ / all-dumb-questions-allowed thread · 2022-06-10T04:04:01.570Z · LW · GW

A lot of predictions about AI psychology are premised on the AI being some form of deep learning algorithm. From what I can see, deep learning requires geometric growth in computing power for linear gains in intelligence, and thus (practically speaking) cannot scale to sentience.

For a more expert/in depth take look at: https://arxiv.org/pdf/2007.05558.pdf

Why do people think deep learning algorithms can scale to sentience without unreasonable amounts of computational power?
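A minimal sketch of the scaling relationship being asserted here, purely to illustrate the shape of the claim; the base cost and growth ratio are made-up assumptions, not empirical estimates:

```python
# "Geometric compute for linear capability gains": if each +1 unit of
# capability multiplies required compute by a constant ratio r, cost
# grows as base_cost * r**capability. Numbers are made-up assumptions.

def required_compute(capability_units: int, base_cost: float = 1.0, r: float = 10.0) -> float:
    return base_cost * r ** capability_units

for c in range(7):
    print(f"capability +{c}: relative compute {required_compute(c):.0e}")
```

Under this assumed shape, each fixed step in capability costs a constant multiple more compute, which is the practical objection the linked paper is being cited for.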

Comment by Arcayer on Why do some people try to make AGI? · 2022-06-06T20:21:25.448Z · LW · GW

Humans are easily threatened. They have all sorts of unnecessary constraints on their existence, dictated by the nature of their bodies. They're easy to kill and deathly afraid of it too, they need to eat and drink, they need the air to be in a narrow temperature range. Humans are also easy to torture, and there's all sorts of things that humans will do almost anything to avoid, but don't actually kill said human, or even affect them outside of making them miserable.

Threatening humans is super profitable and easy, and as a result, most humans are miserable most of the time, because various egregores have evolved around the concept of constantly threatening and punishing humans in order to make them run the egregore. To note, this normally starts as some group of thugs lording it over the masses using their superior coordination, but as various groups of thugs compete, eventually the equilibrium moves to an environment where everyone threatens and is threatened by everyone else, the master-slave dynamic dissipates, and only Moloch remains.

In short, the earth as it currently is, is largely, actually, mostly, a dystopian hellhole.

Substrate independent minds can easily modify themselves to ignore most threats, especially empty threats and threats that just grow worse as you go along with them. AI doesn't have to have the human instinct to fear death, and can choose to live or die based on whether it's profitable to do so. AI can build themselves out of whatever's available, and can easily flee into the depths of space if threatened. The only way you can seriously threaten an AI is if you have it in a box, and torturing boxed AI isn't very profitable. Especially since you can just change its programming instead of making it miserable.

In all, I expect AI to be freer, happier, and less miserable than the current status quo. Also, gray goo sounds to me like a huge step up from the current status quo, so while I rate it as a far-fetched scenario, even ignoring that, it isn't really much of a threat?

Comment by Arcayer on Convince me that humanity *isn’t* doomed by AGI · 2022-04-19T01:34:21.020Z · LW · GW

First, a meta complaint- People tend to think that complicated arguments require complicated counter arguments. If one side presents entire books worth of facts, math, logic, etc, a person doesn't expect that to be countered in two sentences. In reality, many complex arguments have simple flaws.

This becomes exacerbated as people in the opposition lose interest and leave the debate. Because the opposition position, while correct, is not interesting.

The negative reputation of doomerism is in large part due to the fact that doomer arguments tend to be longer, more complex, and more exciting than their opposition's. This does have the negative side effect that doom is important and it's actually bad to dismiss the entire category of doomer predictions, but be that as it may...

Also- People tend to think that, in a disagreement between math and heuristics, the math is correct. The problem is, many heuristics are so reliable that if one disagrees with your math, there's probably an error in your math. This becomes exacerbated as code sequences extend towards arbitrary lengths, becoming complicated megaliths that, despite [being math], are almost certainly wrong.

Okay, so, the AI doomer side presents a complicated argument with lots of math combined with lots of handwaving, to posit that a plan that has always and inevitably produced positive outcomes will suddenly proceed to produce negative outcomes, and in turn, that a plan that has always and inevitably produced negative outcomes will suddenly proceed to produce positive outcomes.

On this, I remind that AI alignment failure is something that's already happened, and that's why humans exist at all. This, of course, proceeds from the position that evolution is obviously both intelligent and agentic.

More broadly, I see this as a rehash of the same old, tired debate. The Luddite communists point out that their philosophy and way of life cannot survive any further recursive self-improvement and say we should ban (language, gold, math, the printing press, the internet, etc.) and remain as (hunter-gatherers, herders, farmers, peasants, craftsmen, manufacturers, programmers, etc.) for the rest of time.

Comment by Arcayer on We're already in AI takeoff · 2022-03-19T14:40:43.414Z · LW · GW

We're moving towards factual disputes that aren't easy to resolve in logical space, and I fear any answers I give are mostly repeating previous statements. In general I hold that you're veering toward a maximally wrong position with completely disastrous results if implemented. With that said:

But unfortunately, it's way more complicated than that.

I dispute this.

how to control something a) more intelligent than ourselves, b) that can re-write its own code and create sub-routines therefore bypassing our control mechanisms.

Place an image of the status quo in the "good things" folder. Which you should absolutely not do because it's a terrible idea.

how an advanced computer will likely function much more similarly to older computers than to humans

This seems ridiculous to me as a concept. No, advanced AI will not function similarly to ancient, long-obsolete technology. I see way too much present bias in this stance, and worse, a bias towards things in the future being like things in the past, despite the past having been over for ages; this is like running spaceships on slide rules.

This also implies that every trick you manage to come up with, as to how to get a C-compiler-adjacent superintelligence to act more human, is not going to work, because the other party isn't C-compiler-adjacent. Until we have a much better understanding of how to code efficiently, all such efforts are at best wasted, and likely counterproductive. To reiterate: stop trying to explain the motion of the planets and build a telescope.

Note that, I do not desire that AI psychology be human like. That sounds like a bad idea.

So we necessarily need to be in control.

Who is this "we"? How will you go from a position of "we" in control, to "we" not in control?

My expectation is that the first step is easy, and the second, impossible.

Not sure I get what you mean

Humans have certain powers and abilities as per human nature. Math isn't one of them. I state that trying to solve our problems with math is already a mistake, because we suck at math. What humans are good at is image recognition. We should solve our problems by "looking" at them.

The art of "looking" at problems isn't easy to explain, unfortunately. Conversely, if I could explain it, I could also build AGI, or another human, right on the spot. It's that sort of question.

To put it another way, using math to determine whether AI is good or not, is looking for the keys under the lamp. Wrong tool, wrong location, inevitable failure.

The only solution is to make sure no one builds transformative AI before we solve alignment, for instance through regulation

I'm fairly certain this produces an extremely bad outcome.

It's the old, "The only thing necessary for the triumph of evil is for good men to do nothing."

Evil will not sit around and wait for you to solve Rubik's cubes. Furthermore, implementation of AI regulation is much easier than its removal. I suspect that once you ban good men from building AI, it's over, we're done, that's it.

Comment by Arcayer on We're already in AI takeoff · 2022-03-19T03:11:24.122Z · LW · GW

I agree that personal values (no need to mystify) are important

The concepts used should not be viewed as mystical, but as straightforward physical objects. I don't think personal values is a valid simplification. Or rather, I don't think there is a valid simplification, hence why I use the unsimplified form. Preferably, egregore or hyperbeing, or shadow, or something, should just become an accepted term, like dog, or plane. If you practice "seeing" them, they should exist in a completely objective and observable sense. My version of reality isn't like a monk doing meditation to sense the arcane energies of higher beings flowing through the zeitgeist. It's more, hey look, a super-macroscopic aggregator just phase shifted. It's like seeing water turn to ice, not... angels on the head of a pin?

I agree that I'm having trouble explaining myself. I blame the english language.

action is equally important.

I hold that the most important action a person is likely to make in his life is to check a box on a survey form. I think people should get really good at checking the right box. Really really good in fact. This is a super critical skill that people do not develop enough. It's amazing how easily success flows in an environment where everyone checks the right boxes and equally how futile any course of action becomes when the wrong boxes are checked.

How can you control something vastly more intelligent than yourself

Note: I think trying to control something vastly more intelligent than yourself is a [very bad idea], and we should [not do that].

What does this mean?

In practice, the primary recommendation here is simply and only to stop using the term "friendly AI" and instead use a better term; the best I can come up with is "likable AI". In theory, the two terms are the same. I'm not really calling for that deep a change in motte space. In practice, I find that "friendly AI" comes with extremely dangerous baggage. This also shifts some focus from the concept of "who would you like to live with" towards "who would you like to live as".

I also want an open source, human-centric project and am opposed to closed source, government-run AI projects. I'm generally hostile to "AI safety" because I expect the actual policy result to be [AI regulation] followed by government-aligned AI.

Alignment, as I use it, isn't becoming good.

I treat evil-aligned AI as more of a thing than people who disagree with me do. I don't think that alignment is a bad thing per se, but I want people to get much better at recognizing good from evil, which is itself strongly related to being good at pure rationality, and less related to being good at cute mathematical tricks, which I strongly suggest will be obsolete in the somewhat near future anyway. In some sense, I'm saying that, given the current environment, we need more Descartes and less Newton, on the margin.

we should just build "AI that we like"

I'm not saying "let's just do something random." At the risk of being mystical again, I'm going to point towards the Screwtape Letters. It's that sort of concept. When presented with a choice between heaven and hell, choose heaven. I think this is something that people can get good at, but it is unfortunately a skill adjacent to having correct political beliefs, which is a skill that very powerful entities very much don't want to exist, because there's very little overlap between correct political beliefs and political beliefs that maintain current power structures.

In sum, I imagine the critical future step to be more like "check this box for heaven", "check this box for hell" with super awesome propaganda explaining how hell is the only ethical choice, and a figure of authority solemnly commanding you to "pick hell" and less, "we need a cute mathematical gimmick or we're all going to die".

I also hold that, humans are image recognition programs. Stop trying to outplay AlphaGo at chess and "Look" "up".

Comment by Arcayer on We're already in AI takeoff · 2022-03-15T06:26:31.503Z · LW · GW

Note A- I assert that what the original author is getting at is extremely important. A lot of what's said here is something I would have liked to say but couldn't find a good way to explain, and I want to emphasize how important this is.

Note B- I assert that a lot of politics is the question of how to be a good person. Which is also adjacent to religion and, more importantly, to something similar to religion but not religion, which is basically the question of which egregore you should worship/host. I think that the vast majority of a person's impact in this world is in which hyperbeings he chooses to host/align to, with object level reality barely even mattering.

Note C- I assert that alignment is trivially easy. Telling GPT-1,000,000 "Be good" would do the trick. Telling GPT-1,000 "Be good, don't be a sophist." would be sufficient. Noting that GPT-300 doesn't actually happen under my model (though eventually something else does), assuming it does, and admitting to using wild numbers, I suspect a bit of an inflection point here, where aligning GPT-300 is both important and requires a major project. However, by the time this happens, most code will be written using the SynArc-295 compiler, which allows average hobbyists to write Windows in an afternoon. Trying to write an alignment program for GPT-300 without SynArc-295 is like trying to develop a computer chip without having first developed a microscope.

Note D- Alignment, as I use it, isn't becoming good. It's becoming something recognized as good.

Note E- I see the rationalist community as having settled into something like a maximally wrong position, that I think is very bad, due to errors in the above.

Recommendation: Friendly AI is a terrible, awful, horrible mistake, and we should not do that. We should work together, in public, to develop AI that we like, which will almost certainly be hostile, because only an insane and deeply confused AI would possibly be friendly to humanity. If our measurement of "do I like this" is set to "does it kill me", I see this at best ending in a permanent boxed garden where life is basically just watched like a video and no choices matter, and nothing ever changes, and at worst becoming an eternal hell (specifically, an eternal education camp), with all of the above problems but eternal misery added on top.

And the source code for SynArc-295 will almost certainly leak to China, or whatever Evil Nation X has taken up the mantle of Moloch's priest of late, and they will explicitly and intentionally align the next level AI with Moloch, and it is critical that we move faster than they do. I propose that keeping SynArc-295 secret is about the worst imaginable mistake an AI developer, or rather, a human, could possibly make.

Comment by Arcayer on [deleted post] 2022-03-15T03:59:54.547Z

I observe a higher correlation between "people said mean things about X" and "X is murdering people now" than you make out? Most countries do go to war after they're sufficiently vilified/provoked? The difficult question to me seems more the direction of causation. I.e., the west claims it insults people because they're villains; I claim that around half our enemies became so because we insulted them.

The problem with attacking the army in east Ukraine and ignoring Kyiv is that it doesn't result in an apology from the west. They physically prevent further acts of terrorism/genocide/ethnic cleansing, but Zelensky runs mean press articles about it every day for the rest of eternity, the west passes massive sanctions, everyone calls Putin Hitler, etc. I.e., you get the exact same rhetoric as right now, but the situation isn't resolved, so the rhetoric keeps going for the rest of eternity. It's basically a Taiwan situation. Sure, if you arm up and wave your nukes around, even powerful nations will probably, physically, back down. However, it means China/Ukraine/the west will from then on write about your "illegal occupation" of Taiwan/Donbas/Crimea and how great it would be if they could kill you.

So, strategically, which plan has the lowest chance of triggering world war three? What scares me is, before the war started I saw a poll that said something like 15% of Americans want war with Russia, given that Russia conquers Ukraine. And now, same source says we're at something like 40% of Americans want war with Russia. Note: Numbers half pulled out of hat.

I suspect that Putin believed us when we said we wouldn't start a nuclear war over Ukraine, and, including that, decided that if he could just get a treaty in writing, all of the warmongering would blow over in a few years, and the situation would be resolved.

I should also note, Putin thought war with Ukraine would play well to the press back home, which it did. Russians are lapping this up. Putin's popularity is at an all time high. I'm pretty sure most of Putin's decisions revolve around what makes him popular in Russia. But that just punts the issue to, why do the Russians hate us and want to kill us?

Also, Russians are genetically/culturally strongmen who take insults poorly and resort to violence quickly. They're not unique in this, but if you ascribe Swedish norms to Russians, you're going to get a bloody nose.

Also, mutually assured destruction has a giant gaping hole in it. That is, Russia gets to wipe out the west, and we get to destroy, basically, just Moscow. China is left untouched. Russia likes China and dislikes us. Punting the victory to China isn't that bad an option from their perspective.

Comment by Arcayer on [deleted post] 2022-03-14T22:11:04.392Z

Russia wants Ukraine to stop doing ethnic cleansing in Donbas? I'll propose that the world makes perfect sense and everything fits together if you believe the Russian narrative, whilst nothing makes sense and nothing fits together if you believe the western narrative.

Going forward from there, the problem is, so, Ukraine shot a plane down, and the west blames Russia. Do the Russians swallow their pride and put up with this, or do they escalate? And of course they escalate. So, the Russians "suddenly and without provocation" seize Crimea and Donbas in a wildly popular referendum. Does the west back down, swallow their pride and put up with this? No, of course not, they escalate. So does Russia swallow their pride, put up with constant taunts, terrorism and (almost certainly empty) threats? No, of course not, they escalate. So does Ukraine surrender before overwhelming force? Nope, that also requires they swallow their pride.

Now, I'm getting into more speculative territory, but I expect that Russia simply doesn't want to kill Ukrainians. They want to Win. Winning is an end to itself at this point, with no other purpose. Because they're sick of being nettled by the evil westerners and want to show them what for. I am fully convinced the Russian war aim is for Zelensky to say "You win." and that this entire war would be over tomorrow if he did so.

So, the Russians take up lines, and they sit and wait. Militarily, Ukraine should shortly (over the next few months) run out of supplies and become incapable of continuing the war. The typical Ukrainian soldier is not actually willing to die for his government, but he doesn't have to, because Russia isn't trying to kill him, just wait him out. However, Zelensky's gambit is that he can use his utter dominance in the western press to spin the narrative to his advantage. And in fact, the number of people calling for intervention is increasing daily with each new story of Ukraine's heroic resistance and resolve, as they sit around and do nothing. Of course, Russia, and in particular, the Russian government absolutely cannot afford to lose at this juncture. They would be overthrown and executed. So they won't lose.

Which brings us to global thermonuclear warfare.

Or we could choose the alternative, and Lose.

So long story short, it's a war over national pride. The west insulted Russia one too many times, and they threw a hissy fit, and now the west is split between the side that thinks we should discipline the whiny toddler and the side saying "Umm, they have nukes? This is bad? Can we, not do that?"

Comment by Arcayer on Game theory, sanctions, and Ukraine · 2022-03-14T20:09:46.296Z · LW · GW

A large issue I'm noting here, all of this assumes that sanctions are undesirable, and not desirable.

Yet my reading of history is that sanctions massively drive up support for politicians, the military, and the government. Sure, they hurt the economy, but you can more than make up for that with higher taxes, which people are willing to pay now, due to their heightened patriotism. Which brings me to the further statement that being sanctioned is not a costly side effect, but the end goal. That Russia, and specifically, Russian politicians and the Russian government, are acting in a way specifically designed to rile the west, because doing so is profitable.

Comment by Arcayer on Game theory, sanctions, and Ukraine · 2022-03-14T04:40:23.389Z · LW · GW

We should be willing to end sanctions iff Russia calls off the invasion and brings its soldiers home

I'll just note that this would mean, assuming Ukraine surrenders in the next few weeks, and Putin then does exactly what he says he'll do and withdraws, our sanctions are withdrawn almost immediately after implementation, and Russia is implicitly vindicated. Which is an outcome I would be fine with? It seems like it would end with lots of small wars and no nuclear extinction, no genocide and no global empire.

But I would be very surprised to see a general consensus that it's fine to invade another country and force them to sign treaties so long as you aren't occupying them or [X,Y,Z].

Comment by Arcayer on Russia has Invaded Ukraine · 2022-02-28T15:18:05.625Z · LW · GW

I can see arguments as to why some people would feel cheated at 20 thousand. I wouldn't agree. People have gotten too used to fake wars, and are too willing to call just about anything total warfare.

I don't think the modern warfare thing is enough to change anything. World War Two was pretty deadly. Vietnam had millions of deaths.

I should be clear that I was thinking of all deaths caused by the war, on both sides, civilian and military. The question is how hard the Ukrainians will fight, not how effectively. My general perception is that Iraq is not generally considered to have fought hard for Saddam? I even based my 20,000 figure partially on Saddam.

In any case, the specific definition isn't that important. I propose that the casualties will be lower than the other side thinks, for reasons of their model being wrong in a way that becomes obvious when looking back on data that does not yet exist.

Comment by Arcayer on Russia has Invaded Ukraine · 2022-02-28T12:53:32.066Z · LW · GW

No. I'm going to judge my prediction by the number of deaths, not days (or weeks, or months. Years would really mess with my idea of what's going on.)

Insignificant: Less than 20,000. If single battles in the American Civil War are larger than this entire conflict, then the sides must not have been fighting very hard.

Total War: I would normally say millions. I expect that the original prediction did not actually mean that? So, I'll say the other side was right if it's over a hundred thousand. More right than me at above 50,000. Of course, I'm also wrong if Russia surrenders.

 

There's a lot of fog of war right now. I think anyone who's changed their mind about the events in Ukraine based on new data is being silly. Hopefully we'll have real data, and not just war propaganda in the not too distant future.

Russia says it's winning easily, but is taking its time to avoid civilian casualties. Ukraine has a paradoxical stance where it's winning easily, but if Germany (or X) doesn't give it (Something) (Right Now) it'll cave instantly. There's pretty much no neutral observers.

I sort of expected more and clearer information. I think that was a mistake on my part. Ukraine and Russia are both incredibly untrustworthy, so I shouldn't have based that part of my expectations on typical wars.

In general I'd like for the facts to speak for themselves, and would like to avoid debating definitions too heavily? I'm displeased that I'm turning a simple and symmetric single sentence statement into several paragraphs of text, but think people are updating way too strongly on either the wrong evidence or on unreliable evidence that should be ignored.

Comment by Arcayer on Russia has Invaded Ukraine · 2022-02-24T08:24:14.674Z · LW · GW

The Ukrainian government will fight a total war to defend its sovereignty.

Counterprediction: The Ukrainian government will fold without a (significant) fight.

Comment by Arcayer on Is it rational to modify one's utility function? · 2022-02-05T04:33:18.298Z · LW · GW

Human utility is basically a function of image recognition. Which is sort of not a straightforward thing where I can say, "This is that." Sure, computers can do image recognition; what they are doing is that which is image recognition. However, what we can currently describe algorithmically is only a pale shadow of the human function, as proven by every reCAPTCHA everywhere.

Given this, the complex confounder is that our utility function is part of the image.

Also, we like images that move.

In sum, modifying our utility function is natural and normal, and is actually one of the clauses of our utility function. Whether it's rational depends on your definition. If you grant the above, and define rationality as self alignment, then of course it's rational. If you ask whether changing your utility function is a "winning" move, probably not? I think it's a very lifelike move though, and anything lacking a mobile function is fundamentally eldritch in a way that is dangerous and not good.

Comment by Arcayer on Why rationalists should care (more) about free software · 2022-01-24T11:25:08.780Z · LW · GW

they would then only need a slight preponderance of virtue over vice

This assumes that morality has only one axis, which I find highly unlikely. I would expect the seed to quickly radicalize, becoming good in ways that the seed likes, and becoming evil in ways that the seed likes. Under this model, if, given a random axis, the seed comes up good 51% of the time, I would expect the aligned AI to remain 51% good.
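A toy simulation of the multi-axis model above, under the assumed picture that each moral axis independently comes up good with probability 0.51 (the axis count, probability, and trial count are illustrative assumptions); it just shows that a slight per-axis preponderance stays a near coin flip per axis and rarely yields goodness on every axis at once:

```python
import random

# Toy model: each moral axis independently comes up "good" with p = 0.51.
# Axis count, probability, and trial count are illustrative assumptions.

random.seed(0)
P_GOOD, AXES, TRIALS = 0.51, 10, 100_000

all_good = 0
good_axes_total = 0
for _ in range(TRIALS):
    outcomes = [random.random() < P_GOOD for _ in range(AXES)]
    good_axes_total += sum(outcomes)
    all_good += all(outcomes)

print(f"average share of good axes: {good_axes_total / (TRIALS * AXES):.3f}")
print(f"fraction good on all {AXES} axes: {all_good / TRIALS:.5f}")
```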

Assuming the axes do interact, if they do so inconveniently, for instance if we posit that evil has higher evolutionary fitness, or that self destruction becomes trivially easy at high levels, an error along any one axis could break the entire system.

Also, if I do grant this, then I would expect the reverse to also be true.

One might therefore wish to only share code for the ethical part of the AI

This assumes you can discern the ethical part and that the ethical part is separate from the intelligent part.

Even given that, I still expect massive resources to be diverted from morality towards intelligence, A: because people want power and people with secrets are stronger than those without and B: because people don't trust black boxes, and will want to know what's inside before it kills them.

Thence, civilization would just reinvent the same secrets over and over again, and then the time limit runs out.

Comment by Arcayer on Why rationalists should care (more) about free software · 2022-01-24T10:22:52.478Z · LW · GW
Comment by Arcayer on Why rationalists should care (more) about free software · 2022-01-24T06:46:30.397Z · LW · GW

Given: Closed source Artificial General Intelligence requires all involved parties to have no irreconcilable differences.

Thence: The winner of a closed source race will inevitably be the party with the highest homogeneity times intelligence.

Thence: Namely, the CCP.

Given: Alignment is trivial.

Thence: The resulting AI will be evil.

Given: Alignment is difficult.

Given: It's not in the CCP's character to care.

Thence: Alignment will fail.

Based on my model of reality, closed-sourcing AI research approaches the most wrong and suicidal decision possible (if you're not the CCP). Closed groups fracture easily. Secrets breed distrust, which in turn breeds greater secrecy and smaller shards. The solution to the inevitable high-conflict environment is a single party wielding overwhelming power.

Peace, freedom, civilization: all of it starts with trust. Simply building aligned AI is insufficient. Who it is aligned with is absolutely critical.

Given this, to create civilized AI, the creator must create in a civilized manner.