Comments
Thanks for the reply. I remembered a recent article by Evans and thought that reasoning models might show different behavior. Sorry if this sounds silly.
Are you planning to test this on reasoning models?
I agree. But people now write about short timelines so often that it seems appropriate to recall a possible reason for uncertainty.
Doesn't that seem like a reason to be optimistic about reasoning models?
There doesn't seem to be a consensus that ASI will be created in the next 5-10 years. This means that current technology leaders and their promises may be forgotten.
Does anyone else remember Ben Goertzel and Novamente? Or Hugo de Garis?
Yudkowsky may think that the plan 'Avert all creation of superintelligence in the near and medium term — augment human intelligence' has a <5% chance of success, but that your plan has a <<1% chance. Obviously, you and he disagree not only on conclusions but also on models.
EY is known for considering humanity almost doomed.
He may think that the idea of human intelligence augmentation is likely to fail. But it's the only hope. Of course, many will disagree with this.
It seems that we are already at the GPT-4.5 level? Except that reasoning models have muddled the picture, and an extra OOM of inference compute can have roughly the same effect as an extra ~OOM of training, as I understand it.
By the way, you've analyzed the scaling of pretraining a lot. But what about inference scaling? It seems that o3 already used thousands of GPUs to solve the ARC-AGI tasks.
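For a rough sense of the compute side of that comparison, here is a minimal Python sketch using the common 6·N·D (training) and 2·N·T (inference) FLOP approximations. Every concrete number below is my own illustrative assumption, not a figure from this thread:

```python
# Rough comparison of training vs. inference compute, using the common
# approximations: training FLOPs ~= 6*N*D, inference FLOPs ~= 2*N*T.
# All concrete numbers are illustrative assumptions.

N = 1e12            # assumed parameter count
D = 1e13            # assumed training tokens
T_per_task = 1e6    # assumed reasoning tokens spent on one hard task

train_flops = 6 * N * D
infer_flops_per_task = 2 * N * T_per_task

# Extra compute needed to extend the training run by one order of magnitude.
extra_training_oom = 9 * train_flops
tasks_equivalent = extra_training_oom / infer_flops_per_task

print(f"Training run:            {train_flops:.1e} FLOPs")
print(f"One long reasoning task: {infer_flops_per_task:.1e} FLOPs")
print(f"Tasks costing as much as +1 OOM of training: {tasks_equivalent:.1e}")
```

This only compares raw FLOPs; whether an OOM spent at inference buys as much capability as an OOM spent on training is exactly the open question above.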
Thank you. In conditions of extreme uncertainty about the timing and impact of AGI, it's nice to know at least something definite.
Can we assume that Gemini 2.0, GPT-4o, Claude 3.5, and other models with similar performance used a similar amount of compute?
If we don't build fast enough, authoritarian countries could win.
Ideally it would be something like the UN, but given the geopolitical complexities, that doesn't seem very possible.
This sounds like a rejection of international coordination.
But there was coordination between the United States and the USSR on nuclear weapons, for example, despite geopolitical tensions. You can interact with countries you don't like without trying to destroy the world faster than they do!
2 years ago, you seemed quite optimistic about AGI Safety/Alignment and had a long timeline.
Have your views changed since then?
I understand that hiring will be necessary in any case.
Keeping people as a commodity for acausal trade, or as pets, seems like a more likely option.
If only one innovation separates us from AGI, we're fucked.
It seems that if OpenAI or Anthropic had agreed with you, they should have had even shorter timelines.
A short reading list which should be required before one has permission to opine. You can disagree, but step 1 is to at least make an effort to understand why some of the smartest people in the world (and 100% of the top 5 ai researchers — the group historically most skeptical about ai risk) think that we’re dancing on a volcano. [Flo suggests: There’s No Fire Alarm for Artificial General Intelligence, AGI Ruin: A List of Lethalities, Superintelligence by Nick Bostrom, and Superintelligence FAQ by Scott Alexander]
But Bostrom estimated the probability of extinction within a century as <20%. Scott Alexander estimated the risk from AI as 33%.
They could have changed their forecasts. But it seems strange to refer to them as a justification for confident doom.
I would expect that the absence of a global catastrophe for ~2 years after the creation of AGI would increase the chances of most people's survival. Especially in a scenario where alignment was easy.
After all, then there will be time for political and popular action. We can expect something strange when politicians and their voters finally understand the existential horror of the situation!
I don't know. Attempts to ban all AI? The Butlerian jihad? Nationalization of AI companies? Revolutions and military coups? Everything seems possible.
If AI respects the right to property, why shouldn't it respect the right to UBI if such a law is passed? The rapid growth of the economy will make it possible to feed many.
In fact, a world in which someone shrugs their shoulders and allows 99% of the population to die seems obviously unsafe for the remaining 1%.
It's possible that we won't get something that deserves the name ASI or TAI until, for example, 2030.
And a lot can change in more than 5 years!
The current panic seems excessive. We do not live in a world where all reasonable people expect the emergence of artificial superintelligence in the next few years and the extinction of humanity soon after that.
The situation is very worrying, and this is the most likely cause of death for all of us in the coming years, yes. But I don't understand how anyone can be so sure of a bad outcome as to consider people's survival a miracle.
Then what is the probability of extinction caused by AI?
Of course, capital is useful for exerting influence now. Though I would suggest that having a noticeable impact on events requires capital or power on a scale inaccessible to the vast majority of the population.
But can we end up in a world where the richest 1% or 0.1% survive and the rest die? Unlikely. Even if property rights were respected, such a world would have to turn into a mad hell.
Even a world in which only people like Sam Altman and their entourage survive the singularity seems more likely.
But the most likely outcomes should be either the extinction of everyone or the survival of almost everyone, without a strong correlation with current well-being. Am I mistaken?
Most experts do not believe that we are certainly (>80%) doomed. It would be an overreaction to give up at the news that politicians and CEOs are behaving like politicians and CEOs.
But is your P(doom) still only 0.6? Or are you considering disempowerment by AI separately?
It still surprises me that so many people agree on most issues but have very different P(doom) values. Even long, patient discussions do not bring people's views closer. It will probably be even harder to convince a politician or a CEO.
So what's your P(doom)?
I have already tried to put together the most complete collection of quotes here. But it is very outdated by now.
It seems that in 2014 he believed that p(doom) was less than 20%.
I do expect some of the potential readers of this post to live in a very unsafe environment - e.g. parts of current-day Ukraine, or if they live together with someone abusive - where they are actually in constant danger.
I live ~14 kilometers from the front line, in Donetsk. Yeah, it's pretty... stressful.
But I think I'm much more likely to be killed by an unaligned superintelligence than an artillery barrage.
Most people survive urban battles, so I have a good chance.
And in fact, many people worry even less than I do! People get tired of feeling in danger all the time.
'“Then why are you doing the research?” Bostrom asked.
“I could give you the usual arguments,” Hinton said. “But the truth is that the prospect of discovery is too sweet.” He smiled awkwardly, the word hanging in the air—an echo of Oppenheimer, who famously said of the bomb, “When you see something that is technically sweet, you go ahead and do it, and you argue about what to do about it only after you have had your technical success.”'
'I asked Hinton if he believed an A.I. could be controlled. “That is like asking if a child can control his parents,” he said. “It can happen with a baby and a mother—there is biological hardwiring—but there is not a good track record of less intelligent things controlling things of greater intelligence.” He looked as if he might elaborate. Then a scientist called out, “Let’s all get drinks!”'
Hinton seems to be more responsible now!
The level of concern and seriousness I see from ML researchers discussing AGI on any social media platform or in any mainstream venue seems wildly out of step with "half of us think there's a 10+% chance of our work resulting in an existential catastrophe".
In fairness, this is not quite half of all researchers. It is half of those who agreed to take the survey.
I expect that worried researchers are more likely to agree to participate in the survey.
Thanks for your answer, this is important to me.
I am not an American (so excuse my bad English!), so my opinion on the admissibility of attacks on US data centers is not so important. This is not my country.
But reading about the bombing of Russian data centers as an example was unpleasant. It sounds like Western bias to me. And not only to me.
If the text is aimed at readers beyond the First World, then perhaps the authors should add a clarification like the one you made! Then it would not look like political hypocrisy. Or they could avoid writing about air strikes altogether, since people get distracted discussing them.
I'm not an American, so my consent doesn't mean much :)
Suppose China and Russia accepted Yudkowsky's initiative, but the USA did not. Would you support bombing an American data center?
I can provide several links, and you can choose whichever are suitable. If any are. The problem is that I saved not the most complete justifications, but the most ... definite and brief ones. I will try not to repeat those already given in the answers here.
Jaron Lanier and Neil Gershenfeld
Magnus Vinding and his list
Maybe Abram Demski? But he has probably changed his mind since.
Well, Stuart Russell. But that is a book. I can quote it.
I do think that I’m an optimist. I think there’s a long way to go. We are just scratching the surface of this control problem, but the first scratching seems to be productive, and so I’m reasonably optimistic that there is a path of AI development that leads us to what we might describe as “provably beneficial AI systems.”
There are also a large number of reasonable people who directly called themselves optimists or who gave a relatively small probability of death from AI. But usually they did not justify this in ~500 words…
I also recommend this book.
My fault. I should just copy the individual quotes and links here.
I have collected many quotes with links about the prospects of AGI. Most people were optimistic.
Glad you understood me. Sorry for my English!
Of course, the following examples do not by themselves prove that the whole problem of AGI alignment can be solved! But this direction seems interesting and strongly underrated to me. At the very least, someone smarter than me can look at this idea and say that it is bullshit.
This is partly a source of intuition for me that creating an aligned superintelligence is possible. And maybe it is not even as hard as it seems.
We have many examples of creatures that follow the goals of something stupider than themselves. And the mechanism responsible for this should not be very complex.
A process as stupid as natural selection was able to create these capabilities. It must be achievable for us.
It seems to me that the brains of many animals can be aligned with the goals of something much stupider than themselves.
People and pets. Parasites and animals. Even ants and fungus.
Perhaps the relationship we would like to have with a superintelligence is already observed on a much smaller scale.
I apologize for the stupid question. But…
Do we have a better chance of surviving in a world closer to Orwell's '1984'?
It seems to me that we are moving towards more global surveillance and control. China's regime in 2021 may seem extremely liberal to an observer in 2040.
I guess I misused the term 'gray goo'. I apologize for that and for my bad English.
Can I replace it with 'using nanotechnology to attain a decisive strategic advantage'?
I mean the discussion of the prospects for nanotechnology on SL4 20+ years ago. Especially this:
My current estimate, as of right now, is that humanity has no more than a 30% chance of making it, probably less. The most realistic estimate for a seed AI transcendence is 2020; nanowar, before 2015.
I understand that EY's views have changed in many ways since then. But I am interested in what experts think about the possibility of using nanotechnology for the scenarios he implies now. I found very little.
Nanosystems are definitely possible, if you doubt that read Drexler’s Nanosystems and perhaps Engines of Creation and think about physics.
Is there anything like a survey of experts on the feasibility of Drexlerian nanotechnology? Is there any consensus among specialists about the possibility of a gray goo scenario?
Both Drexler and Yudkowsky greatly overestimated the impact of molecular nanotechnology in the past.
I do not know the experts' opinions on this issue. And I lack the competence to draw such conclusions myself, sorry.
AlexNet was the first publication that leveraged graphical processing units (GPUs) for the training run
Do you mean the first of the data points on the chart? GPUs were used for DL long before AlexNet. References: [1], [2], [3], [4], [5].
But this is 1999, yes.
Probably this:
When we didn’t have enough information to directly count FLOPs, we looked at GPU training time and total number of GPUs used and assumed a utilization efficiency (usually 0.33)
This can be useful:
We trained the league using three main agents (one for each StarCraft race), three main exploiter agents (one for each race), and six league exploiter agents (two for each race). Each agent was trained using 32 third-generation tensor processing units (TPUs) over 44 days
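For what it's worth, here is a minimal sketch of how that estimation method would apply to the AlphaStar numbers above. The device counts, days, and number of agents come from the quote; the peak FLOP/s per device and the 0.33 utilization are assumptions for illustration, not reported values:

```python
# Compute estimate from hardware count, wall-clock time, peak throughput,
# and an assumed utilization, following the methodology quoted above.
SECONDS_PER_DAY = 86_400

def estimate_flops(num_devices, days, peak_flops_per_device, utilization=0.33):
    """Total training FLOPs ~= devices * seconds * peak FLOP/s * utilization."""
    return num_devices * days * SECONDS_PER_DAY * peak_flops_per_device * utilization

# AlphaStar league (from the quote): 3 main + 3 main exploiter + 6 league
# exploiter agents = 12 agents, each trained on 32 TPUs for 44 days.
agents = 12
tpus_per_agent = 32
days = 44
assumed_peak = 1.0e14   # ~100 TFLOP/s per device: a round-number assumption

per_agent = estimate_flops(tpus_per_agent, days, assumed_peak)
print(f"Per agent:    {per_agent:.2e} FLOPs")
print(f"Whole league: {agents * per_agent:.2e} FLOPs")
```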
Perhaps my large collection of quotes about the impact of AI on the future of humanity here will be helpful.
Then should the majority of experts from the FHI be considered extreme optimists, at that same 20%? I really did try to find all the publicly available expert forecasts, and very few of them were confident that AI would lead to the extinction of humanity. But I have no reason not to believe you, or Luke Muehlhauser, who described AI safety experts as even more confident pessimists: 'Many of them are, roughly speaking, 65%-85% confident that machine superintelligence will lead to human extinction'. The reason may be disagreement about whose opinion is worth considering.
What about this and this? Here some researchers at the FHI give different probabilities.
I meant the results of such polls: https://www.thatsmags.com/china/post/15129/happy-planet-index-china-is-72nd-happiest-country-in-the-world. Well, it doesn’t matter.
I think I could sleep better if everyone acknowledged that existential risks would be lower in a less free world.
I'm not sure I can trust news sources that have an interest in portraying China in a particular light.
In any case, this does not seem to stop people in China from feeling happier than people in the US.
I cited that date just to contrast with your forecast. My intuition points more toward AI in the 2050s or 2060s.
And yes, I expect that by 2050 it will be possible to monitor every person's behavior 24/7 across entire countries. I can't say that makes me happy, but I think the vast majority will put up with it. I don't believe in a liberal democratic utopia, but the end of the world seems unlikely to me.