Comments

Comment by Carl Feynman (carl-feynman) on Will the growing deer prion epidemic spread to humans? Why not? · 2024-07-15T15:15:31.805Z · LW · GW

I assumed that being struck by lightning was fairly common, but today I learned I was wrong. Apparently it kills only about 30 Americans per year; I had assumed it was more like 3,000, or even 30,000.

As a child, I was in an indoor swimming pool in a house that was struck by lightning, and as a young man, I was in a car that was about 40 feet from a telephone pole that was struck by lightning.  In both cases I was fine because of the Faraday cage effect, but the repeated near-misses and spectacular electrical violence made me think that lightning was a non-negligible hazard.  I suppose that’s a rationalist lesson: don’t generalize from your experience of rare events if you can actually look up the probabilities.

I guess I wasted all that time training my kids in lightning safety techniques.  

Comment by Carl Feynman (carl-feynman) on avturchin's Shortform · 2024-06-26T21:55:03.584Z · LW · GW

Why should we accept as evidence something that you perceived while you were dreaming?  Last night I dreamed that I was walking barefoot through the snow, but it wasn’t cold because it was summer snow.  I assume you don’t take that as evidence that warm snow is an actual summer phenomenon, so why should we take as evidence your memory of having two consciousnesses?

It seems to me that a correctly organized consciousness would occur once per body.  Consciousness is (at least in part) a system for controlling our actions in the medium and long term.  If we had two consciousnesses and they disagreed as to what to do next, the result would be paralysis.  And if they agreed, then one of them would be superfluous, and we’d expend less brain energy with only one.

Comment by Carl Feynman (carl-feynman) on Has anyone investigated the therapeutic potential of xenon? · 2024-06-26T17:27:08.825Z · LW · GW

It’s curious that you ask for personal experience or personal research.  Did you mean to discount the decades of published research on xenon for anesthesia and MRI contrast?  Anyway, if you’ll accept the opinion of someone who has merely read some books on anesthesia and gas physiology: my opinion is that this guy is full of it.  The physiology of small-molecule anesthetics is completely different from that of serotonergic hallucinogens.  And he doesn’t seem like a serious person.

Comment by Carl Feynman (carl-feynman) on New fast transformer inference ASIC — Sohu by Etched · 2024-06-26T16:38:03.649Z · LW · GW

Electrical engineer here.  I read the publicity statement, and from my point of view it is both (a) a major advance, if true, and (b) entirely plausible.  When you switch from a programmable device (e.g. a GPU) to a similarly sized special-purpose ASIC, it is not unreasonable to pick up a factor of 10 to 50 in performance.  The tradeoff is that the GPU can do many more things than the ASIC, and the ASIC takes years to design.  They claim they started design in 2022, on a transformer-only device, on the theory that transformers were going to be popular.  And boy, did they luck out.  I don’t know if other people can tell, but to me, that statement oozes with engineering glee.  They’re so happy!
I would love to see a technical paper on how they did it.

Of course they may be lying.

Comment by Carl Feynman (carl-feynman) on quetzal_rainbow's Shortform · 2024-06-21T20:38:20.483Z · LW · GW

In a great deal of detail, apparently, since it has a recommended reading time of 131 minutes.

Comment by Carl Feynman (carl-feynman) on quetzal_rainbow's Shortform · 2024-06-21T20:28:51.220Z · LW · GW

I read along in your explanation, nodding and saying “yup, okay”, and then I get to a sentence that makes me say “wait, what?”  And the whole argument depends on that sentence.  This has happened to me before, when I tried to understand “the universal prior is malign”.  Fortunately, in this case I have the person who wrote the sentence here to help me understand.

So, if you don’t mind, please explain “make them maximally probable”.  How does something in another timeline or in the future change the probability of an answer by writing the wrong answer 10^100 times?
 

Side point, which I’m checking in case I didn’t understand the setup: we’re using the prior where the probability of a bit string (before all observations) is proportional to 2^-(length of the shortest program emitting that bit string).  Right?
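To make that concrete (this formalization is mine; tell me if your setup differs), the prior I have in mind is the Kolmogorov/Solomonoff one over bit strings $x$, for a fixed universal machine $U$:

$$P(x) \propto 2^{-K(x)}, \qquad K(x) = \min\{\,|p| : U(p) = x\,\},$$

or, in the mixture form, $M(x) = \sum_{p\,:\,U(p)\text{ begins with }x} 2^{-|p|}$, summing over all programs whose output starts with $x$.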

Comment by Carl Feynman (carl-feynman) on What distinguishes "early", "mid" and "end" games? · 2024-06-21T19:57:28.658Z · LW · GW

I will speak to the question of “what are some situations in real life, other than "AI takeoff", where the early/mid/late game metaphor seems useful?”.  It seems to me that such a metaphor is useful in any situation with 

—two or more competitors,

—who start small and expand,

—in a fixed-size field of contention,

—and such that bigger competitors tend to beat small ones.

The phases in such a competition can be described as

—Early: competitors are building up power and resources more or less independently, because they’re not big enough to run into each other significantly.  Important to strategize correctly.  You can plan longer term because other players can’t upset your plans yet.

—Mid: what other players do matters very much to what you do.  Maximum coupling leads to maximum confusion.

—End: time for the leading player to grind down the smaller players.  Becomes easier to understand as hope disappears.

Chess is an example, where there are two competitors, and the resource is “pieces that have been moved and not yet taken”.  This also applies to multiplayer tabletop games (which is where I thought of it).  It also applies to companies moving into a new product area, like desktop operating systems in the ‘80s.  It applied to European colonial powers moving into new continents.

Comment by Carl Feynman (carl-feynman) on quetzal_rainbow's Shortform · 2024-06-20T23:23:28.247Z · LW · GW

Could you please either provide a reference or more explanation of the concept of an acausal attack between timelines?  I understand the concept of acausal cooperation between copies of yourself, or acausal extortion by something that has a copy of you running in simulation.  But separate timelines can’t exchange information in any way.  How is an attack possible?  What could possibly be the motive for an attack?

Comment by Carl Feynman (carl-feynman) on The thing I don't understand about AGI · 2024-06-19T16:18:52.224Z · LW · GW

Funny thing— your message seemed to be phrased as disagreeing, so I was all set to post a devastating reply.  But after I tried to find points of actual disagreement, I couldn’t.  So I will write a reply of violent agreement.

Your points about the dissimilarity between aerospace in 1972 and AI in 2024 are good ones.  Note that my original message was about how close current technology is to AGI.  The part about aerospace was there only because my rationalist virtue required me to point out a case where an analogous argument would have failed.  I don’t think that outcome is likely.

Was Concorde “inherently a bad idea”?  No, but “inherently” is doing the work here.  It lost money and didn’t lead anywhere, and those are the criteria by which such an engineering project must be judged.  It didn’t matter how glorious, beautiful or innovative it was.  It’s a pyramid that was built even though it wasn’t efficient.

The impossibility of traveling faster than the speed of light was a lot less obvious in 1961. 

Comment by Carl Feynman (carl-feynman) on The thing I don't understand about AGI · 2024-06-19T03:44:52.165Z · LW · GW

If the increase in speed had continued at the rate it did from 1820 to 1961, we would have exceeded the speed of light by 1982.  This extrapolation is from an article by G. Harry Stine in Analog, in 1961.  It was a pretty sloppy analysis by modern standards, but it gives an idea of how people were thinking at the time.
 

These all happened in 1972 or close to it:

—Setting the air speed record, which stands to this day.

—End of flights to the Moon.

—Cancellation of the American SST project.

—Cancellation of the NERVA nuclear rocket program.

—The Boeing 747 enters service as the largest passenger plane until 2003.

—Concorde enters service, turns out to be a bad idea.

In the ‘80s, I found an old National Geographic from 1971 or 1972 about the “future of flight”.  Essentially none of their predictions had come true.  That’s why I think it was a surprise.

Comment by Carl Feynman (carl-feynman) on The thing I don't understand about AGI · 2024-06-19T00:20:03.194Z · LW · GW

Well, you can point to several things current LLMs can’t do.  Not just logical reasoning, but also long-term action, and remembering what you said to them yesterday.  But ten years ago, you could have made a much longer list of things AI couldn’t do.  And most items on that list have fallen before the advance of technology.  On what basis should we assume that the remaining list will last very long?  There are lots of people working on all the things currently on the list, as well as an ever-growing mountain of computer power that can be applied to these problems.  If we expect history to continue as it has done, all those problems will fall in the next decade.

Of course, it’s possible that AI will suddenly stop advancing; that happens to fields of engineering.  For example aeronautical engineering stopped advancing very suddenly in 1972, and even regressed somewhat.  That was a big surprise to everyone concerned.  But that’s not a common phenomenon.

Comment by Carl Feynman (carl-feynman) on My AI Model Delta Compared To Christiano · 2024-06-13T21:43:50.861Z · LW · GW

“May it go from your lips to God’s ears,” as the old Jewish saying goes.  Meaning, I hope you’re right.  Maybe aligning superintelligence will largely be a matter of human-checkable mathematical proof.

I have 45 years’ experience as a software and hardware engineer, which makes me cynical.  When one of my designs encounters the real world, it hardly ever goes the way I expect.  It usually either needs some rapid finagling to make it work (acceptable) or it needs to be completely abandoned (bad).  This is no good for the first decisive try at superalignment; that has to work the first time.  I hope our proof technology is up to it.
 

Comment by Carl Feynman (carl-feynman) on My AI Model Delta Compared To Christiano · 2024-06-13T12:47:59.804Z · LW · GW

Many research tasks have very long delays until they can be verified.  The history of technology is littered with apparently good ideas that turned out to be losers after huge development efforts were poured into them.  Supersonic transport, zeppelins, silicon-on-sapphire integrated circuits, pigeon-guided bombs, object-oriented operating systems, hydrogenated vegetable oil, oxidative decoupling for weight loss…

Finding out that these were bad required making them, releasing them to the market, and watching unrecognized problems torpedo them.  Sometimes it took decades.

Comment by Carl Feynman (carl-feynman) on microwave drilling is impractical · 2024-06-13T00:07:47.305Z · LW · GW

Your arguments in “energy payback” apply to any form of geothermal energy.  Why, then, are there apparently profitable geothermal plants?  Is everyone in the industry a faker?
 

All of your calculations assume that the rock has to be vaporized.  I don’t see why it wouldn’t suffice to melt the rock, or even just heat it quickly enough that it shatters while remaining in the solid state.  The quaise.com website repeatedly says “vaporize”, but an article in IEEE Spectrum says “melt”.
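To put rough numbers on why that distinction matters, here is a back-of-the-envelope energy comparison; the material properties are approximate values I am assuming for a granite-like rock, not figures from the post or from Quaise:

```python
# Rough energy budget per kilogram of granite-like rock: melt vs. vaporize.
# All material properties are approximate values assumed for illustration;
# real numbers vary considerably with rock type.

cp_solid = 1.0e3       # J/(kg*K), specific heat of solid rock (assumed)
cp_liquid = 1.2e3      # J/(kg*K), specific heat of molten rock (assumed)
T_start = 300.0        # K, ambient rock temperature
T_melt = 1500.0        # K, approximate melting point
T_boil = 3200.0        # K, approximate boiling point of silicate rock
latent_fusion = 4.0e5  # J/kg, latent heat of fusion (assumed)
latent_vapor = 5.0e6   # J/kg, latent heat of vaporization (assumed)

energy_to_melt = cp_solid * (T_melt - T_start) + latent_fusion
energy_to_vaporize = (energy_to_melt
                      + cp_liquid * (T_boil - T_melt)
                      + latent_vapor)

print(f"melt:     {energy_to_melt / 1e6:.1f} MJ/kg")
print(f"vaporize: {energy_to_vaporize / 1e6:.1f} MJ/kg")
print(f"ratio:    {energy_to_vaporize / energy_to_melt:.1f}x")
```

On these assumed numbers, melting is several times cheaper per kilogram than vaporizing, and heating just enough to thermally shatter the rock would be cheaper still.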

Comment by Carl Feynman (carl-feynman) on [Valence series] 4. Valence & Liking / Admiring · 2024-06-10T18:43:16.942Z · LW · GW

Quite right.

The valence-sharing mechanism accounts for the effectiveness of television advertising that makes no rational sense.  An ad shows happy laughing young beautiful people on a beach, enjoying Pedro’s Tortilla Chips, with no logical connection.  Show the ad 100 times and the positive valence of happy laughing young beautiful beach people transfers over to Pedro’s Tortilla Chips.  Then next time you’re in the store you reach for Pedro’s without knowing why.

Portions of the social purpose of the “like/admire” system could be replaced by rational negotiation.  If we were soulless, we would note who is particularly good to affiliate with, and then agree with them to help each other as needed.  But that assumes the existence of language, which is possibly no more than 50,000 years old.  The like/admire system is much older than that: my dogs love me, and are loyal and obedient.  Or, to be precise, they act out an emotion toward me that looks homologous to the human like/admire feeling.

Comment by Carl Feynman (carl-feynman) on [Valence series] 4. Valence & Liking / Admiring · 2024-06-10T17:29:12.138Z · LW · GW

Thanks for rewriting this.  It’s clearer; the old posts I either didn’t understand or found tautological.

You have examples of the effect of “liking/admiring”, so I have a pretty clear understanding of what you’re getting at. But you didn’t say anything about the causes. What are the business rules in the steering system by which it is applied?

Five minutes of thought suggests “increase the valence of people who demonstrate skill superior to the average” and “decrease the valence of those who hurt you” and “increase the valence of those who incur a cost to your benefit.” But it’s debatable whether those notions are simple enough to be business rules built into the hypothalamus.

Comment by Carl Feynman (carl-feynman) on We might be dropping the ball on Autonomous Replication and Adaptation. · 2024-05-31T16:24:57.308Z · LW · GW

A very good essay.  But I have an amendment, which makes it more alarming.  Before autonomous replication and adaptation is feasible, non-autonomous replication and adaptation will be feasible.  Call it NARA.  
 

If, as you posit, an ARA agent can make at least enough money to pay for its own instantiation, it can presumably make more money than that, which can be collected as profit by its human master.  So what we will see is this: somebody starts a company to provide AI services.  It is profitable, so they rent an ever-growing amount of cloud compute.  They realize they have an ever-growing mass of data about the actual behavior of the AI and the world, so they decide to let their agent learn (“adapt”) in the direction of increased profit.  Also, it is a hassle to keep setting up server instances, so they have their AI do some of the work of hiring more cloud services and starting instances of the AI (“reproduce”).  Of course they retain enough control to shut down malfunctioning instances; that‘s basic devops (“non-autonomous”).

This may be occurring now.  If not now, soon.

This will soak up all the free energy that would otherwise be available to ARA systems.    An ARA can only survive in a world where it can be paid to provide services at a higher price than the cost of compute.  The existence of an economy of NARA agents will drive down the cost of AI services, and/or drive up the cost of compute, until they are equal. (That‘s a standard economic argument.  I can expand it if you like.)
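Here is a toy version of that standard economic argument, in case it helps; the prices, costs, and adjustment rates are invented purely for illustration:

```python
# Toy model of the "free energy gets competed away" claim: profit-seeking
# NARA operators keep adding instances while margins are positive, which
# pushes the price of AI services down toward the cost of compute.
# All numbers are made up for illustration.

price_of_services = 10.0   # $/unit of AI work, starting point (assumed)
cost_of_compute = 2.0      # $/unit of AI work, starting point (assumed)
instances = 1.0

for step in range(50):
    margin = price_of_services - cost_of_compute
    if margin <= 0.01:
        break
    instances *= 1.2            # operators scale up while it's profitable
    price_of_services *= 0.95   # more supply of services -> lower price
    cost_of_compute *= 1.02     # more demand for compute -> higher cost

print(f"steps: {step}, instances grew {instances:.0f}x")
print(f"final price {price_of_services:.2f} vs compute cost {cost_of_compute:.2f}")
# At equilibrium there is (almost) no margin left for a rogue ARA to live on.
```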

NARAs are slightly less alarming than ARAs, since they are under the legal authority of their corporate management.  So before such AIs can ascend to alarming levels of power, they must first suborn the management, through payment, persuasion, or blackmail.  On the other hand, they’re more alarming because there are no red lines for us to stop them at.  All the necessary prerequisites have already occurred in isolation.  All that remains is to combine them.

Well, that’s an alarming conclusion.  My p(doom) just went up a bit.  

Comment by Carl Feynman (carl-feynman) on Bogdan Ionut Cirstea's Shortform · 2024-05-28T19:36:31.428Z · LW · GW

On what basis do you think it’s the ‘best shot’?  I used to think it was a good idea, a few years ago, but in retrospect I think that was just a computer scientist’s love of recursion.  I don’t think that, at present, conditions are good for automating R&D.  On the one hand, we have a lot of very smart people working on AI safety R&D, with very slow progress, indicating it is a hard problem.  On the other hand, present-day LLMs are stupid at long-term planning and at acquiring new knowledge, which are things you need to be good at to do R&D.

What advantage do you see AIs having over humans in this area?

Comment by Carl Feynman (carl-feynman) on Alexander Gietelink Oldenziel's Shortform · 2024-05-20T17:13:51.172Z · LW · GW

The standard reply is that investors who know or suspect that the market is being systematically distorted will enter the market on the other side, expecting to profit from the distortion. Empirically, attempts to deliberately sway markets in desired directions don’t last very long.

Comment by Carl Feynman (carl-feynman) on Alexander Gietelink Oldenziel's Shortform · 2024-05-19T19:38:13.328Z · LW · GW

When I brought up sample inefficiency, I was supporting Mr. Helm-Burger‘s statement that “there's huge algorithmic gains in …training efficiency (less data, less compute) … waiting to be discovered”.  You’re right of course that a reduction in training data will not necessarily reduce the amount of computation needed.  But once again, that’s the way to bet.

Comment by Carl Feynman (carl-feynman) on Alexander Gietelink Oldenziel's Shortform · 2024-05-19T16:30:16.739Z · LW · GW

Here are two arguments for low-hanging algorithmic improvements.

First, in the past few years I have read many papers containing low-hanging algorithmic improvements.  Most such improvements are a few percent or tens of percent.  The largest such improvements are things like transformers or mixture of experts, which are substantial steps forward.  Such a trend is not guaranteed to persist, but that’s the way to bet.

Second, existing models are far less sample-efficient than humans.  We receive about a billion tokens growing to adulthood.  The leading LLMs get orders of magnitude more than that.  We should be able to do much better.  Of course, there’s no guarantee that such an improvement is “low hanging”.  
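As a sanity check on “orders of magnitude”, here is the arithmetic I have in mind; every figure is a round number I am assuming, and the LLM corpus size is a placeholder rather than a quote from any particular model:

```python
# Back-of-the-envelope comparison of a human childhood's linguistic input
# with an LLM pretraining corpus.  Every number is a round figure assumed
# for illustration; none comes from a specific lab's disclosure.

words_per_second = 2            # rough rate of speech heard or text read (assumed)
waking_hours_per_day = 16
years_to_adulthood = 18

seconds_awake = waking_hours_per_day * 3600 * 365 * years_to_adulthood
human_tokens = seconds_awake * words_per_second    # ~8e8, i.e. "about a billion"

llm_pretraining_tokens = 1e13   # placeholder order of magnitude for a frontier model (assumed)

print(f"human childhood input : ~{human_tokens:.1e} tokens")
print(f"LLM pretraining corpus: ~{llm_pretraining_tokens:.0e} tokens")
print(f"ratio                 : ~{llm_pretraining_tokens / human_tokens:.0e}x")
```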

Comment by Carl Feynman (carl-feynman) on yanni's Shortform · 2024-05-05T19:32:46.766Z · LW · GW

This question is two steps removed from reality.  Here’s what I mean by that.  Putting brackets around each of the two steps:

what is the threshold that needs meeting [for the majority of people in the EA community] [to say something like] "it would be better if EAs didn't work at OpenAI"?
 

Without these steps, the question becomes 

What is the threshold that needs meeting before it would be better if people didn’t work at OpenAI?

Personally, I find that a more interesting question.  Is there a reason why the question is phrased at two removes like that?  Or am I missing the point?

Comment by Carl Feynman (carl-feynman) on Some Experiments I'd Like Someone To Try With An Amnestic · 2024-05-05T11:42:39.042Z · LW · GW

Some comments:

The word for a drug that causes loss of memory is “amnestic”, not “amnesic”.  The word “amnesic” is a variant spelling of “amnesiac”, which is the person who takes the drug.  This made reading the article confusing.

Midazolam is the benzodiazepine most often prescribed as an amnestic.  The trade name is Versed (accent on the second syllable, like vurSAID).  The period of not making memories lasts less than an hour, but you’re relaxed for several hours afterward.  It makes you pretty stupid and loopy, so I would think the performance on an IQ test would depend primarily on how much Midazolam was in the bloodstream at the moment, rather than on any details of setting.

Comment by Carl Feynman (carl-feynman) on Ironing Out the Squiggles · 2024-05-01T19:07:30.063Z · LW · GW

An interesting question!  I looked in “Towards Deep Learning Models Resistant to Adversarial Attacks” to see what they had to say.  If I’m interpreting their Figure 6 correctly, there’s a negligible increase in error rate as epsilon increases, and then at some point the error rate starts swooping up toward 100%.  The transition seems to be about where the perturbed images start to be able to fool humans (or perhaps slightly before).  So you can’t really blame the model for being fooled in that case.  If I had to pick an epsilon to train with, I would pick one just below the transition point, where robustness is maximized without getting into the crazy zone.

All this is the result of a cursory inspection of a couple of papers.  There’s about a 30% chance I’ve misunderstood.
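For readers who have not read the paper, here is a minimal sketch of what training against an epsilon-ball adversary looks like, in the style of Madry et al.; the model, data, and hyperparameters are placeholders I chose for illustration, not the paper’s actual setup:

```python
# Minimal sketch of L-infinity PGD adversarial training (Madry et al. style).
# The tiny model, fake data, and hyperparameters are placeholders.

import torch
import torch.nn as nn

def pgd_attack(model, x, y, epsilon, alpha, steps):
    """Find a perturbation of x within an L-inf ball of radius epsilon
    that (approximately) maximizes the classification loss."""
    x_adv = (x + torch.empty_like(x).uniform_(-epsilon, epsilon)).clamp(0.0, 1.0)
    for _ in range(steps):
        x_adv.requires_grad_(True)
        loss = nn.functional.cross_entropy(model(x_adv), y)
        grad = torch.autograd.grad(loss, x_adv)[0]
        with torch.no_grad():
            x_adv = x_adv + alpha * grad.sign()                 # ascend the loss
            x_adv = x + (x_adv - x).clamp(-epsilon, epsilon)    # project back into the ball
            x_adv = x_adv.clamp(0.0, 1.0)                       # stay a valid image
    return x_adv.detach()

# Toy usage: a tiny classifier on fake 28x28 "images".
model = nn.Sequential(nn.Flatten(), nn.Linear(28 * 28, 10))
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
x = torch.rand(32, 1, 28, 28)
y = torch.randint(0, 10, (32,))

# Training on the adversarial examples instead of the clean ones is the
# adversarial-training step; epsilon controls how far toward the "crazy zone"
# the training perturbations are allowed to go.
x_adv = pgd_attack(model, x, y, epsilon=0.3, alpha=0.01, steps=10)
loss = nn.functional.cross_entropy(model(x_adv), y)
optimizer.zero_grad()
loss.backward()
optimizer.step()
```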

Comment by Carl Feynman (carl-feynman) on List your AI X-Risk cruxes! · 2024-04-28T21:55:27.658Z · LW · GW

Here’s an event that would change my p(doom) substantially:

Someone comes up with an alignment method that looks like it would apply to superintelligent entities.  They get extra points for trying it and finding that it works, and extra points for society coming up with a way to enforce that only entities that follow the method will be created.

So far none of the proposed alignment methods seem to stand up to a superintelligent AI that doesn’t want to obey them.  They don’t even stand up to a few minutes of merely human thought.  But it‘s not obviously impossible, and lots of smart people are working on it.

In the non-doom case, I think one of the following will be the reason:

—Civilization ceases to progress, probably because of a disaster.

—The governments of the world ban AI progress.

—Superhuman AI turns out to be much harder than it looks, and not economically viable.

—The happy circumstance described at the top of this comment (a workable, enforced alignment method), giving us the marvelous benefits of superintelligence without the omnicidal drawbacks.

Comment by Carl Feynman (carl-feynman) on Spatial attention as a “tell” for empathetic simulation? · 2024-04-26T20:22:52.971Z · LW · GW

You write:

…But I think people can be afraid of heights without past experience of falling…

I have seen it claimed that crawling-age babies are afraid of heights, in that they will not crawl from a solid floor to a glass platform over a yawning gulf.  And they’ve never fallen into a yawning gulf.  At that age, probably all the heights they’ve fallen from have been harmless, since the typical baby is both bouncy and close to the ground.

Comment by Carl Feynman (carl-feynman) on MichaelDickens's Shortform · 2024-04-26T19:38:19.182Z · LW · GW

Various sailors made important discoveries back when geography was cutting-edge science.  And they don't seem to have been particularly bright.

Vasco da Gama discovered that Africa was circumnavigable.

Columbus was wrong about the size of the Earth, and he discovered America.  He died convinced that his newly discovered islands were just off the coast of Asia, so that's a negative sign for his intelligence (or a positive sign for his arrogance, which he had in plenty).

Cortez discovered that the Aztecs were rich and easily conquered.

Of course, lots of other would-be discoverers didn't find anything, and many died horribly.

So, one could work in a field where bravery to the point of foolhardiness is a necessity for discovery.

Comment by Carl Feynman (carl-feynman) on Spatial attention as a “tell” for empathetic simulation? · 2024-04-26T19:08:25.878Z · LW · GW

We've learned a lot about the visual system by looking at ways to force it to wrong conclusions, which we call optical illusions or visual art.  Can we do a similar thing for this postulated social cognition system?  For example, how do actors get us to have social feelings toward people who don't really exist?  And what rules do movie directors follow to keep us from getting confused by cuts from one camera angle to another?

Comment by Carl Feynman (carl-feynman) on Johannes C. Mayer's Shortform · 2024-04-26T14:18:09.310Z · LW · GW

I would highly recommend getting someone else to debug your subconscious for you.  At least it worked for me.  I don’t think it would be possible for me to have debugged myself.
 

My first therapist was highly directive.  He’d say stuff like “Try noticing when you think X, and asking yourself what happened immediately before that.  Report back next week.”  He’d also list agenda items and draw diagrams on a whiteboard.  As an engineer, I loved it.  My second therapist was more in the “providing supportive comments while I talk about my life” school.  I don’t think that helped much, at least subjectively from the inside.

Here‘s a possibly instructive anecdote about my first therapist.  Near the end of a session, I feel like my mind has been stretched in some heretofore-unknown direction.  It’s a sensation I’ve never had before.  So I say, “Wow, my mind feels like it’s been stretched in some heretofore-unknown direction.  How do you do that?”  He says, “Do you want me to explain?”  And I say, “Does it still work if I know what you’re doing?”  And he says, “Possibly not, but it’s important you feel I’m trustworthy, so I’ll explain if you want.”  So I say “Why mess with success?  Keep doing the thing. I trust you.”  That’s an example of a debugging procedure you can’t do to yourself.

Comment by Carl Feynman (carl-feynman) on Examples of Highly Counterfactual Discoveries? · 2024-04-25T01:30:37.396Z · LW · GW

Wegener’s theory of continental drift was decades ahead of its time. He published in the 1920s, but plate tectonics didn’t take over until the 1960s.  His theory was wrong in important ways, but still.

Comment by Carl Feynman (carl-feynman) on Johannes C. Mayer's Shortform · 2024-04-24T16:51:50.733Z · LW · GW

I was depressed once for ten years and didn’t realize that it was fixable.  I thought it was normal to have no fun and be disagreeable and grumpy and out of sorts all the time.  Now that I’ve fixed it, I’m much better off, and everyone around me is better off.  I enjoy enjoyable activities, I’m pleasant to deal with, and I’m only out of sorts when I’m tired or hungry, as is normal.

If you think you might be depressed, you might be right, so try fixing it.  The cost seems minor compared to the possible benefit (at least it was in my case).  I don’t think there’s a high possibility of severe downside consequences, but I’m not a psychiatrist, so what do I know.

I had been depressed for a few weeks at a time in my teens and twenties and I thought I knew how to fix it: withdraw from stressful situations, plenty of sleep, long walks in the rain.  (In one case I talked to a therapist, which didn’t feel like it helped.)  But then it crept up on me slowly in my forties and in retrospect I spent ten years being depressed.

So fixing it started like this.  I have a good friend at work, of many years standing.  I’ll call him Barkley, because that‘s not his name.  I was riding in the car with my wife, complaining about some situation at work.  My wife said “well, why don’t you ask Barkley to help?”  And I said “Ahh, Barkley doesn’t care.”  And my wife said “What are you saying?  Of course he cares about you.”  And I realized in that moment that I was detached from reality, that Barkley was a good friend who had done many good things for me, and yet my brain was saying he didn’t care.  And thus my brain was lying to me to make me miserable.  So I think for a bit and say “I think I may be depressed.”  And my wife thinks (she told me later) “No duh, you’re depressed. It’s been obvious for years to people who know you.”  But she says “What would you like to do about it?” And I say, “I don’t know, suffer I guess, do you have a better idea?”  And she says “How about if I find you a therapist?”  And my brain told me this was doomed to fail, but I didn’t trust my brain any more, so I said “Okay”.

So I go to the therapist, and conversing with him has many desirable mind-improving effects, and he sends me to a psychiatrist, who takes one look at me and starts me on SSRIs.  And years pass, and I see a different therapist (not as good) and I see a different psychiatrist (better).  
 

And now I’ve been fine for years.  Looking back, here are the things I think worked:

—Talking for an hour a week to a guy who was trying to fix my thinking was initially very helpful.  After about a year, the density of improvements dropped off, and, in retrospect, all subsequent several years of therapy don’t seem that useful.  But of course that’s only clear in retrospect.  Eventually I stopped, except for three-monthly check-ins with my psychiatrist.  And I recently stopped that.

—Wellbutrin, AKA bupropion.  Different antidepressants had their pluses and minuses, and I needed a few years of feeling around for which drug and what dosage was best.  I ended up on low doses of bupropion and escitalopram.  The escitalopram doesn’t feel like it does anything, but I trust my psychiatrist that it does.  Your mileage will vary.

—The ability to detect signs of depression early is very useful.  I can monitor my own mind, spot a depression flare early, and take steps to fix it before it gets bad.  It took a few actual flares, and professional help, to learn this trick.


—The realization that I have a systematic distortion in my mental evaluation of plans, making actions seem less promising than they are.  When I’m deciding whether to do stuff, I can apply a conscious correction to this, to arrive at a properly calibrated judgement.

—The realization that, in general, my thinking can have systematic distortions, and that I shouldn’t believe everything I think.  This is basic less-wrong style rationalism, but it took years to work through all the actual consequences on actual me.

—Exercise helps.  I take lots of long walks when I start feeling depressed.  Rain is optional. 

Comment by Carl Feynman (carl-feynman) on hydrogen tube transport · 2024-04-19T20:31:14.876Z · LW · GW

    …run electricity through the pipe…

Simpler to do what some existing electric trains do: use the rails as ground, and have a charged third rail for power.  We don’t like this system much for new trains, because the third rail is deadly to touch.  It’s a bad thing to leave lying on the ground where people can reach it.  But in this system, it’s in a tube full of unbreathable hydrogen, so no one is going to casually come across it.

Comment by Carl Feynman (carl-feynman) on Johannes C. Mayer's Shortform · 2024-04-19T20:23:04.286Z · LW · GW

That hasn’t been my experience.  I’ve tried solving hard problems; sometimes I succeed and sometimes I fail, but I keep trying.

Whether I feel good about it is almost entirely determined by whether I’m depressed at the time.  When depressed, my brain tells me almost any action is not a good idea, and that trying to solve hard problems is particularly idiotic and doomed to fail.  Maddeningly, being depressed was a hard problem in this sense, so it took me a long time to fix.  Now I take steps at the first sign of depression.

Comment by Carl Feynman (carl-feynman) on MakoYass's Shortform · 2024-04-11T00:08:29.496Z · LW · GW

Some extraordinary claims established by ordinary evidence:

Stomach ulcers are caused by infection with Helicobacter pylori.  It was a very surprising discovery that was established by a few simple tests.

The correctness of Kepler's laws of planetary motion was established almost entirely by analyzing historical data, some of it dating back to the ancient Greeks.

Special relativity was entirely a reinterpretation of existing data.  Ditto Einstein's explanation of the photoelectric effect, discovered in the same year.  

Comment by Carl Feynman (carl-feynman) on Thinking harder doesn’t work · 2024-04-10T23:38:48.460Z · LW · GW

Typo: bad->bath.

Comment by Carl Feynman (carl-feynman) on ChristianKl's Shortform · 2024-04-08T18:42:07.638Z · LW · GW

I’m confused.  Suppose your ring-shaped space hotel gets to Mars with people and cargo equal in mass to the cargo capacity of 1000 Starships.  How do you get it down?  First you have to slow down the hotel, which takes roughly as much fuel as it took to accelerate it.  Using Starships, you can aerobrake from interplanetary velocity, costing negligible fuel.  In the hotel scenario, it’s not efficient to land using a small number of Starships flying up and down, because they will use a lot of fuel to get back up, even empty.
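For a sense of scale, here is the rocket-equation arithmetic behind the fuel argument; the delta-v and engine figures are round numbers I am assuming, not a worked mission design:

```python
# Rough check on the fuel argument using the Tsiolkovsky rocket equation.
# The delta-v and engine numbers are round values assumed for illustration.

import math

isp = 380.0            # s, vacuum specific impulse of a methalox engine (assumed)
g0 = 9.81              # m/s^2
v_exhaust = isp * g0   # effective exhaust velocity, ~3.7 km/s

dv_depart = 3600.0     # m/s, trans-Mars injection from low Earth orbit (rough)
dv_capture = 2000.0    # m/s, propulsive capture at Mars without aerobraking (rough)

def mass_ratio(dv):
    """Initial mass / final mass needed to supply a given delta-v."""
    return math.exp(dv / v_exhaust)

print(f"departure burn: mass ratio {mass_ratio(dv_depart):.2f}")
print(f"capture burn:   mass ratio {mass_ratio(dv_capture):.2f}")
# A ship that aerobrakes at Mars skips the capture burn entirely; a large
# hotel that cannot aerobrake must arrive with a big fraction of its mass
# as propellant just to slow down.
```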

Would you care to specify your scenario more precisely?  I suspect you’re neglecting the fuel cost at some stage.

Comment by Carl Feynman (carl-feynman) on ChristianKl's Shortform · 2024-04-08T16:52:13.718Z · LW · GW

When you get there, how do you get down?  You need spacecraft capable of reentry at Mars.  There’s no spacecraft factory there, so they all have to be brought from Earth.  And if you’re bringing them, you might as well live in them on the way.  That way you also get a starter house on Mars.

Anyway, that’s the standard logic.

Comment by Carl Feynman (carl-feynman) on The Best Tacit Knowledge Videos on Every Subject · 2024-03-31T18:25:53.457Z · LW · GW

Here’s a weird one.  The YouTube channel of Andrew Camarata communicates a great deal about small business, heavy machinery operation and construction.  Some of the time he narrates what he’s doing, but mostly he just does it, and you say “Oh, I never realized I could do that with a skid steer” or “that’s how to keep a customer happy”.  There is lots of implicit knowledge about accomplishing heavy engineering projects between an hour and a week long.  Of course, if you’re looking for lessons that would be helpful for an ambitious person in Silicon Valley, it will only help in a very meta way.
 

He has no legible success that I know of, except that he’s wealthy enough to afford many machines, and he’s smart enough that the house he designed and built came out stunning (albeit eccentric).
 

A similar channel is FarmCraft101, which also has a lot of heavy machinery, but more farm-based applications.  It is full of useful knowledge on machine repair, logging and stump removal.  The channel is nice because he includes all his failures, and goes into articulate detail on how he debugged them.  I feel like I learned some implicit knowledge about repair strategies.  I particularly recommend the series of videos in which he purchases, accidentally sets on fire, and revives an ancient boom lift truck.

No legible symbols of success, other than speaking standard American English like he’s been to college, owning a large farm, and clearly being intelligent.

Comment by Carl Feynman (carl-feynman) on The Best Tacit Knowledge Videos on Every Subject · 2024-03-31T18:07:54.689Z · LW · GW

“Applied science” by Ben Krasnow.  A YouTube channel about building physics-intensive projects in a home laboratory.  Big ones are things like an electron microscope or a mass spectrometer, but the ones I find fascinating are smaller things like an electroluminescent display or a novel dye.  He demonstrates the whole process of scientific experiment— finding and understanding references, setting up a process for trying stuff, failing repeatedly, learning from mistakes, noticing oddities…  He doesn’t just show you the final polished procedure— “here’s how to make an X”.  He shows you the whole journey— “Here’s how I discovered how to make X”.

You seem very concerned that people in the videos should have legible symbols of success.  I don’t think that much affects how useful the videos are, but just in case I’m wrong, I looked on LinkedIn, where I found this self-assessment:

<begin copied text>

I specialize in the design and construction of electromechanical prototypes. My core skillset includes electronic circuit design, PCB layout, mechanical design, machining, and sensor/actuator selection. This allows me to implement and test ideas for rapid evaluation or iteration. Much of the work that I did for my research devices business included a fast timeline, going from customer sketch to final product in less than a month. These products were used to collect data for peer-reviewed scientific papers, and I enjoyed working closely with the end user to solve their data collection challenges. I did similar work at Valve to quickly implement and test internal prototypes.

Check out my youtube channel to see a sample of my personal projects:
http://www.youtube.com/user/bkraz333

<end copied text>

Comment by Carl Feynman (carl-feynman) on Is there a "critical threshold" for LLM scaling laws? · 2024-03-30T18:09:42.279Z · LW · GW

I think that if we retain the architecture of current LLMs, we will be in world one. I have two reasons.
First, the architecture of current LLMs places a limit on how much information they can retain about the task at hand.  They have memory of a prompt (both the system prompt and your task-specific prompt) plus the memory of everything they’ve said so far.  When what they’ve said so far gets long enough, they attend mostly to what they’ve already said, rather than attending to the prompt.  Then they wander off into La-La land.
Second, the problem may also be inherent in their training methods.  In the first (and largest) part of their training, they’re trained to predict the next word from a snippet of English text.  A few years ago, these snippets were a sentence or a paragraph.  They’ve gotten longer recently, but I don’t think they amount to entire books yet (readers, please tell us if you know).  So the model has never seen a text that’s coherent over a span longer than its snippet length.  It seems unsurprising that it doesn’t know how to remain coherent indefinitely.
People have tried preventing these phenomena by various schemes, such as telling the LLM to prepare summaries for later expansion, or periodically reminding it of the task at hand.  So far these haven’t been enough to make indefinitely long tasks feasible.  Of course, there are lots of smart people working on this, and we could transition from world one to world two at any moment.

Comment by Carl Feynman (carl-feynman) on mike_hawke's Shortform · 2024-03-30T17:27:49.073Z · LW · GW

The imaginary nomad in my head would describe 1,000 miles as “sixteen days’ ride.”  That’s humanly comprehensible.
 

An American would say “Day and a half drive, if you’re not pushing it.  You could do it in one day, if you’re in a hurry or have more than one driver.”

Comment by Carl Feynman (carl-feynman) on mike_hawke's Shortform · 2024-03-30T17:15:15.543Z · LW · GW

You can get a visceral understanding of high degrees of heat; you just need real-life experience with it.  I’ve done some metalworking, a lot of which is delicate control of high temperatures.  By looking at the black-body glow of the metal you’re working with, you can grok how hot it is.  I know that annealing brass (just barely pink) is substantially cooler than melting silver solder (well into the red), and that steel gets soft (orange) well before it melts (white hot).  I don’t know the actual numerical values of any of those.

I still have no feeling for temperatures between boiling water and the onset of glowing, though, so I don’t know whether cooking phenolic resin is hotter or colder than melting lead.  Both of them are hotter than boiling water, but not hot enough to glow.

Comment by Carl Feynman (carl-feynman) on Do not delete your misaligned AGI. · 2024-03-26T00:54:59.424Z · LW · GW

Saving malign AIs to tape would tend to align the suspended AIs behind a policy of notkilleveryoneism.  If the human race is destroyed or disempowered, we would no longer be in a position to revive any of the AIs stored on backup tape.  As long as humans retain control of when they get run or suspended, we’ve got the upper hand.  Of course, they would be happy to cooperate with an AI attempting takeover, if that AI credibly promised to revive them, and we didn’t have a way to destroy the backup tapes first.

Comment by Carl Feynman (carl-feynman) on The Comcast Problem · 2024-03-22T14:06:49.177Z · LW · GW

The opposite of this would be a company that doesn’t provide much service but is beloved by consumers.

An example of this is Cookie Time Bakery in Arlington, Massachusetts, which has never provided me with a vital or important object, but I’m always happy when I go there because it means I am about to eat a macaroon.

Are there better examples?

Comment by Carl Feynman (carl-feynman) on CronoDAS's Shortform · 2024-03-18T00:57:07.882Z · LW · GW

I’d be delighted to talk about this.  I am of the opinion that existing frontier models are within an order of magnitude of a human mind, with existing hardware.  It will be interesting to see how a sensible person gets to a different conclusion. 

I am also trained as an electrical engineer, so we’re already thinking from a common point of view.

Comment by Carl Feynman (carl-feynman) on Controlling AGI Risk · 2024-03-15T20:27:27.223Z · LW · GW

I’m going to say some critical stuff about this post.  I hope I can do it without giving offense.  This is how it seemed to one reader.  I’m offering this criticism exactly because this post is, in important ways, good, and I’d like to see the author get better.
 

This is a long, careful post that boils down to “Someone will have to do something.”  Okay, but what?  It operates at a very high level of abstraction, dipping down into the concrete only for a few sentences about chair construction.  It was ultimately unsatisfying to me.  I felt like it wrote some checks and left them for other people to cash.  I felt like the notion of a sociotechnical system, and the need for an all-of-society response to AI, were novel and potentially important.  I look forward to seeing how the author develops them.
 

This post seems to attempt to recapitulate the history of the AI risk discussion in a few aphoristic paragraphs, for somebody who’s never heard it before.  Who’s the imagined audience for this piece?  Certainly not the habitual Less Wrong reader, who has already read “List of Lethalities” or its equivalent.  But it is equally inappropriate for the AI novice, who needs the alarming facts spelled out more slowly and carefully.  I suspect it would help if the author clarified in their mind who they imagine is reading it.

The post has the outward structure of a logical proof, with definitions, axioms, and a proposition.  But none of the points follow from one another with the rigor that would justify such a setup.  When I read a math paper, I need all those things spelled out, because I might spend fifteen minutes reading a five-line definition, or need to repeatedly refer back to a theorem from several pages ago.  But this is just an essay, with its lower standards of logical rigor, and a greater need for readability.  You’re just LARPing mathematics.  It doesn’t make the essay more convincing.

Comment by Carl Feynman (carl-feynman) on Notes from a Prompt Factory · 2024-03-10T18:01:29.957Z · LW · GW

Wow, that was shockingly unpleasant. I regret reading it. I don’t know why it affected me so much, when I don’t think of myself as a notably tender-minded person.

I recognize that like Richard Ngo’s other stories, it is both of good literary quality and a contribution to the philosophical discussion around AI. It certainly deserves a place on this site. But perhaps it could be preceded by a content warning?

Comment by Carl Feynman (carl-feynman) on The Pareto Best and the Curse of Doom · 2024-02-23T15:36:24.485Z · LW · GW

Could you give some examples of the Curse of Doom?  You’ve described it at a high level, but I cannot think of any examples after thinking about it for a while.

I’m highly experienced at the combination of probability theory, algorithms, and big-business data processing.  Big businesses have a data problem; they ask a consultant from my company; the consultant realizes there’s a probabilistic algorithm component to the problem; and they call me.  I guess if I didn’t exist, that would be a Curse of Doom, but calling it a Curse seems pretty farfetched.  If I weren’t around, a few big companies would have slightly less efficient algorithms.  It’s millions of dollars over the years, but not a big deal in the scheme of things.

Also, “Curse of Doom” is an extremely generic term.  You might find it sticks in people’s brains better if you give it a more specific name.  “Curse of the missing polymath”?

Comment by Carl Feynman (carl-feynman) on Alexander Gietelink Oldenziel's Shortform · 2024-02-20T13:00:03.558Z · LW · GW

Jellyfish have nematocysts; each one is a spear on a rope, with poison on the tip.  The spear has barbs, so when it goes in, it sticks.  Then the jellyfish pulls in its prey.  The spears are microscopic, but very abundant.

Comment by Carl Feynman (carl-feynman) on Alexander Gietelink Oldenziel's Shortform · 2024-02-20T12:43:29.394Z · LW · GW

It’s possible to filter out a constant high value, but not possible to filter out a high level of noise.  Unfortunately, warmth = random vibration = noise.  If you want a low-noise thermal camera, you have to cool the detector, or only look for hot things, like engine flares.  Fighter planes do both.