LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

← previous page (newer posts) · next page (older posts) →

Fake thinking and real thinking
Joe Carlsmith (joekc) · 2025-01-28T20:05:06.735Z · comments (11)

Three Months In, Evaluating Three Rationalist Cases for Trump
Arjun Panickssery (arjun-panickssery) · 2025-04-18T08:27:27.257Z · comments (21)

My model of what is going on with LLMs
Cole Wyeth (Amyr) · 2025-02-13T03:43:29.447Z · comments (49)

[link] A short course on AGI safety from the GDM Alignment team
Vika · 2025-02-14T15:43:50.903Z · comments (1)

C'mon guys, Deliberate Practice is Real
Raemon · 2025-02-05T22:33:59.069Z · comments (25)

Third-wave AI safety needs sociopolitical thinking
Richard_Ngo (ricraz) · 2025-03-27T00:55:30.548Z · comments (23)

AI Control May Increase Existential Risk
Jan_Kulveit · 2025-03-11T14:30:05.972Z · comments (13)

[link] What the Headlines Miss About the Latest Decision in the Musk vs. OpenAI Lawsuit
garrison · 2025-03-06T19:49:02.145Z · comments (0)

Reviewing LessWrong: Screwtape's Basic Answer
Screwtape · 2025-02-05T04:30:34.347Z · comments (18)

Timaeus in 2024
Jesse Hoogland (jhoogland) · 2025-02-20T23:54:56.939Z · comments (1)

The Lizardman and the Black Hat Bobcat
Screwtape · 2025-04-06T19:02:01.238Z · comments (13)

How I talk to those above me
Maxwell Peterson (maxwell-peterson) · 2025-03-30T06:54:59.869Z · comments (13)

Show, not tell: GPT-4o is more opinionated in images than in text
Daniel Tan (dtch1997) · 2025-04-02T08:51:02.571Z · comments (41)

The Rising Sea
Jesse Hoogland (jhoogland) · 2025-01-25T20:48:52.971Z · comments (2)

How training-gamers might function (and win)
Vivek Hebbar (Vivek) · 2025-04-11T21:26:18.669Z · comments (4)

Six Thoughts on AI Safety
boazbarak · 2025-01-24T22:20:50.768Z · comments (55)

[link] Towards a scale-free theory of intelligent agency
Richard_Ngo (ricraz) · 2025-03-21T01:39:42.251Z · comments (24)

Dear AGI,
Nathan Young · 2025-02-18T10:48:15.030Z · comments (11)

[link] Elite Coordination via the Consensus of Power
Richard_Ngo (ricraz) · 2025-03-19T06:56:44.825Z · comments (15)

Tips and Code for Empirical Research Workflows
John Hughes (john-hughes) · 2025-01-20T22:31:51.498Z · comments (14)

We should start looking for scheming "in the wild"
Marius Hobbhahn (marius-hobbhahn) · 2025-03-06T13:49:39.739Z · comments (4)

How To Believe False Things
Eneasz · 2025-04-02T16:28:29.055Z · comments (10)

On Emergent Misalignment
Zvi · 2025-02-28T13:10:05.973Z · comments (5)

[link] Anthropic releases Claude 3.7 Sonnet with extended thinking mode
LawrenceC (LawChan) · 2025-02-24T19:32:43.947Z · comments (8)

[link] Wired on: "DOGE personnel with admin access to Federal Payment System"
Raemon · 2025-02-05T21:32:11.205Z · comments (45)

What goals will AIs have? A list of hypotheses
Daniel Kokotajlo (daniel-kokotajlo) · 2025-03-03T20:08:31.539Z · comments (19)

How I force LLMs to generate correct code
claudio · 2025-03-21T14:40:19.211Z · comments (7)

[link] The Manhattan Trap: Why a Race to Artificial Superintelligence is Self-Defeating
Corin Katzke (corin-katzke) · 2025-01-21T16:57:00.998Z · comments (11)

Voting Results for the 2023 Review
Raemon · 2025-02-06T08:00:37.461Z · comments (3)

Vacuum Decay: Expert Survey Results
JessRiedel · 2025-03-13T18:31:17.434Z · comments (26)

The Risk of Gradual Disempowerment from AI
Zvi · 2025-02-05T22:10:06.979Z · comments (15)

Stargate AI-1
Zvi · 2025-01-24T15:20:18.752Z · comments (1)

One-shot steering vectors cause emergent misalignment, too
Jacob Dunefsky (jacob-dunefsky) · 2025-04-14T06:40:41.503Z · comments (6)

OpenAI #11: America Action Plan
Zvi · 2025-03-18T12:50:03.880Z · comments (3)

How might we safely pass the buck to AI?
joshc (joshua-clymer) · 2025-02-19T17:48:32.249Z · comments (58)

A Slow Guide to Confronting Doom
Ruby · 2025-04-06T02:10:56.483Z · comments (20)

Ambiguous out-of-distribution generalization on an algorithmic task
Wilson Wu (wilson-wu) · 2025-02-13T18:24:36.160Z · comments (6)

Keltham's Lectures in Project Lawful
Morpheus · 2025-04-01T10:39:47.973Z · comments (4)

[link] ASI existential risk: Reconsidering Alignment as a Goal
habryka (habryka4) · 2025-04-15T19:57:42.547Z · comments (14)

The Mask Comes Off: A Trio of Tales
Zvi · 2025-02-14T15:30:15.372Z · comments (1)

Mistral Large 2 (123B) exhibits alignment faking
Marc Carauleanu (Marc-Everin Carauleanu) · 2025-03-27T15:39:02.176Z · comments (4)

Open problems in emergent misalignment
Jan Betley (jan-betley) · 2025-03-01T09:47:58.889Z · comments (13)

You will crash your car in front of my house within the next week
Richard Korzekwa (Grothor) · 2025-04-01T21:43:21.472Z · comments (6)

Microplastics: Much Less Than You Wanted To Know
jenn (pixx) · 2025-02-15T19:08:14.561Z · comments (8)

MONA: Managed Myopia with Approval Feedback
Seb Farquhar · 2025-01-23T12:24:18.108Z · comments (29)

Go home GPT-4o, you’re drunk: emergent misalignment as lowered inhibitions
Stuart_Armstrong · 2025-03-18T14:48:54.762Z · comments (12)

[PAPER] Jacobian Sparse Autoencoders: Sparsify Computations, Not Just Activations
Lucy Farnik (lucy.fa) · 2025-02-26T12:50:04.204Z · comments (8)

Elon Musk May Be Transitioning to Bipolar Type I
Cyborg25 · 2025-03-11T17:45:06.599Z · comments (22)

Announcing ILIAD2: ODYSSEY
Alexander Gietelink Oldenziel (alexander-gietelink-oldenziel) · 2025-04-03T17:01:06.004Z · comments (1)

[link] Preparing for the Intelligence Explosion
fin · 2025-03-11T15:38:29.524Z · comments (17)

← previous page (newer posts) · next page (older posts) →

Archive

Recent comments

neel-nanda-1 on aog's Shortform

I consider the deepmind safety team to have its own scene that is distinct from any specific outside view, though closest to the constellation one. Our AGI safety approach spells some of this out https://deepmind.google/discover/blog/taking-a-responsible-path-to-agi/

mis-understandings on Mis-Understandings's Shortform

In short, it seems like the current system unfairly kills drugs that take a long time to develop and do not have a patentable change in the last few years of that cycle.

bitnotri on Understanding and overcoming AGI apathy

Would it have been better in counterfactual world to develop GPT-2/3 much sooner, as soon as enough compute was available or would it just lead to racing sooner and burn the timeline even more? The compute overhang argument has been drawn before and given the current state of the field I do wonder where we would be if capabilities were visible on lower level much sooner.

mis-understandings on Mis-Understandings's Shortform

If the story about drug prices and price controls is correct (that price controls are bad because the limiting factor for drug development is returns on capital, which this reduces), then we must rethink the political economy of drug development.

Basically, we would expect if that to be the case that the sectoral return rates of biotech to match the risk adjusted rate , but drug development is both risky and skewed, effecting costs of capital.

Most of drug prices are capital costs, and so interventions that lower the capital costs of pharmaceutical companies might produce more drugs.

Most of those capital costs from the total raise required, which is effected basically by the costs of pharmaceutical research (which is probably mostly the labor of expensive professionals).

The expected rate of return is dominated by the risks of pharmaceutical companies.

Drug prices are what the market will bear/monopoly for a time, then drop to a very low level once a compound is generic.

There is a big problem here with out of patent molecules, since if a drug is covered by a patent and stalls 20 years, there is not the return to push it through the process, which means that there might be zombie drugs around from companies that fell apart and did a bad job of selling that asset (so it did not finish the process and did not fail the process).

There seems to be space for the various approvals to become more IP like (so that all drugs have the same exclusivity, regardless of how long they took to prove out).

aynonymousprsn123 on Moral patienthood of simulated minds allows uncountabe infinity of value on finite hardware

I'd like to offer a counterargument, that, I'll admit, can get into some pretty gnarly philosophical territory quite quickly.

Premise 1: We are not simulated minds—we are real, biological observers.

Premise 2: We can treat ourselves as a random sample drawn from the set of all conscious minds, with each mind weighted by some measure—i.e., a way of assigning significance or “probability” to different observers. The exact nature of this measure is still debated in cosmology and philosophy of mind.

Inference: If we really are a typical observer (as Premise 2 assumes), and yet we are not simulated (as Premise 1 asserts), then the measure must assign significantly greater weight to real biological observers than to simulated ones. This must be true even if there are vastly more simulations in a numerical sense—even uncountably infinitely more—because our non-simulated status would be extremely improbable otherwise.

Conclusion: So, under the assumption that we are typical, our existence as real observers implies that simulated minds must have much lower measure than real ones. Therefore, even if digital minds exist in large numbers, they may not matter proportionally in ethical calculations—since their measure, not just their count, determines their relevance. This gives us reason to think utilitarianism, when properly weighted by measure, may still prioritize the welfare of real, biological minds.

nulevel on Why Should I Assume CCP AGI is Worse Than USG AGI?

I think the the assumption is that this is the USG of the last 50 years - which has flaws, but also has human rights goals and an ability to eventually change and accommodate the public’s beliefs.

So in the scenario where AI is controlled by a strongly democratic USG, you have a much more robust “alignment” to enlightenment values and no one person with too much power.

That said, that’s probably a flawed assumption for how the US government operates now/ over the next decade.

edmund-nelson on Is Gemini now better than Claude at Pokémon?

I'll mention beating pokemon isn't that big of a challenge in and of itself, what's important here is that this thing that wasn't trained to do pokemon can. *

Depending on how strict you want to be with what you call AI beating pokemon we have Ai's that beat pokemon in less than 2 hours or if you want to go with the interpretation that "AI beating pokemon is a program that beats pokemon" we have Ai's that beat pokemon in less than 2 minutes or less than 1:30 if you want a more strict definition of "beat the game".

mitchell_porter on Finance and AI Timelines

I read this with interest, but without much ability to think for myself about what's next. I am aware that enormous amounts of money circulate in the modern world, but it's out of my reach; my idea of how to raise money would be to open a Patreon account.

Nonetheless, what do we have to work with? We have the AI 2027 scenario. We have the trade war, which may yet evolve into a division of the world into currency zones [LW · GW]. Vladimir Nesov is keeping track [LW · GW] of how much compute is needed to keep scaling, how much is available, and how much it costs. Remmelt has been telling us to prepare for an AI crash [? · GW], even before the tariffs. We should also remember that China is a player. It would be wacky if the American ability to keep scaling collapsed so completely that China was the only remaining player with the ability to reach superintelligence; or if both countries were hobbled by economic crisis; but that doesn't seem very likely. What seems more likely is that the risk of losing the AI race would be enough for both countries to use state financial resources, to keep going if private enterprise no longer had the means.

Your idea is that AI companies have the valuations they do, not because investors want to create world-transforming superintelligence per se, but because investors think these companies have the potential to become profitable tech giants like Google, Facebook, or Microsoft; and if money gets tight, investors will demand that they start turning a profit, which means they'll have to focus on making products rather than on scaling and pure research, which will slow down the timeline to superintelligence.

It makes sense as a scenario. But I find it interesting that (in the opinion of many), one of the tech giants recently got to the front of the race - I'm talking about Google with Gemini 2.5. Or at least, it is sharing the lead, now that OpenAI has released o3, which seems to have roughly similar capabilities. This seems to undermine the dichotomy between frontier AI companies forging ahead on VC money, and tech giants offering products and services that actually turn a profit, since it reminds us that frontier AI work can prosper, even inside the tech giants.

If there is a scaling winter brought on by a bear market, it may be that the model of frontier AI companies living on VC money dies, and that frontier AI survives only within profitable tech giants, or with state backing. In a comment to Remmelt I suggested that Google and xAI have enough money to survive on their own terms, and OpenAI and Anthropic have potential big brothers in the form of Microsoft and Amazon respectively. China has a similar division between big old Internet companies and "AI 2.0" startups that they invest in, so an analogous shakeup there is conceivable.

It occurs to me that if there is an AI slowdown because all the frontier AI startups have to submit themselves to profit-making Internet giants, it will also give the advocates of an AI pause a moment to reenter the scene and push for e.g. an American-Chinese agreement similar to the slow timeline in "AI 2027". American and Chinese agreement on anything might seem far away now, but things can change quickly, especially if the dust settles from the trade war and both countries have arrived at a new economic strategy and equilibrium.

I still feel like such changes don't affect the trajectory much; no matter what the economic and political circumstances, a world that had o3-level AI in it is only a few more steps away from superintelligence, it seems to me (and getting there by further brute scaling is just the dumbest way to do it, I'm sure there are enormous untapped latent capabilities within the hardware and software that we already have). But it's good to be able to think about the nuances of the situation, so thanks for your contribution.

mateusz-baginski on Why Should I Assume CCP AGI is Worse Than USG AGI?

To steelman a devil's advocate: If your intent-aligned AGI/ASI went something like

oh, people want the world to be according to their preferences but whatever normative system one subscribes to, the current implicit preference aggregation method is woefully suboptimal, so let me move the world's systems to this other preference aggregation method which is much more nearly-Pareto-over-normative-uncertainty-optimal than the current preference aggregation method

and this would be, in an important sense, more democratic, because the people (/demos) would have more influence over their societies.

rasool on jacquesthibs's Shortform

Might Leopold Aschenbrenner also be involved? He runs an investment fund with money from Nat Friedman, Daniel Gross, and Patrick Collison, so the investment in Mechanize might have come from that?

https://situationalawarenesslp.com/

https://www.forourposterity.com/