LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

next page (older posts) →

“The Era of Experience” has an unsolved technical alignment problem
Steven Byrnes (steve2152) · 2025-04-24T13:57:38.984Z · comments (12)

[link] Modifying LLM Beliefs with Synthetic Document Finetuning
RowanWang (KevinRoWang) · 2025-04-24T21:15:17.366Z · comments (11)

The Intelligence Curse: an essay series
L Rudolf L (LRudL) · 2025-04-24T12:59:15.247Z · comments (3)

[link] My Favorite Productivity Blog Posts
Parker Conley (parker-conley) · 2025-04-24T00:32:47.594Z · comments (0)

Reward hacking is becoming more sophisticated and deliberate in frontier LLMs
Kei · 2025-04-24T16:03:57.359Z · comments (4)

AI #113: The o3 Era Begins
Zvi · 2025-04-24T13:40:06.043Z · comments (2)

[link] Token and Taboo
Guive (GAA) · 2025-04-24T20:17:24.987Z · comments (5)

Worries About AI Are Usually Complements Not Substitutes
Zvi · 2025-04-25T20:00:03.421Z · comments (1)

This prompt (sometimes) makes ChatGPT think about terrorist organisations
jakub_krys (kryjak) · 2025-04-24T21:15:15.249Z · comments (8)

Training-time schemers vs behavioral schemers
Alex Mallen (alex-mallen) · 2025-04-24T19:07:55.256Z · comments (0)

Personal evaluation of LLMs, through chess
Karthik Tadepalli · 2025-04-24T07:01:06.221Z · comments (3)

A review of "Why Did Environmentalism Become Partisan?"
David Scott Krueger (formerly: capybaralet) (capybaralet) · 2025-04-25T05:12:50.986Z · comments (0)

Why would AI companies use human-level AI to do alignment research?
MichaelDickens · 2025-04-25T19:12:56.202Z · comments (3)

[link] Will Programmer Compensation Decouple from Productivity?
Gordon Seidoh Worley (gworley) · 2025-04-25T15:32:42.744Z · comments (1)

Zstd Window Size
jefftk (jkaufman) · 2025-04-25T14:40:06.742Z · comments (1)

Finding an Error-Detection Feature in DeepSeek-R1
keith_wynroe · 2025-04-24T16:03:28.675Z · comments (0)

Trouble at Miningtown: Prologue
Quinn (quinn-dougherty) · 2025-04-24T19:09:10.105Z · comments (0)

Academia as a happy place?
jow (jowen) · 2025-04-24T14:03:08.267Z · comments (0)

Who's Working On It? AI-Controlled Experiments
sarahconstantin · 2025-04-25T21:40:02.543Z · comments (0)

[link] AI 2027 Thoughts
PeterMcCluskey · 2025-04-26T00:00:23.699Z · comments (0)

[link] Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?
Matrice Jacobine · 2025-04-24T14:11:27.625Z · comments (3)

LLM Pareto Frontier But Live
winstonBosan · 2025-04-24T21:22:41.801Z · comments (0)

What Physically Distinguishes a Brain with False Beliefs Using a Swimming Pool Example
YanLyutnev (YanLutnev) · 2025-04-24T00:01:41.589Z · comments (0)

List of petitions against OpenAI's for-profit move
Remmelt (remmelt-ellen) · 2025-04-25T10:03:12.026Z · comments (1)

[link] How Democratic Is Effective Altruism — Really?
B Jacobs (Bob Jacobs) · 2025-04-25T16:02:42.915Z · comments (0)

Cognitive Dissonance is Mentally Taxing
SorenJ (Mascal's Pugging) · 2025-04-24T00:38:25.535Z · comments (0)

[link] Intelligence explosion
samuelshadrach (xpostah) · 2025-04-24T06:35:12.561Z · comments (0)

[link] Anticipating AI: Keeping Up With What We Build
Alvin Ånestrand (alvin-anestrand) · 2025-04-24T15:23:08.343Z · comments (0)

[link] [Linkpost] AI War seems unlikely to prevent AI Doom
thenoviceoof · 2025-04-25T20:44:48.267Z · comments (1)

Severe control over AI agents as a tool for mass-surveillance
Andrey Seryakov (andrey-seryakov) · 2025-04-24T20:27:50.860Z · comments (0)

next page (older posts) →

Archive

Recent comments

knight-lee on Worries About AI Are Usually Complements Not Substitutes

Are there any suggestions for how to get this message across? To all those AI x-risk disbelievers?

michaeldickens on o3 Is a Lying Liar

Huh. I knew that's how ChatGPT worked but I had assumed they would've worked out a less hacky solution by now!

ete on Jaan Tallinn's 2024 Philanthropy Overview

You've funded a what looks from my vantage point to be a huge portion of the quality-adjusted attempts to avert doom, perhaps a majority. Much appreciation for stepping up for humanity.

ebenezer-dukakis on less-wronger-numb89's Shortform

Technically the point of going to college is to help you thrive in the rest of your life after college. If you believe in AI 2027, the most important thing for the rest of your life is for AI to be developed responsibly. So, maybe work on that instead of college?

I think the EU could actually be good place to protest for an AI pause. Because the EU doesn't have national AI ambitions, and the EU is increasingly skeptical of the US, it seems to me that a bit of protesting could do a lot to raise awareness of the reckless path that the US is taking. That, in turn, could motivate the EU to apply leverage via ASML, sanctions, etc.

The only thing I'm worried about is that EU criticism of the US could create anti-EU polarization among the GOP in the US, which motivates them to be more reckless on AI. This question seems worth a lot more study.

nate-showell on This prompt (sometimes) makes ChatGPT think about terrorist organisations

Have you tried seeing how ChatGPT responds to individual lines of code from that excerpt? There might be an anomalous token in it along the lines of " petertodd" [LW · GW].

anthonyc on Fish and Faces

I'd say that in most contexts in normal human life, (3) is the thing that makes this less of an issue for (1) and (2). If the thing I'm hearing about it real, I'll probably keep hearing about it, and from more sources. If I come across 100 new crazy-seeming ideas and decide to indulge them 1% of the time, and so do many other people, that's usually, probably enough to amplify the ones that (seem to) pan out. By the time I hear about the thing from 2, 5, or 20 sources, I will start to suspect it's worth thinking about at a higher level.

veedrac on less-wronger-numb89's Shortform

Ultimately you have to make a bet on your guesses of reality. If your modal guess is civilizational collapse in 2-3 years, skipping uni is hardly a disproportionate action, but at the same time it's not going to win you much either. Personally I'd leave the uni-or-not decision to the plausible worlds where the choice matters more, and look for some higher leverage change you can make for the rest.

snewman on AI 2027 is a Bet Against Amdahl's Law

I added up the median "Predictions for gap size" in the "How fast can the task difficulty gaps be crossed?" table, summing each set of predictions separately ("Eli", "Nikola", "FutureSearch") to get three numbers ranging from 30-75.

Does this table cover the time between now and superhuman coder? I thought it started at RE-Bench, because:

I took all of this to be in context of the phrase, about one page back, "For each gap after RE-Bench saturation"
The earlier explanation that Method 2 is "a more complex model starting from a forecast saturation of an AI R&D benchmark (RE-Bench), and then how long it will take to go from that system to one that can handle real-world tasks at the best AGI company" [emphasis added]
The first entry in the table ("Time horizon: Achieving tasks that take humans lots of time") sounds more difficult than saturating RE-Bench.
Earlier, there's a separate discussion forecasting time to RE-bench saturation.

But sounds like I was misinterpreting?

anthonyc on AI 2027 is a Bet Against Amdahl's Law

Exactly. More fundamentally, that is not a probability graph, it's a probability density graph, and we're not shown the line beyond 2032 but just have to assume the integral from 2100-->infinity is >10% of the integral from 0-->infinity. Infinity is far enough away that the decay doesn't even need to be all that slow for the total to be that high.

linch on You Better Mechanize

(There’s a funny aside on Thermopylae, and the limits of ‘excellent leadership,’ yes they did well but they ultimately lost. To which I would respond, they only ultimately lost because they got outflanked, but also in this case ‘good leadership’ involves a much bigger edge. A better example is, classically, Cortes, who they mention later. Who had to fight off another Spanish force and then still won. But hey.)

I know this is an aside to your aside, but as an avid Sparta-hater, I want to point out that we don't have much evidence that Spartans are good at military leadership, and indeed plenty of evidence in the other direction:

Sparta's opponents often won using clever, innovative tactics, such as at the Battle of Phyle (404 BC), the Battle of Olpae (426 BC), the Battle of Cyzicus (410 BC), the Battle of Arginusae (406 BC), Battle of Tegyra (375 BC), the Battle of Leuctra (371 BC), and the 2nd Battle of Mantinea (362 BC). The Spartans, so far as I have found in reviewing these 51 battles, never made a single creative strategic or tactical innovation. They were sometimes clever; mostly through treachery and trickery.

That said, you have to remember that pretty much all of the primary sources you read on this topic are written by Athenians for Athenians, and less about preserving an accurate historical record for future generations (or even propaganda/promoting internal Athenian solidarity) and more about making specific political points for their own internecine fights. So you should pay more attention to what's objectively verifiable (things like win-loss records, inputs and outputs) and less about the overall vibe that they present.