LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

next page (older posts) →

Why does LW not put much more focus on AI governance and outreach?
Severin T. Seehrich (sts) · 2025-04-12T14:24:54.197Z · comments (28)
[link] Frontier AI Models Still Fail at Basic Physical Tasks: A Manufacturing Case Study
Adam Karvonen (karvonenadam) · 2025-04-14T17:38:02.918Z · comments (4)
One-shot steering vectors cause emergent misalignment, too
Jacob Dunefsky (jacob-dunefsky) · 2025-04-14T06:40:41.503Z · comments (5)
Steelmanning heuristic arguments
Dmitry Vaintrob (dmitry-vaintrob) · 2025-04-13T01:09:33.392Z · comments (0)
Vestigial reasoning in RL
Caleb Biddulph (caleb-biddulph) · 2025-04-13T15:40:11.954Z · comments (7)
How I switched careers from software engineer to AI policy operations
Lucie Philippon (lucie-philippon) · 2025-04-13T06:37:33.507Z · comments (1)
[link] College Advice For People Like Me
henryj · 2025-04-12T14:36:46.643Z · comments (0)
Four Types of Disagreement
silentbob · 2025-04-13T11:22:38.466Z · comments (2)
Try training token-level probes
StefanHex (Stefan42) · 2025-04-14T11:56:23.191Z · comments (0)
[link] Sentinel's Global Risks Weekly Roundup #15/2025: Tariff yoyo, OpenAI slashing safety testing, Iran nuclear programme negotiations, 1K H5N1 confirmed herd infections.
NunoSempere (Radamantis) · 2025-04-14T19:11:20.977Z · comments (0)
MONA: Three Month Later - Updates and Steganography Without Optimization Pressure
David Lindner · 2025-04-12T23:15:07.964Z · comments (0)
Thoughts on the Double Impact Project
Mati_Roy (MathieuRoy) · 2025-04-13T19:07:57.687Z · comments (10)
How to evaluate control measures for LLM agents? A trajectory from today to superintelligence
Tomek Korbak (tomek-korbak) · 2025-04-14T16:45:46.584Z · comments (0)
[link] The 4-Minute Mile Effect
Parker Conley (parker-conley) · 2025-04-14T21:41:27.726Z · comments (2)
[link] Unbendable Arm as Test Case for Religious Belief
Ivan Vendrov (ivan-vendrov) · 2025-04-14T01:57:12.013Z · comments (24)
Will US tariffs push data centers for large model training offshore?
ChristianKl · 2025-04-12T12:47:12.917Z · comments (3)
The Internal Model Principle: A Straightforward Explanation
Alfred Harwood · 2025-04-12T10:58:51.479Z · comments (1)
Monthly Roundup #29: April 2025
Zvi · 2025-04-14T11:50:02.324Z · comments (4)
The Bell Curve of Bad Behavior
Screwtape · 2025-04-14T19:58:10.293Z · comments (3)
Offer: Team Conflict Counseling for AI Safety Orgs
Severin T. Seehrich (sts) · 2025-04-14T15:17:00.835Z · comments (1)
[question] What is autism?
Adam Zerner (adamzerner) · 2025-04-12T18:12:19.468Z · answers+comments (7)
Experts have it easy
beyarkay · 2025-04-12T19:32:17.158Z · comments (3)
The Last Light
Bridgett Kay (bridgett-kay) · 2025-04-14T15:41:02.745Z · comments (0)
Calling Bullshit - the Cheatsheet
Niklas Lehmann · 2025-04-12T11:43:23.822Z · comments (1)
[link] Slopworld 2035: The dangers of mediocre AI
titotal (lombertini) · 2025-04-14T13:14:08.390Z · comments (6)
What are good safety standards for open source AIs from China?
ChristianKl · 2025-04-12T13:06:16.663Z · comments (2)
[question] How likely are the USA to decay and how will it influence the AI development?
StanislavKrym · 2025-04-12T04:42:27.604Z · answers+comments (0)
What if there was a nuke in Manhattan and why that could be a good thing
Ratburn · 2025-04-15T00:19:41.844Z · comments (5)
Commitment Races are a technical problem ASI can easily solve
Knight Lee (Max Lee) · 2025-04-12T22:22:47.790Z · comments (5)
A Dissent on Honesty
eva_ · 2025-04-15T02:43:44.163Z · comments (1)
A Talmudic Rationalist Cautionary Tale
Noah Birnbaum (daniel-birnbaum) · 2025-04-15T04:11:16.972Z · comments (0)
[link] Distributed whistleblowing
samuelshadrach (xpostah) · 2025-04-12T06:36:05.952Z · comments (5)
The Structure of the Pain of Change
ReverendBayes (vedernikov-andrei) · 2025-04-13T21:51:53.823Z · comments (0)
[question] Does this game have a name?
Mis-Understandings (robert-k) · 2025-04-12T01:52:47.584Z · answers+comments (4)
Sam Altman's sister claims Sam sexually abused her -- Part 8: Timeline, continued
pythagoras5015 (pl5015) · 2025-04-14T17:42:53.705Z · comments (0)
[question] Is Local Order a Clue to Universal Entropy? How a Failed Professor Searches for a 'Sacred Motivational Order'
P. João (gabriel-brito) · 2025-04-12T13:39:55.857Z · answers+comments (2)
Creating 'Making God': a Feature Documentary on risks from AGI
Connor Axiotes (connor-axiotes-1) · 2025-04-15T02:56:09.206Z · comments (0)
The Era of the Dividual—are we falling apart?
James Stephen Brown (james-brown) · 2025-04-12T22:35:56.593Z · comments (2)
Self propagating story.
Canaletto (weightt-an) · 2025-04-12T12:32:21.312Z · comments (0)
Луна Лавгуд и Комната Тайн, Часть 4
Kongo Landwalker (kongo-landwalker) · 2025-04-13T20:55:03.281Z · comments (0)
ACX Spring Meetup 2025 @ Klang Valley, Malaysia
Yi-Yang (yiyang) · 2025-04-12T07:31:16.434Z · comments (0)
Луна Лавгуд и Комната Тайн, Часть 3
Kongo Landwalker (kongo-landwalker) · 2025-04-12T19:20:15.846Z · comments (0)
Intro to Multi-Agent Safety
james__p · 2025-04-13T17:40:41.128Z · comments (0)
Луна Лавгуд и Комната Тайн, Часть 4
Kongo Landwalker (kongo-landwalker) · 2025-04-14T00:10:36.028Z · comments (0)
Sam Altman's sister claims Sam sexually abused her -- Part 7: Timeline, continued
pythagoras5015 (pl5015) · 2025-04-14T17:43:28.897Z · comments (0)
Religious Persistence: A Missing Primitive for Robust Alignment
lauriewired · 2025-04-14T22:03:45.868Z · comments (1)
Correcting Deceptive Alignment using a Deontological Approach
JeaniceK · 2025-04-14T22:07:57.860Z · comments (0)
$500 bounty for best short-form fiction about our near future world; $100 for recommending winning piece: new “Art of Near Future World” quarterly art project
Ramon Gonzalez (ramon-gonzalez) · 2025-04-15T00:46:10.637Z · comments (0)
Sam Altman's sister claims Sam sexually abused her -- Part 4: Timeline, continued
pythagoras5015 (pl5015) · 2025-04-13T23:41:55.411Z · comments (0)
Lightning Talks!
nathandunkerley · 2025-04-14T20:39:17.593Z · comments (0)
next page (older posts) →