LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

← previous page (newer posts) · next page (older posts) →

[link] The US Executive vs Supreme Court Deportations Clash
NunoSempere (Radamantis) · 2025-04-21T19:56:03.711Z · comments (12)
Tabula Bio: towards a future free of disease (& looking for collaborators)
mpoon (michael-poon) · 2025-03-23T16:30:15.523Z · comments (15)
On GPT-4.5
Zvi · 2025-03-03T13:40:05.843Z · comments (12)
Virtue signaling, and the "humans-are-wonderful" bias, as a trust exercise
lc · 2025-02-13T06:59:17.525Z · comments (16)
[link] Automated Researchers Can Subtly Sandbag
gasteigerjo · 2025-03-26T19:13:26.879Z · comments (0)
o3 Will Use Its Tools For You
Zvi · 2025-04-18T21:20:02.566Z · comments (3)
Superintelligent Agents Pose Catastrophic Risks: Can Scientist AI Offer a Safer Path?
Yoshua Bengio (yoshua-bengio) · 2025-02-24T18:31:48.580Z · comments (15)
Why care about AI personhood?
Francis Rhys Ward (francis-rhys-ward) · 2025-01-26T11:24:45.596Z · comments (6)
Handling schemers if shutdown is not an option
Buck · 2025-04-18T14:39:18.609Z · comments (0)
A Dissent on Honesty
eva_ · 2025-04-15T02:43:44.163Z · comments (52)
Paper
dynomight · 2025-04-11T12:20:04.200Z · comments (12)
ALLFED emergency appeal: Help us raise $800,000 to avoid cutting half of programs
denkenberger · 2025-04-16T21:47:40.687Z · comments (9)
Self-dialogue: Do behaviorist rewards make scheming AGIs?
Steven Byrnes (steve2152) · 2025-02-13T18:39:37.770Z · comments (0)
Putting up Bumpers
Sam Bowman (sbowman) · 2025-04-23T16:05:05.476Z · comments (13)
[link] Could Advanced AI Accelerate the Pace of AI Progress? Interviews with AI Researchers
jleibowich · 2025-03-03T19:05:31.212Z · comments (1)
AI #108: Straight Line on a Graph
Zvi · 2025-03-20T13:50:00.983Z · comments (5)
The first AI war will be in your computer
Viliam · 2025-04-08T09:28:53.191Z · comments (10)
Brainrot
Jesse Hoogland (jhoogland) · 2025-01-26T05:35:35.396Z · comments (0)
[link] The Takeoff Speeds Model Predicts We May Be Entering Crunch Time
johncrox · 2025-02-21T02:26:31.768Z · comments (3)
[link] My Favorite Productivity Blog Posts
Parker Conley (parker-conley) · 2025-04-24T00:32:47.594Z · comments (0)
AI #109: Google Fails Marketing Forever
Zvi · 2025-03-27T14:50:01.825Z · comments (12)
An Advent of Thought
Kaarel (kh) · 2025-03-17T14:21:08.765Z · comments (8)
[link] Sentinel's Global Risks Weekly Roundup #15/2025: Tariff yoyo, OpenAI slashing safety testing, Iran nuclear programme negotiations, 1K H5N1 confirmed herd infections.
NunoSempere (Radamantis) · 2025-04-14T19:11:20.977Z · comments (0)
How accurate was my "Altered Traits" book review?
lsusr · 2025-02-18T17:00:55.584Z · comments (3)
A City Within a City
Declan Molony (declan-molony) · 2025-02-24T15:51:19.118Z · comments (1)
[link] Paths and waystations in AI safety
Joe Carlsmith (joekc) · 2025-03-11T18:52:57.772Z · comments (1)
AI #112: Release the Everything
Zvi · 2025-04-17T15:10:02.029Z · comments (6)
Follow me on TikTok
lsusr · 2025-04-01T08:22:29.521Z · comments (8)
Analyzing long agent transcripts (Docent)
jsteinhardt · 2025-03-24T20:49:54.472Z · comments (2)
[link] The case for AGI by 2030
Benjamin_Todd · 2025-04-09T20:35:55.167Z · comments (6)
Response to Scott Alexander on Imprisonment
Zvi · 2025-03-11T20:40:06.250Z · comments (4)
Why Can't We Hypothesize After the Fact?
David Udell · 2025-02-26T22:41:39.819Z · comments (3)
An overview of control measures
ryan_greenblatt · 2025-03-24T23:16:49.400Z · comments (0)
Proof idea: SLT to AIT
Lucius Bushnaq (Lblack) · 2025-02-10T23:14:24.538Z · comments (15)
[link] what an efficient market feels from inside
DMMF · 2025-02-25T02:38:40.129Z · comments (9)
AI #101: The Shallow End
Zvi · 2025-01-30T14:50:08.269Z · comments (1)
[link] Map of all 40 copyright suits v. AI in U.S.
Remmelt (remmelt-ellen) · 2025-03-26T07:57:58.976Z · comments (3)
SHIFT relies on token-level features to de-bias Bias in Bios probes
Tim Hua · 2025-03-19T21:29:15.974Z · comments (2)
The Intelligence Curse: an essay series
L Rudolf L (LRudL) · 2025-04-24T12:59:15.247Z · comments (3)
Cautions about LLMs in Human Cognitive Loops
Alice Blair (Diatom) · 2025-03-02T19:53:10.253Z · comments (9)
On Writing #1
Zvi · 2025-03-04T13:30:06.103Z · comments (2)
We need (a lot) more rogue agent honeypots
Ozyrus · 2025-03-23T22:24:52.785Z · comments (12)
Scaffolding Skills
Screwtape · 2025-04-18T17:39:25.634Z · comments (8)
[link] Three Types of Intelligence Explosion
rosehadshar · 2025-03-17T14:47:46.696Z · comments (8)
AI #104: American State Capacity on the Brink
Zvi · 2025-02-20T14:50:06.375Z · comments (9)
Notable runaway-optimiser-like LLM failure modes on Biologically and Economically aligned AI safety benchmarks for LLMs with simplified observation format
Roland Pihlakas (roland-pihlakas) · 2025-03-16T23:23:30.989Z · comments (6)
LessOnline 2025: Early Bird Tickets On Sale
Ben Pace (Benito) · 2025-03-18T00:22:02.653Z · comments (4)
Crime and Punishment #1
Zvi · 2025-04-21T15:30:06.420Z · comments (10)
They Took MY Job?
Zvi · 2025-03-21T13:30:38.507Z · comments (4)
[link] Existing Safety Frameworks Imply Unreasonable Confidence
Joe Rogero · 2025-04-10T16:31:50.240Z · comments (2)
← previous page (newer posts) · next page (older posts) →