LessWrong 2.0 Reader

Anomalous Tokens in DeepSeek-V3 and r1
henry (henry-bass) · 2025-01-25T22:55:41.232Z · comments (2)
Ten people on the inside
Buck · 2025-01-28T16:41:22.990Z · comments (4)
[link] Attribution-based parameter decomposition
Lucius Bushnaq (Lblack) · 2025-01-25T13:12:11.031Z · comments (10)
“Sharp Left Turn” discourse: An opinionated review
Steven Byrnes (steve2152) · 2025-01-28T18:47:04.395Z · comments (1)
My supervillain origin story
Dmitry Vaintrob (dmitry-vaintrob) · 2025-01-27T12:20:46.101Z · comments (0)
The Rising Sea
Jesse Hoogland (jhoogland) · 2025-01-25T20:48:52.971Z · comments (2)
Should you go with your best guess?: Against precise Bayesianism and related views
Anthony DiGiovanni (antimonyanthony) · 2025-01-27T20:25:26.809Z · comments (6)
Kessler's Second Syndrome
Jesse Hoogland (jhoogland) · 2025-01-26T07:04:17.852Z · comments (2)
On polytopes
Dmitry Vaintrob (dmitry-vaintrob) · 2025-01-25T13:56:35.681Z · comments (5)
Brainrot
Jesse Hoogland (jhoogland) · 2025-01-26T05:35:35.396Z · comments (0)
Why care about AI personhood?
Francis Rhys Ward (francis-rhys-ward) · 2025-01-26T11:24:45.596Z · comments (6)
The Game Board has been Flipped: Now is a good time to rethink what you’re doing
Alex Lintz (alex-lintz) · 2025-01-28T23:36:18.106Z · comments (2)
DeepSeek Panic at the App Store
Zvi · 2025-01-28T19:30:07.555Z · comments (12)
Agents don't have to be aligned to help us achieve an indefinite pause.
Hastings (hastings-greer) · 2025-01-25T18:51:03.523Z · comments (0)
Operator
Zvi · 2025-01-28T20:00:08.374Z · comments (1)
[question] Is the output of the softmax in a single transformer attention head usually winner-takes-all?
Linda Linsefors · 2025-01-27T15:33:28.992Z · answers+comments (1)
The generalization phase diagram
Dmitry Vaintrob (dmitry-vaintrob) · 2025-01-26T20:30:15.212Z · comments (2)
[link] Reinforcement Learning by AI Punishment
Abhishaike Mahajan (abhishaike-mahajan) · 2025-01-28T00:57:51.715Z · comments (0)
The Upcoming PEPFAR Cut Will Kill Millions, Many of Them Children
omnizoid · 2025-01-27T16:03:51.214Z · comments (2)
Fake thinking and real thinking
Joe Carlsmith (joekc) · 2025-01-28T20:05:06.735Z · comments (0)
so you have a chronic health issue
agencypilled · 2025-01-26T19:00:29.972Z · comments (5)
[link] Are we trying to figure out if AI is conscious?
Kristaps Zilgalvis (kristaps-zilgalvis-1) · 2025-01-27T01:05:07.001Z · comments (5)
AI Strategy Updates that You Should Make
Alice Blair (Diatom) · 2025-01-27T21:10:41.838Z · comments (2)
The memorization-generalization spectrum and learning coefficients
Dmitry Vaintrob (dmitry-vaintrob) · 2025-01-28T16:53:24.628Z · comments (0)
[link] Notes on Argentina
Annapurna (jorge-velez) · 2025-01-26T03:51:15.393Z · comments (5)
Deference and Decision-Making
ben_levinstein (benlev) · 2025-01-27T22:02:17.578Z · comments (0)
The present perfect tense is ruining your life
PatrickDFarley · 2025-01-27T16:14:48.843Z · comments (7)
[link] Lazy Hasselback Pommes Anna
Brendan Long (korin43) · 2025-01-26T21:30:36.587Z · comments (18)
How different LLMs answered PhilPapers 2020 survey
Satron · 2025-01-27T21:41:12.334Z · comments (1)
SAE regularization produces more interpretable models
Peter Lai (peter-lai) · 2025-01-28T20:02:56.662Z · comments (2)
Monet: Mixture of Monosemantic Experts for Transformers Explained
CalebMaresca (caleb-maresca) · 2025-01-25T19:37:09.078Z · comments (2)
Nvidia doesn’t just sell shovels
winstonBosan · 2025-01-28T04:56:38.720Z · comments (3)
[question] Recommendations for Recent Posts/Sequences on Instrumental Rationality?
Benjamin Hendricks (benjamin-hendricks) · 2025-01-26T00:41:08.577Z · answers+comments (3)
Detecting out of distribution text with surprisal and entropy
Sandy Fraser (alex-fraser) · 2025-01-28T18:46:46.977Z · comments (3)
[link] Anatomy of a Dance Class: A step by step guide
Nathan Young · 2025-01-26T18:02:04.974Z · comments (0)
[link] Links and short notes, 2025-01-26: Atlas Shrugged and the irreplaceable founder, pumping stations and civic pride, and thoughts on the eve of AGI
jasoncrawford · 2025-01-26T20:52:51.416Z · comments (1)
Starting an Egan High School
Chris Wintergreen · 2025-01-26T19:02:17.658Z · comments (2)
[question] AI Safety in secret
Michael Flood (michael-flood) · 2025-01-25T18:16:03.181Z · answers+comments (0)
The Clueless Sniper and the Principle of Indifference
Jim Buhler (jim-buhler) · 2025-01-27T11:52:57.978Z · comments (19)
[question] A Floating Cube - Rejected HLE submission
Shankar Sivarajan (shankar-sivarajan) · 2025-01-25T04:52:22.194Z · answers+comments (1)
[link] Narratives as catalysts of catastrophic trajectories
EQ · 2025-01-26T19:01:21.558Z · comments (0)
If you wanted to actually reduce the trade deficit, how would you do it?
Logan Zoellner (logan-zoellner) · 2025-01-26T18:04:54.702Z · comments (5)
Jevon's paradox and economic intuitions
Abhimanyu Pallavi Sudhir (abhimanyu-pallavi-sudhir) · 2025-01-27T23:04:23.854Z · comments (0)
[question] Supposing that the "Dead Internet Theory" is true or largely true, how can we act on that information?
SpectrumDT · 2025-01-27T16:47:01.338Z · answers+comments (4)
[link] Understanding AI World Models w/ Chris Canal
jacobhaimes · 2025-01-27T16:32:47.724Z · comments (0)
Using an LLM for creative writing feels wrong to me
Declan Molony (declan-molony) · 2025-01-28T06:42:24.799Z · comments (13)
[link] A concise definition of what it means to win
testingthewaters · 2025-01-25T06:37:37.305Z · comments (0)
Disproving the "People-Pleasing" Hypothesis for AI Self-Reports of Experience
rife (edgar-muniz) · 2025-01-26T15:53:10.530Z · comments (18)
Death vs. Suffering: The Endurist-Serenist Divide on Life’s Worst Fate
Alex_Steiner · 2025-01-27T03:59:40.279Z · comments (7)
Scanless Whole Brain Emulation
Knight Lee (Max Lee) · 2025-01-27T10:00:08.036Z · comments (0)