LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

next page (older posts) →

“The Era of Experience” has an unsolved technical alignment problem
Steven Byrnes (steve2152) · 2025-04-24T13:57:38.984Z · comments (12)
[link] Modifying LLM Beliefs with Synthetic Document Finetuning
RowanWang (KevinRoWang) · 2025-04-24T21:15:17.366Z · comments (11)
The Intelligence Curse: an essay series
L Rudolf L (LRudL) · 2025-04-24T12:59:15.247Z · comments (3)
[link] My Favorite Productivity Blog Posts
Parker Conley (parker-conley) · 2025-04-24T00:32:47.594Z · comments (0)
Reward hacking is becoming more sophisticated and deliberate in frontier LLMs
Kei · 2025-04-24T16:03:57.359Z · comments (4)
AI #113: The o3 Era Begins
Zvi · 2025-04-24T13:40:06.043Z · comments (2)
[link] Token and Taboo
Guive (GAA) · 2025-04-24T20:17:24.987Z · comments (5)
Worries About AI Are Usually Complements Not Substitutes
Zvi · 2025-04-25T20:00:03.421Z · comments (1)
This prompt (sometimes) makes ChatGPT think about terrorist organisations
jakub_krys (kryjak) · 2025-04-24T21:15:15.249Z · comments (8)
Training-time schemers vs behavioral schemers
Alex Mallen (alex-mallen) · 2025-04-24T19:07:55.256Z · comments (0)
Personal evaluation of LLMs, through chess
Karthik Tadepalli · 2025-04-24T07:01:06.221Z · comments (3)
A review of "Why Did Environmentalism Become Partisan?"
David Scott Krueger (formerly: capybaralet) (capybaralet) · 2025-04-25T05:12:50.986Z · comments (0)
Why would AI companies use human-level AI to do alignment research?
MichaelDickens · 2025-04-25T19:12:56.202Z · comments (3)
[link] Will Programmer Compensation Decouple from Productivity?
Gordon Seidoh Worley (gworley) · 2025-04-25T15:32:42.744Z · comments (1)
Zstd Window Size
jefftk (jkaufman) · 2025-04-25T14:40:06.742Z · comments (1)
Finding an Error-Detection Feature in DeepSeek-R1
keith_wynroe · 2025-04-24T16:03:28.675Z · comments (0)
Trouble at Miningtown: Prologue
Quinn (quinn-dougherty) · 2025-04-24T19:09:10.105Z · comments (0)
Academia as a happy place?
jow (jowen) · 2025-04-24T14:03:08.267Z · comments (0)
Who's Working On It? AI-Controlled Experiments
sarahconstantin · 2025-04-25T21:40:02.543Z · comments (0)
[link] AI 2027 Thoughts
PeterMcCluskey · 2025-04-26T00:00:23.699Z · comments (0)
[link] Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?
Matrice Jacobine · 2025-04-24T14:11:27.625Z · comments (3)
LLM Pareto Frontier But Live
winstonBosan · 2025-04-24T21:22:41.801Z · comments (0)
What Physically Distinguishes a Brain with False Beliefs Using a Swimming Pool Example
YanLyutnev (YanLutnev) · 2025-04-24T00:01:41.589Z · comments (0)
List of petitions against OpenAI's for-profit move
Remmelt (remmelt-ellen) · 2025-04-25T10:03:12.026Z · comments (1)
[link] How Democratic Is Effective Altruism — Really?
B Jacobs (Bob Jacobs) · 2025-04-25T16:02:42.915Z · comments (0)
Cognitive Dissonance is Mentally Taxing
SorenJ (Mascal's Pugging) · 2025-04-24T00:38:25.535Z · comments (0)
[link] Intelligence explosion
samuelshadrach (xpostah) · 2025-04-24T06:35:12.561Z · comments (0)
[link] Anticipating AI: Keeping Up With What We Build
Alvin Ånestrand (alvin-anestrand) · 2025-04-24T15:23:08.343Z · comments (0)
[link] [Linkpost] AI War seems unlikely to prevent AI Doom
thenoviceoof · 2025-04-25T20:44:48.267Z · comments (1)
Severe control over AI agents as a tool for mass-surveillance
Andrey Seryakov (andrey-seryakov) · 2025-04-24T20:27:50.860Z · comments (0)
next page (older posts) →