LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

← previous page (newer posts) · next page (older posts) →

Biology-Inspired AGI Timelines: The Trick That Never Works
Eliezer Yudkowsky (Eliezer_Yudkowsky) · 2021-12-01T22:35:28.379Z · comments (142)
Ironing Out the Squiggles
Zack_M_Davis · 2024-04-29T16:13:00.371Z · comments (36)
My thoughts on the social response to AI risk
Matthew Barnett (matthew-barnett) · 2023-11-01T21:17:08.184Z · comments (37)
What’s up with LLMs representing XORs of arbitrary features?
Sam Marks (samuel-marks) · 2024-01-03T19:44:33.162Z · comments (61)
[link] Gradient Routing: Masking Gradients to Localize Computation in Neural Networks
cloud · 2024-12-06T22:19:26.717Z · comments (12)
EIS XIII: Reflections on Anthropic’s SAE Research Circa May 2024
scasper · 2024-05-21T20:15:36.502Z · comments (16)
Language Models Model Us
eggsyntax · 2024-05-17T21:00:34.821Z · comments (55)
Deep Learning Systems Are Not Less Interpretable Than Logic/Probability/Etc
johnswentworth · 2022-06-04T05:41:56.713Z · comments (55)
Dear Self; We Need To Talk About Social Media
Elizabeth (pktechgirl) · 2021-12-07T00:40:01.949Z · comments (19)
[link] Comp Sci in 2027 (Short story by Eliezer Yudkowsky)
sudo · 2023-10-29T23:09:56.730Z · comments (22)
parenting rules
Dave Orr (dave-orr) · 2020-12-21T19:48:42.365Z · comments (9)
Formal verification, heuristic explanations and surprise accounting
Jacob_Hilton · 2024-06-25T15:40:03.535Z · comments (11)
Nonprofit Boards are Weird
HoldenKarnofsky · 2022-06-23T14:40:11.593Z · comments (26)
Password-locked models: a stress case for capabilities evaluation
Fabien Roger (Fabien) · 2023-08-03T14:53:12.459Z · comments (14)
Negative Feedback and Simulacra
Elizabeth (pktechgirl) · 2020-04-29T02:00:01.734Z · comments (24)
Current safety training techniques do not fully transfer to the agent setting
Simon Lermen (dalasnoin) · 2024-11-03T19:24:51.537Z · comments (8)
[link] The Death of Behavioral Economics
habryka (habryka4) · 2021-08-22T22:39:12.697Z · comments (24)
[link] Conjecture internal survey: AGI timelines and probability of human extinction from advanced AI
Maris Sala (maris-sala) · 2023-05-22T14:31:59.139Z · comments (5)
Introduction to Cartesian Frames
Scott Garrabrant · 2020-10-22T13:00:00.000Z · comments (32)
Meta Questions about Metaphilosophy
Wei Dai (Wei_Dai) · 2023-09-01T01:17:57.578Z · comments (78)
[question] things that confuse me about the current AI market.
DMMF · 2024-08-28T13:46:56.908Z · answers+comments (28)
[question] What are some beautiful, rationalist artworks?
jacobjacob · 2020-10-17T06:32:43.142Z · answers+comments (140)
Omicron Post #7
Zvi · 2021-12-16T17:30:01.676Z · comments (41)
Tips for Empirical Alignment Research
Ethan Perez (ethan-perez) · 2024-02-29T06:04:54.481Z · comments (4)
An Orthodox Case Against Utility Functions
abramdemski · 2020-04-07T19:18:12.043Z · comments (65)
Your posts should be on arXiv
JanB (JanBrauner) · 2022-08-25T10:35:12.087Z · comments (44)
Announcing Dialogues
Ben Pace (Benito) · 2023-10-07T02:57:39.005Z · comments (52)
[link] "Diamondoid bacteria" nanobots: deadly threat or dead-end? A nanotech investigation
titotal (lombertini) · 2023-09-29T14:01:15.453Z · comments (79)
o3
Zach Stein-Perlman · 2024-12-20T18:30:29.448Z · comments (155)
LessWrong Has Agree/Disagree Voting On All New Comment Threads
Ben Pace (Benito) · 2022-06-24T00:43:17.136Z · comments (217)
Matt Botvinick on the spontaneous emergence of learning algorithms
Adam Scholl (adam_scholl) · 2020-08-12T07:47:13.726Z · comments (87)
Emotionally Confronting a Probably-Doomed World: Against Motivation Via Dignity Points
TurnTrout · 2022-04-10T18:45:08.027Z · comments (7)
Curated conversations with brilliant rationalists
spencerg · 2021-05-28T14:23:30.631Z · comments (18)
A freshman year during the AI midgame: my approach to the next year
Buck · 2023-04-14T00:38:49.807Z · comments (15)
The Incredible Fentanyl-Detecting Machine
sarahconstantin · 2024-06-28T22:10:01.223Z · comments (26)
Sapir-Whorf for Rationalists
Duncan Sabien (Deactivated) (Duncan_Sabien) · 2023-01-25T07:58:46.794Z · comments (49)
Dyslucksia
Shoshannah Tekofsky (DarkSym) · 2024-05-09T19:21:33.874Z · comments (45)
[link] Will no one rid me of this turbulent pest?
Metacelsus · 2023-10-14T15:27:21.497Z · comments (23)
Potential Bottlenecks to Taking Over The World
johnswentworth · 2021-07-06T19:34:53.016Z · comments (22)
The Felt Sense: What, Why and How
Kaj_Sotala · 2020-10-05T15:57:50.545Z · comments (23)
New York Times, Please Do Not Threaten The Safety of Scott Alexander By Revealing His True Name
Zvi · 2020-06-23T12:20:00.788Z · comments (2)
Request: stop advancing AI capabilities
So8res · 2023-05-26T17:42:07.182Z · comments (24)
[link] ARC Evals new report: Evaluating Language-Model Agents on Realistic Autonomous Tasks
Beth Barnes (beth-barnes) · 2023-08-01T18:30:57.068Z · comments (12)
Omicron Post #4
Zvi · 2021-12-06T17:00:01.470Z · comments (66)
Apologizing is a Core Rationalist Skill
johnswentworth · 2024-01-02T17:47:35.950Z · comments (42)
OpenAI: Exodus
Zvi · 2024-05-20T13:10:03.543Z · comments (26)
AI: Practical Advice for the Worried
Zvi · 2023-03-01T12:30:00.703Z · comments (48)
Nate Soares' Life Advice
CatGoddess · 2022-08-23T02:46:43.369Z · comments (41)
The Commitment Races problem
Daniel Kokotajlo (daniel-kokotajlo) · 2019-08-23T01:58:19.669Z · comments (56)
Staying Split: Sabatini and Social Justice
Duncan Sabien (Deactivated) (Duncan_Sabien) · 2022-06-08T08:32:58.633Z · comments (28)
← previous page (newer posts) · next page (older posts) →