LessWrong 2.0 Reader

AGI Ruin: A List of Lethalities
Eliezer Yudkowsky (Eliezer_Yudkowsky) · 2022-06-05T22:05:52.224Z · comments (704)
Where I agree and disagree with Eliezer
paulfchristiano · 2022-06-19T19:15:55.698Z · comments (223)
SolidGoldMagikarp (plus, prompt generation)
Jessica Rumbelow (jessica-cooper) · 2023-02-05T22:02:35.854Z · comments (206)
What an actually pessimistic containment strategy looks like
lc · 2022-04-05T00:19:50.212Z · comments (138)
The Waluigi Effect (mega-post)
Cleo Nardo (strawberry calm) · 2023-03-03T03:22:08.619Z · comments (188)
[link] Simulators
janus · 2022-09-02T12:45:33.723Z · comments (168)
(The) Lightcone is nothing without its people: LW + Lighthaven's big fundraiser
habryka (habryka4) · 2024-11-30T02:55:16.077Z · comments (263)
Rationalism before the Sequences
Eric Raymond (eric-raymond) · 2021-03-30T14:04:15.254Z · comments (83)
Making Vaccine
johnswentworth · 2021-02-03T20:24:18.756Z · comments (249)
LessWrong's (first) album: I Have Been A Good Bing
habryka (habryka4) · 2024-04-01T07:33:45.242Z · comments (179)
[link] Pain is not the unit of Effort
alkjash · 2020-11-24T20:00:19.584Z · comments (90)
Let’s think about slowing down AI
KatjaGrace · 2022-12-22T17:40:04.787Z · comments (182)
What 2026 looks like
Daniel Kokotajlo (daniel-kokotajlo) · 2021-08-06T16:14:49.772Z · comments (156)
OpenAI Email Archives (from Musk v. Altman and OpenAI blog)
habryka (habryka4) · 2024-11-16T06:38:03.937Z · comments (80)
The Talk: a brief explanation of sexual dimorphism
Malmesbury (Elmer of Malmesbury) · 2023-09-18T16:23:56.073Z · comments (75)
The Redaction Machine
Ben (ben-lang) · 2022-09-20T22:03:15.309Z · comments (48)
[link] How much do you believe your results?
Eric Neyman (UnexpectedValues) · 2023-05-06T20:31:31.277Z · comments (18)
[link] Luck based medicine: my resentful story of becoming a medical miracle
Elizabeth (pktechgirl) · 2022-10-16T17:40:03.702Z · comments (121)
Alignment Faking in Large Language Models
ryan_greenblatt · 2024-12-18T17:19:06.665Z · comments (74)
Losing the root for the tree
Adam Zerner (adamzerner) · 2022-09-20T04:53:53.435Z · comments (31)
How To Write Quickly While Maintaining Epistemic Rigor
johnswentworth · 2021-08-28T17:52:21.692Z · comments (38)
100 Tips for a Better Life
Ideopunk · 2020-12-22T14:30:12.756Z · comments (130)
[link] The ants and the grasshopper
Richard_Ngo (ricraz) · 2023-06-04T22:00:04.577Z · comments (40)
Significantly Enhancing Adult Intelligence With Gene Editing May Be Possible
GeneSmith · 2023-12-12T18:14:51.438Z · comments (205)
I would have shit in that alley, too
Declan Molony (declan-molony) · 2024-06-18T04:41:06.545Z · comments (135)
Counter-theses on Sleep
Natália (Natália Mendonça) · 2022-03-21T23:21:07.943Z · comments (135)
It’s Probably Not Lithium
Natália (Natália Mendonça) · 2022-06-28T21:24:10.246Z · comments (187)
Focus on the places where you feel shocked everyone's dropping the ball
So8res · 2023-02-02T00:27:55.687Z · comments (63)
Steering GPT-2-XL by adding an activation vector
TurnTrout · 2023-05-13T18:42:41.321Z · comments (98)
[link] Douglas Hofstadter changes his mind on Deep Learning & AI risk (June 2023)?
gwern · 2023-07-03T00:48:47.131Z · comments (54)
chinchilla's wild implications
nostalgebraist · 2022-07-31T01:18:28.254Z · comments (128)
Bets, Bonds, and Kindergarteners
jefftk (jkaufman) · 2021-01-03T21:20:03.563Z · comments (35)
[link] Things I Learned by Spending Five Thousand Hours In Non-EA Charities
jenn (pixx) · 2023-06-01T20:48:03.940Z · comments (35)
(My understanding of) What Everyone in Technical Alignment is Doing and Why
Thomas Larsen (thomas-larsen) · 2022-08-29T01:23:58.073Z · comments (90)
Transformers Represent Belief State Geometry in their Residual Stream
Adam Shai (adam-shai) · 2024-04-16T21:16:11.377Z · comments (100)
The Best Tacit Knowledge Videos on Every Subject
Parker Conley (parker-conley) · 2024-03-31T17:14:31.199Z · comments (156)
Failures in Kindness
silentbob · 2024-03-26T21:30:11.052Z · comments (60)
GPTs are Predictors, not Imitators
Eliezer Yudkowsky (Eliezer_Yudkowsky) · 2023-04-08T19:59:13.601Z · comments (99)
[link] It Looks Like You're Trying To Take Over The World
gwern · 2022-03-09T16:35:35.326Z · comments (120)
You Are Not Measuring What You Think You Are Measuring
johnswentworth · 2022-09-20T20:04:22.899Z · comments (44)
Bing Chat is blatantly, aggressively misaligned
evhub · 2023-02-15T05:29:45.262Z · comments (181)
DeepMind alignment team opinions on AGI ruin arguments
Vika · 2022-08-12T21:06:40.582Z · comments (37)
Reliable Sources: The Story of David Gerard
TracingWoodgrains (tracingwoodgrains) · 2024-07-10T19:50:21.191Z · comments (54)
How I got 4.2M YouTube views without making a single video
Closed Limelike Curves · 2024-09-03T03:52:33.025Z · comments (36)
[link] Reflections on six months of fatherhood
jasoncrawford · 2022-01-31T05:28:09.154Z · comments (24)
The hostile telepaths problem
Valentine · 2024-10-27T15:26:53.610Z · comments (89)
Lies Told To Children
Eliezer Yudkowsky (Eliezer_Yudkowsky) · 2022-04-14T11:25:10.282Z · comments (94)
Reward is not the optimization target
TurnTrout · 2022-07-25T00:03:18.307Z · comments (123)
Anti-Aging: State of the Art
JackH · 2020-12-31T19:07:03.430Z · comments (176)
[link] A Mechanistic Interpretability Analysis of Grokking
Neel Nanda (neel-nanda-1) · 2022-08-15T02:41:36.245Z · comments (47)