LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

← previous page (newer posts) · next page (older posts) →

One Minute Every Moment
abramdemski · 2023-09-01T20:23:56.391Z · comments (23)
Category Theory Without The Baggage
johnswentworth · 2020-02-03T20:03:13.586Z · comments (49)
The 99% principle for personal problems
Kaj_Sotala · 2023-10-02T08:20:07.379Z · comments (20)
Trust develops gradually via making bids and setting boundaries
Richard_Ngo (ricraz) · 2023-05-19T22:16:38.483Z · comments (12)
Moore's Law, AI, and the pace of progress
Veedrac · 2021-12-11T03:02:24.558Z · comments (38)
Baking is Not a Ritual
Sisi Cheng (sisi-cheng) · 2020-05-25T18:08:24.836Z · comments (28)
Goodhart's Law in Reinforcement Learning
jacek (jacek-karwowski) · 2023-10-16T00:54:11.669Z · comments (22)
The case for becoming a black-box investigator of language models
Buck · 2022-05-06T14:35:24.630Z · comments (20)
Think carefully before calling RL policies "agents"
TurnTrout · 2023-06-02T03:46:07.467Z · comments (35)
The Curse Of The Counterfactual
pjeby · 2019-11-01T18:34:41.186Z · comments (34)
High Status Eschews Quantification of Performance
niplav · 2023-03-19T22:14:16.523Z · comments (36)
The Wicked Problem Experience
HoldenKarnofsky · 2022-03-02T17:50:18.621Z · comments (6)
Re-Examining LayerNorm
Eric Winsor (EricWinsor) · 2022-12-01T22:20:23.542Z · comments (12)
An even deeper atheism
Joe Carlsmith (joekc) · 2024-01-11T17:28:31.843Z · comments (47)
Nuclear war is unlikely to cause human extinction
Jeffrey Ladish (jeff-ladish) · 2020-11-07T05:42:24.380Z · comments (47)
Advice for newly busy people
Severin T. Seehrich (sts) · 2023-05-11T16:46:15.313Z · comments (2)
[link] Fiber arts, mysterious dodecahedrons, and waiting on “Eureka!”
eukaryote · 2022-08-04T20:37:59.388Z · comments (15)
The Pointers Problem: Human Values Are A Function Of Humans' Latent Variables
johnswentworth · 2020-11-18T17:47:40.929Z · comments (49)
On infinite ethics
Joe Carlsmith (joekc) · 2022-01-31T07:04:44.244Z · comments (70)
[link] Bayesian Injustice
Kevin Dorst · 2023-12-14T15:44:08.664Z · comments (10)
Recommendation: Bug Bounties and Responsible Disclosure for Advanced ML Systems
Vaniver · 2023-02-17T20:11:39.255Z · comments (11)
Updatelessness doesn't solve most problems
Martín Soto (martinsq) · 2024-02-08T17:30:11.266Z · comments (43)
Invulnerable Incomplete Preferences: A Formal Statement
Sami Petersen (sami-petersen) · 2023-08-30T21:59:36.186Z · comments (32)
Cosmopolitan values don't come free
So8res · 2023-05-31T15:58:16.974Z · comments (82)
[link] Did ChatGPT just gaslight me?
ThomasW (ThomasWoodside) · 2022-12-01T05:41:46.560Z · comments (45)
There are (probably) no superhuman Go AIs: strong human players beat the strongest AIs
Taran · 2023-02-19T12:25:52.212Z · comments (33)
In Defense of Chatbot Romance
Kaj_Sotala · 2023-02-11T14:30:05.696Z · comments (52)
Community Notes by X
NicholasKees (nick_kees) · 2024-03-18T17:13:33.195Z · comments (15)
Transcript: "You Should Read HPMOR"
TurnTrout · 2021-11-02T18:20:53.161Z · comments (12)
Selection Theorems: A Program For Understanding Agents
johnswentworth · 2021-09-28T05:03:19.316Z · comments (28)
Movable Housing for Scalable Cities
Eliezer Yudkowsky (Eliezer_Yudkowsky) · 2020-05-15T21:21:05.395Z · comments (28)
Patient Observation
LoganStrohl (BrienneYudkowsky) · 2022-02-23T19:31:45.062Z · comments (4)
$250 prize for checking Jake Cannell's Brain Efficiency
Alexander Gietelink Oldenziel (alexander-gietelink-oldenziel) · 2023-04-26T16:21:06.035Z · comments (170)
How do we become confident in the safety of a machine learning system?
evhub · 2021-11-08T22:49:41.080Z · comments (5)
Firming Up Not-Lying Around Its Edge-Cases Is Less Broadly Useful Than One Might Initially Think
Zack_M_Davis · 2019-12-27T05:09:22.546Z · comments (43)
Will Capabilities Generalise More?
Ramana Kumar (ramana-kumar) · 2022-06-29T17:12:56.255Z · comments (39)
High schoolers can apply to the Atlas Fellowship: $50k scholarship + summer program
sydney (sydney-von-arx) · 2022-04-03T00:53:05.397Z · comments (18)
Soft takeoff can still lead to decisive strategic advantage
Daniel Kokotajlo (daniel-kokotajlo) · 2019-08-23T16:39:31.317Z · comments (47)
[link] The 300-year journey to the covid vaccine
jasoncrawford · 2020-11-09T23:06:45.790Z · comments (9)
[link] Report on Frontier Model Training
YafahEdelman (yafah-edelman-1) · 2023-08-30T20:02:46.317Z · comments (21)
Book review: The Checklist Manifesto
Swimmer963 (Miranda Dixon-Luinenburg) (Swimmer963) · 2021-09-17T23:09:09.590Z · comments (13)
[link] Who regulates the regulators? We need to go beyond the review-and-approval paradigm
jasoncrawford · 2023-05-04T22:11:17.465Z · comments (29)
Greyed Out Options
ozymandias · 2022-04-04T20:43:13.566Z · comments (12)
How LLMs are and are not myopic
janus · 2023-07-25T02:19:44.949Z · comments (14)
A Shutdown Problem Proposal
johnswentworth · 2024-01-21T18:12:48.664Z · comments (61)
Warning Shots Probably Wouldn't Change The Picture Much
So8res · 2022-10-06T05:15:39.391Z · comments (42)
Apocalypse insurance, and the hardline libertarian take on AI risk
So8res · 2023-11-28T02:09:52.400Z · comments (36)
Choice Writings of Dominic Cummings
Connor_Flexman · 2021-10-13T02:41:44.291Z · comments (75)
Why I'm joining Anthropic
evhub · 2023-01-05T01:12:13.822Z · comments (4)
There are no coherence theorems
Dan H (dan-hendrycks) · 2023-02-20T21:25:48.478Z · comments (114)
← previous page (newer posts) · next page (older posts) →