LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

← previous page (newer posts) · next page (older posts) →

[link] Consider giving money to people, not projects or organizations
Nina Rimsky (NinaR) · 2023-07-02T14:33:29.160Z · comments (30)
[link] Elon Musk announces xAI
Jan_Kulveit · 2023-07-13T09:01:01.278Z · comments (35)
Thoughts on “Process-Based Supervision”
Steven Byrnes (steve2152) · 2023-07-17T14:08:57.219Z · comments (4)
A reformulation of Finite Factored Sets
Matthias G. Mayer (matthias-georg-mayer) · 2023-07-24T13:02:25.382Z · comments (1)
Announcing Manifund Regrants
Austin Chen (austin-chen) · 2023-07-05T19:42:08.978Z · comments (8)
"Justice, Cherryl."
Zack_M_Davis · 2023-07-23T16:16:40.835Z · comments (20)
[link] Existential Risk Persuasion Tournament
PeterMcCluskey · 2023-07-17T18:04:02.794Z · comments (1)
A brief history of computers
Adam Zerner (adamzerner) · 2023-07-19T02:59:19.679Z · comments (18)
[link] Why You Should Never Update Your Beliefs
Arjun Panickssery (arjun-panickssery) · 2023-07-29T00:27:01.899Z · comments (17)
Drawn Out: a story
Richard_Ngo (ricraz) · 2023-07-11T00:08:09.286Z · comments (2)
Really Strong Features Found in Residual Stream
Logan Riggs (elriggs) · 2023-07-08T19:40:14.601Z · comments (6)
[link] Predictive history classes
dkl9 · 2023-07-17T20:48:31.363Z · comments (17)
Announcement: AI Narrations Available for All New LessWrong Posts
Solenoid_Entity · 2023-07-20T22:17:33.454Z · comments (28)
Six (and a half) intuitions for SVD
CallumMcDougall (TheMcDouglas) · 2023-07-04T19:23:19.688Z · comments (1)
[link] Alpha
Erich_Grunewald · 2023-07-01T16:05:55.940Z · comments (2)
Mech Interp Puzzle 1: Suspiciously Similar Embeddings in GPT-Neo
Neel Nanda (neel-nanda-1) · 2023-07-16T22:02:15.410Z · comments (15)
An Overview of the AI Safety Funding Situation
Stephen McAleese (stephen-mcaleese) · 2023-07-12T14:54:36.732Z · comments (3)
[link] News : Biden-⁠Harris Administration Secures Voluntary Commitments from Leading Artificial Intelligence Companies to Manage the Risks Posed by AI
Jonathan Claybrough (lelapin) · 2023-07-21T18:00:57.016Z · comments (9)
Open-minded updatelessness
Nicolas Macé (NicolasMace) · 2023-07-10T11:08:22.207Z · comments (21)
Visible loss landscape basins don't correspond to distinct algorithms
Mikhail Samin (mikhail-samin) · 2023-07-28T16:19:05.279Z · comments (13)
Meta-rationality and frames
Richard_Ngo (ricraz) · 2023-07-03T00:33:20.355Z · comments (2)
[link] Why no Roman Industrial Revolution?
jasoncrawford · 2023-07-26T19:34:41.682Z · comments (30)
[link] SSA rejects anthropic shadow, too
jessicata (jessica.liu.taylor) · 2023-07-27T17:25:17.728Z · comments (38)
Pulling the Rope Sideways: Empirical Test Results
Daniel Kokotajlo (daniel-kokotajlo) · 2023-07-27T22:18:01.072Z · comments (18)
[link] Dominant Assurance Contract Experiment #2: Berkeley House Dinners
Arjun Panickssery (arjun-panickssery) · 2023-07-05T00:13:15.255Z · comments (8)
Micro Habits that Improve One’s Day
silentbob · 2023-07-01T10:53:57.280Z · comments (9)
[question] I'm consistently overwhelmed by basic obligations. Are there any paradigm shifts or other rationality-based tips that would be helpful?
Benjamin Hendricks (benjamin-hendricks) · 2023-07-21T21:10:21.543Z · answers+comments (37)
AI #19: Hofstadter, Sutskever, Leike
Zvi · 2023-07-06T12:50:05.037Z · comments (16)
[question] Which rationality posts are begging for further practical development?
LoganStrohl (BrienneYudkowsky) · 2023-07-23T22:22:04.389Z · answers+comments (17)
AI #20: Code Interpreter and Claude 2.0 for Everyone
Zvi · 2023-07-13T14:00:08.266Z · comments (9)
[question] The literature on aluminum adjuvants is very suspicious. Small IQ tax is plausible - can any experts help me estimate it?
mikes · 2023-07-04T09:33:51.849Z · answers+comments (39)
[link] Forum Karma: view stats and find highly-rated comments for any LW user
Max H (Maxc) · 2023-07-01T15:36:28.881Z · comments (16)
(tentatively) Found 600+ Monosemantic Features in a Small LM Using Sparse Autoencoders
Logan Riggs (elriggs) · 2023-07-05T16:49:43.822Z · comments (1)
How to make real-money prediction markets on arbitrary topics (Outdated)
yutaka · 2023-07-30T02:11:47.050Z · comments (13)
Agency begets agency
Richard_Ngo (ricraz) · 2023-07-06T13:08:44.318Z · comments (1)
An upcoming US Supreme Court case may impede AI governance efforts
NickGabs · 2023-07-16T23:51:26.073Z · comments (17)
The virtue of determination
Richard_Ngo (ricraz) · 2023-07-10T05:11:00.412Z · comments (4)
Training Process Transparency through Gradient Interpretability: Early experiments on toy language models
robertzk (Technoguyrob) · 2023-07-21T14:52:09.311Z · comments (1)
[link] A review of Principia Qualia
jessicata (jessica.liu.taylor) · 2023-07-12T18:38:52.283Z · comments (6)
AXRP Episode 24 - Superalignment with Jan Leike
DanielFilan · 2023-07-27T04:00:02.106Z · comments (3)
Alignment Megaprojects: You're Not Even Trying to Have Ideas
NicholasKross · 2023-07-12T23:39:54.392Z · comments (30)
[link] Partial Transcript of Recent Senate Hearing Discussing AI X-Risk
Daniel_Eth · 2023-07-27T09:16:01.168Z · comments (0)
Aging and the geroscience hypothesis
DirectedEvolution (AllAmericanBreakfast) · 2023-07-12T07:16:04.516Z · comments (14)
AutoInterpretation Finds Sparse Coding Beats Alternatives
Hoagy · 2023-07-17T01:41:17.397Z · comments (1)
Optimized for Something other than Winning or: How Cricket Resists Moloch and Goodhart's Law
A.H. (AlfredHarwood) · 2023-07-05T12:33:07.166Z · comments (25)
Internal independent review for language model agent alignment
Seth Herd · 2023-07-07T06:54:11.552Z · comments (26)
Thoughts on Loss Landscapes and why Deep Learning works
beren · 2023-07-25T16:41:39.562Z · comments (4)
Boundary Placement Rebellion
tailcalled · 2023-07-20T17:40:00.190Z · comments (21)
Tiny Mech Interp Projects: Emergent Positional Embeddings of Words
Neel Nanda (neel-nanda-1) · 2023-07-18T21:24:41.990Z · comments (1)
Activation adding experiments with llama-7b
Nina Rimsky (NinaR) · 2023-07-16T04:17:58.529Z · comments (1)
← previous page (newer posts) · next page (older posts) →