LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

next page (older posts) →

Activation space interpretability may be doomed
bilalchughtai (beelal) · 2025-01-08T12:49:38.421Z · comments (15)
[link] Aristocracy and Hostage Capital
Arjun Panickssery (arjun-panickssery) · 2025-01-08T19:38:47.104Z · comments (3)
Tips On Empirical Research Slides
James Chua (james-chua) · 2025-01-08T05:06:44.942Z · comments (3)
[link] On Eating the Sun
jessicata (jessica.liu.taylor) · 2025-01-08T04:57:20.457Z · comments (34)
[link] Discursive Warfare and Faction Formation
Benquo · 2025-01-09T16:47:31.824Z · comments (0)
Implications of the AI Security Gap
Dan Braun (dan-braun-1) · 2025-01-08T08:31:36.789Z · comments (0)
AI Safety as a YC Startup
Lukas Petersson (lukas-petersson-1) · 2025-01-08T10:46:29.042Z · comments (4)
XX by Rian Hughes: Pretentious Bullshit
Yair Halberstadt (yair-halberstadt) · 2025-01-08T13:02:52.438Z · comments (5)
Last week of the Discussion Phase
Raemon · 2025-01-09T19:26:59.136Z · comments (0)
MATS mentor selection
DanielFilan · 2025-01-10T03:12:52.141Z · comments (1)
[link] Job Opening: SWE to help improve grant-making software
Ethan Ashkie (ethan-ashkie-1) · 2025-01-08T00:54:22.820Z · comments (1)
The absolute basics of representation theory of finite groups
Dmitry Vaintrob (dmitry-vaintrob) · 2025-01-08T09:47:13.136Z · comments (0)
[question] What is the most impressive game LLMs can play well?
Cole Wyeth (Amyr) · 2025-01-08T19:38:18.530Z · answers+comments (3)
Can we rescue Effective Altruism?
Elizabeth (pktechgirl) · 2025-01-09T16:40:02.405Z · comments (0)
AI #98: World Ends With Six Word Story
Zvi · 2025-01-09T16:30:07.341Z · comments (1)
[link] NAO Updates, January 2025
jefftk (jkaufman) · 2025-01-10T03:37:36.698Z · comments (0)
[link] Markov's Inequality Explained
criticalpoints · 2025-01-08T00:31:55.125Z · comments (2)
An exhaustive list of cosmic threats
Jordan Stone (jordan-stone) · 2025-01-09T19:59:08.368Z · comments (0)
Book review: Range by David Epstein
PatrickDFarley · 2025-01-08T04:27:26.391Z · comments (0)
AI Safety Outreach Seminar & Social (online)
Linda Linsefors · 2025-01-08T13:25:23.192Z · comments (0)
PIBBSS Fellowship 2025: Bounties and Cooperative AI Track Announcement
DusanDNesic · 2025-01-09T14:23:47.027Z · comments (0)
Dmitry's Koan
Dmitry Vaintrob (dmitry-vaintrob) · 2025-01-10T04:27:30.346Z · comments (0)
[question] How can humanity survive a multipolar AGI scenario?
Leonard Holloway (literally-best) · 2025-01-09T20:17:40.143Z · answers+comments (4)
[link] Is AI Hitting a Wall or Moving Faster Than Ever?
garrison · 2025-01-09T22:18:51.497Z · comments (1)
Ann Altman has filed a lawsuit in US federal court alleging that she was sexually abused by Sam Altman
quanticle · 2025-01-08T14:59:24.140Z · comments (3)
You are too dumb to understand insurance
Lorec · 2025-01-09T23:33:53.778Z · comments (6)
[link] AI Forecasting Benchmark: Congratulations to Q4 Winners + Q1 Practice Questions Open
ChristianWilliams · 2025-01-10T03:02:05.856Z · comments (0)
[question] How do you decide to phrase predictions you ask of others? (and how do you make your own?)
CstineSublime · 2025-01-10T02:44:26.737Z · answers+comments (0)
Thoughts on the In-Context Scheming AI Experiment
ExCeph · 2025-01-09T02:19:09.558Z · comments (0)
Many Worlds and the Problems of Evil
Jonah Wilberg (jrwilb@googlemail.com) · 2025-01-09T16:10:46.752Z · comments (1)
[link] What are polysemantic neurons?
Vishakha (vishakha-agrawal) · 2025-01-08T07:35:42.758Z · comments (0)
A Systematic Approach to AI Risk Analysis Through Cognitive Capabilities
Tom DAVID (tom-david) · 2025-01-09T00:18:04.608Z · comments (0)
[link] Expevolu, Part II: Buying land to create a country
Fernando · 2025-01-09T21:11:11.780Z · comments (0)
Can we have Epiphanies and Eureka moments more frequently?
CstineSublime · 2025-01-08T02:20:26.897Z · comments (0)
Governance Course - Week 1 Reflections
la .alis. (Diatom) · 2025-01-09T04:48:27.502Z · comments (0)
Gothenburg LW / ACX meetup
Stefan (stefan-1) · 2025-01-08T21:39:18.309Z · comments (0)
Activation Magnitudes Matter On Their Own: Insights from Language Model Distributional Analysis
Matt Levinson · 2025-01-10T06:53:02.228Z · comments (0)
The Type of Writing that Pushes Women Away
Dahlia (sdjfhkj-dkjfks) · 2025-01-08T18:54:52.070Z · comments (2)
The "Everyone Can't Be Wrong" Prior causes AI risk denial but helped prehistoric people
Knight Lee (Max Lee) · 2025-01-09T05:54:43.395Z · comments (0)
Ought We to Be Doing More Than We Are?
Jacob1 (JacobBowden) · 2025-01-09T18:12:32.149Z · comments (4)
Deleted
Yanling Guo (yanling-guo) · 2025-01-10T01:36:47.950Z · comments (0)
next page (older posts) →