LessWrong 2.0 Reader

How will we update about scheming?
ryan_greenblatt · 2025-01-06T20:21:52.281Z · comments (4)
OpenAI #10: Reflections
Zvi · 2025-01-07T17:00:07.348Z · comments (1)
What Indicators Should We Watch to Disambiguate AGI Timelines?
snewman · 2025-01-06T19:57:43.398Z · comments (26)
[Fiction] [Comic] Effective Altruism and Rationality meet at a Secular Solstice afterparty
tandem · 2025-01-07T19:11:21.238Z · comments (1)
[link] "We know how to build AGI" - Sam Altman
Nikola Jurkovic (nikolaisalreadytaken) · 2025-01-06T02:05:05.134Z · comments (5)
[link] Testing for Scheming with Model Deletion
Guive (GAA) · 2025-01-07T01:54:13.550Z · comments (12)
Building Big Science from the Bottom-Up: A Fractal Approach to AI Safety
Lauren Greenspan (LaurenGreenspan) · 2025-01-07T03:08:51.447Z · comments (2)
Role embeddings: making authorship more salient to LLMs
Nina Panickssery (NinaR) · 2025-01-07T20:13:16.677Z · comments (0)
Estimating the benefits of a new flu drug (BXM)
DirectedEvolution (AllAmericanBreakfast) · 2025-01-06T04:31:16.837Z · comments (2)
Alternative Cancer Care As Biohacking & Book Review: Surviving "Terminal" Cancer
DenizT · 2025-01-06T07:43:52.773Z · comments (4)
Childhood and Education #8: Dealing with the Internet
Zvi · 2025-01-06T14:00:09.604Z · comments (4)
D&D.Sci Dungeonbuilding: the Dungeon Tournament Evaluation & Ruleset
aphyer · 2025-01-07T05:02:25.929Z · comments (3)
Stream Entry
lsusr · 2025-01-07T23:56:13.530Z · comments (0)
[link] You should delay engineering-heavy research in light of R&D automation
Daniel Paleka · 2025-01-07T02:11:11.501Z · comments (3)
Disagreement on AGI Suggests It’s Near
tangerine · 2025-01-07T20:42:43.456Z · comments (1)
Measuring Nonlinear Feature Interactions in Sparse Crosscoders [Project Proposal]
Jason Gross (jason-gross) · 2025-01-06T04:22:12.633Z · comments (0)
[question] Meal Replacements in 2025?
alkjash · 2025-01-06T15:37:25.041Z · answers+comments (9)
Really radical empathy
MichaelStJules · 2025-01-06T17:46:31.269Z · comments (0)
Definition of alignment science I like
quetzal_rainbow · 2025-01-06T20:40:38.187Z · comments (0)
Will bird flu be the next Covid? "Little chance" says my dashboard.
Nathan Young · 2025-01-07T20:10:50.080Z · comments (0)
[link] AI safety content you could create
Adam Jones (domdomegg) · 2025-01-06T15:35:56.167Z · comments (0)
Rebuttals for ~all criticisms of AIXI
Cole Wyeth (Amyr) · 2025-01-07T17:41:10.557Z · comments (5)
Incredibow
jefftk (jkaufman) · 2025-01-07T03:30:02.197Z · comments (2)
Turning up the Heat on Deceptively-Misaligned AI
J Bostock (Jemist) · 2025-01-07T00:13:28.191Z · comments (9)
(My) self-referential reason to believe in free will
jacek (jacek-karwowski) · 2025-01-06T23:35:02.809Z · comments (5)
A Principled Cartoon Guide to NVC
plex (ete) · 2025-01-07T21:01:07.904Z · comments (1)
Latent Adversarial Training (LAT) Improves the Representation of Refusal
alexandraabbas · 2025-01-06T10:24:53.419Z · comments (4)
Guilt, Shame, and Depravity
Benquo · 2025-01-07T01:16:00.273Z · comments (2)
Don't fall for ontology pyramid schemes
Lorec · 2025-01-07T23:29:46.935Z · comments (1)
[link] On Eating the Sun
jessicata (jessica.liu.taylor) · 2025-01-08T04:57:20.457Z · comments (1)
[question] Is my distinctiveness evidence for being in a simulation?
AynonymousPrsn123 · 2025-01-06T21:27:13.280Z · answers+comments (40)
Tips On Empirical Research Slides
James Chua (james-chua) · 2025-01-08T05:06:44.942Z · comments (0)
Meditation insights as phase shifts in your self-model
Jonas Hallgren · 2025-01-07T10:09:35.854Z · comments (1)
Speedrunning Rationality: Day II
aproteinengine · 2025-01-06T03:59:25.488Z · comments (3)
Generating Cognateful Sentences with Large Language Models
vkethana (vijay-k) · 2025-01-06T18:40:09.564Z · comments (0)
[link] Markov's Inequality Explained
criticalpoints · 2025-01-08T00:31:55.125Z · comments (0)
Book review: Range by David Epstein
PatrickDFarley · 2025-01-08T04:27:26.391Z · comments (0)
Predicting AI Releases Through Side Channels
Reworr R (reworr-reworr) · 2025-01-07T19:06:41.584Z · comments (0)
[link] Bridgewater x Metaculus Forecasting Contest Goes Global — Feb 3, $25k, Opportunities
ChristianWilliams · 2025-01-07T21:40:30.899Z · comments (0)
Can we have Epiphanies and Eureka moments more frequently?
CstineSublime · 2025-01-08T02:20:26.897Z · comments (0)
[link] Independent research article analyzing consistent self-reports of experience in ChatGPT and Claude
rife (edgar-muniz) · 2025-01-06T17:34:01.505Z · comments (8)
Other implications of radical empathy
MichaelStJules · 2025-01-07T16:10:16.755Z · comments (0)
[link] Job Opening: SWE to help improve grant-making software
Ethan Ashkie (ethan-ashkie-1) · 2025-01-08T00:54:22.820Z · comments (0)
[link] My Experience With A Magnet Implant
Vale · 2025-01-07T03:01:21.410Z · comments (2)
Actualism, asymmetry and extinction
MichaelStJules · 2025-01-07T16:02:31.610Z · comments (0)
Alleviating shrimp pain is immoral.
G Wood (geoffrey-wood) · 2025-01-07T07:28:49.432Z · comments (0)