LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

← previous page (newer posts) · next page (older posts) →

I'm Writing a Book About Liberalism
Yoav Ravid · 2024-12-19T00:13:33.895Z · comments (6)
Dance Differentiation
jefftk (jkaufman) · 2024-11-15T02:30:07.694Z · comments (0)
How I saved 1 human life (in expectation) without overthinking it
Christopher King (christopher-king) · 2024-12-22T20:53:13.492Z · comments (0)
Secular Solstice Songbook Update
jefftk (jkaufman) · 2024-11-17T17:30:07.404Z · comments (2)
Don't fall for ontology pyramid schemes
Lorec · 2025-01-07T23:29:46.935Z · comments (4)
[question] What epsilon do you subtract from "certainty" in your own probability estimates?
Dagon · 2024-11-26T19:13:46.795Z · answers+comments (6)
[link] Disentangling Representations through Multi-task Learning
Bogdan Ionut Cirstea (bogdan-ionut-cirstea) · 2024-11-24T13:10:26.307Z · comments (1)
The first AGI may be a good engineer but bad strategist
Knight Lee (Max Lee) · 2024-12-09T06:34:54.082Z · comments (2)
[link] NeuroAI for AI safety: A Differential Path
nz · 2024-12-16T13:17:12.527Z · comments (0)
[question] How can we prevent AGI value drift?
Dakara (chess-ice) · 2024-11-20T18:19:24.375Z · answers+comments (5)
[link] I, Token
Ivan Vendrov (ivan-vendrov) · 2024-11-25T02:20:35.629Z · comments (2)
Is the mind a program?
EuanMcLean (euanmclean) · 2024-11-28T09:42:02.892Z · comments (60)
Backdoors have universal representations across large language models
Amirali Abdullah (amirali-abdullah) · 2024-12-06T22:56:33.519Z · comments (0)
Importing Bluesky Comments
jefftk (jkaufman) · 2024-11-28T03:50:06.635Z · comments (0)
The low Information Density of Eliezer Yudkowsky & LessWrong
Felix Olszewski (quick-maths) · 2024-12-30T19:43:59.355Z · comments (7)
AXRP Episode 38.0 - Zhijing Jin on LLMs, Causality, and Multi-Agent Systems
DanielFilan · 2024-11-14T07:00:06.977Z · comments (0)
Predicting AI Releases Through Side Channels
Reworr R (reworr-reworr) · 2025-01-07T19:06:41.584Z · comments (0)
A pragmatic story about where we get our priors
Fiora from Rosebloom · 2025-01-02T10:16:54.019Z · comments (6)
Crosspost: Developing the middle ground on polarized topics
juliawise · 2024-11-25T14:39:53.041Z · comments (16)
[question] Is AI alignment a purely functional property?
Roko · 2024-12-15T21:42:50.674Z · answers+comments (7)
Comparing the AirFanta 3Pro to the Coway AP-1512
jefftk (jkaufman) · 2024-12-16T01:40:01.522Z · comments (0)
Mid-Generation Self-Correction: A Simple Tool for Safer AI
MrThink (ViktorThink) · 2024-12-19T23:41:00.702Z · comments (0)
Paraddictions: unreasonably compelling behaviors and their uses
Michael Cohn (michael-cohn) · 2024-11-22T20:53:59.479Z · comments (0)
Robbin's Farm Sledding Route
jefftk (jkaufman) · 2024-12-21T22:10:01.175Z · comments (1)
[question] Why is Gemini telling the user to die?
Burny · 2024-11-18T01:44:12.583Z · answers+comments (1)
Low-effort review of "AI For Humanity"
Charlie Steiner · 2024-12-11T09:54:42.871Z · comments (0)
[link] The lying p value
kqr · 2024-11-12T06:12:59.934Z · comments (7)
[link] AISN #45: Center for AI Safety 2024 Year in Review
Corin Katzke (corin-katzke) · 2024-12-19T18:15:56.416Z · comments (0)
Registrations Open for 2024 NYC Secular Solstice & Megameetup
Joe Rogero · 2024-11-12T17:50:10.827Z · comments (0)
[link] Markov's Inequality Explained
criticalpoints · 2025-01-08T00:31:55.125Z · comments (1)
A good way to build many air filters on the cheap
winstonBosan · 2024-12-08T01:47:58.236Z · comments (5)
Playing with Otamatones
jefftk (jkaufman) · 2025-01-02T19:50:01.781Z · comments (0)
(My) self-referential reason to believe in free will
jacek (jacek-karwowski) · 2025-01-06T23:35:02.809Z · comments (5)
2. Skim the Manual: Intelligent Voluntary Cooperation
Allison Duettmann (allison-duettmann) · 2025-01-02T19:02:06.864Z · comments (0)
[link] Linkpost: Rat Traps by Sheon Han in Asterisk Mag
Chris_Leong · 2024-12-03T03:22:45.424Z · comments (5)
AXRP Episode 38.1 - Alan Chan on Agent Infrastructure
DanielFilan · 2024-11-16T23:30:09.098Z · comments (0)
No Internally-Crispy Mac and Cheese
jefftk (jkaufman) · 2024-12-20T03:20:01.798Z · comments (5)
How Much to Give is a Pragmatic Question
jefftk (jkaufman) · 2024-12-24T04:20:01.480Z · comments (1)
Commenting Patterns by Platform
jefftk (jkaufman) · 2024-12-01T11:50:06.932Z · comments (0)
Simple Steganographic Computation Eval - gpt-4o and gemini-exp-1206 can't solve it yet
Filip Sondej · 2024-12-19T15:47:05.512Z · comments (2)
Reflections on ML4Good
james__p · 2024-11-25T02:40:32.586Z · comments (0)
[question] Who are the worthwhile non-European pre-Industrial thinkers?
Lorec · 2024-12-03T01:45:31.445Z · answers+comments (4)
Exploring the petertodd / Leilan duality in GPT-2 and GPT-J
mwatkins · 2024-12-23T13:17:53.755Z · comments (1)
[link] My AI timelines
xpostah · 2024-12-22T21:06:41.722Z · comments (2)
Preliminary Thoughts on Flirting Theory
la .alis. (Diatom) · 2024-12-24T07:37:47.045Z · comments (6)
Approaches to Group Singing
jefftk (jkaufman) · 2025-01-01T12:50:01.877Z · comments (1)
Sideloading: creating a model of a person via LLM with very large prompt
avturchin · 2024-11-22T16:41:28.293Z · comments (4)
[link] Forecast With GiveWell
ChristianWilliams · 2024-12-11T17:52:32.293Z · comments (0)
[question] What is the most impressive game LLMs can play well?
Cole Wyeth (Amyr) · 2025-01-08T19:38:18.530Z · answers+comments (1)
Reward Bases: A simple mechanism for adaptive acquisition of multiple reward type
Bogdan Ionut Cirstea (bogdan-ionut-cirstea) · 2024-11-23T12:45:01.067Z · comments (0)
← previous page (newer posts) · next page (older posts) →