LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

next page (older posts) →

Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs
Jan Betley (jan-betley) · 2025-02-25T17:39:31.059Z · comments (18)
[link] what an efficient market feels from inside
DMMF · 2025-02-25T02:38:40.129Z · comments (8)
Economics Roundup #5
Zvi · 2025-02-25T13:40:07.086Z · comments (6)
[link] Upcoming Protest for AI Safety
Matt Vincent (matthew-milone) · 2025-02-25T03:04:03.153Z · comments (0)
Revisiting Conway's Law
annebrandes (annebrandes1@gmail.com) · 2025-02-25T08:33:52.421Z · comments (0)
Three Levels for Large Language Model Cognition
Eleni Angelou (ea-1) · 2025-02-25T23:14:00.306Z · comments (0)
[link] We Can Build Compassionate AI
Gordon Seidoh Worley (gworley) · 2025-02-25T16:37:06.160Z · comments (1)
Technical comparison of Deepseek, Novasky, S1, Helix, P0
Juliezhanggg · 2025-02-25T04:20:40.413Z · comments (0)
Levels of analysis for thinking about agency
Cole Wyeth (Amyr) · 2025-02-26T04:24:24.583Z · comments (0)
[question] Intellectual lifehacks repo
Antoine de Scorraille (Etoile de Scauchy) · 2025-02-25T16:32:09.814Z · answers+comments (4)
[link] The Stag Hunt—cultivating cooperation to reap rewards
James Stephen Brown (james-brown) · 2025-02-25T23:45:07.472Z · comments (0)
[link] [Crosspost] Strategic wealth accumulation under transformative AI expectations
arden446 · 2025-02-25T21:50:11.458Z · comments (0)
Making alignment a law of the universe
juggins · 2025-02-25T10:44:11.632Z · comments (1)
Demystifying the Pinocchio Paradox
Novak Zukowski (Zantarus) · 2025-02-25T06:16:57.219Z · comments (0)
next page (older posts) →