LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

next page (older posts) →

[link] Open Philanthropy Technical AI Safety RFP - $40M Available Across 21 Research Areas
jake_mendel · 2025-02-06T18:58:53.076Z · comments (0)
[link] Detecting Strategic Deception Using Linear Probes
Nicholas Goldowsky-Dill (nicholas-goldowsky-dill) · 2025-02-06T15:46:53.024Z · comments (0)
Voting Results for the 2023 Review
Raemon · 2025-02-06T08:00:37.461Z · comments (0)
MATS Applications + Research Directions I'm Currently Excited About
Neel Nanda (neel-nanda-1) · 2025-02-06T11:03:40.093Z · comments (0)
Chicanery: No
Screwtape · 2025-02-06T05:42:45.095Z · comments (3)
AI #102: Made in America
Zvi · 2025-02-06T14:20:06.733Z · comments (3)
Don't go bankrupt, don't go rogue
Nathan Young · 2025-02-06T10:31:14.312Z · comments (0)
Wild Animal Suffering Is The Worst Thing In The World
omnizoid · 2025-02-06T16:15:34.572Z · comments (9)
Illusory Safety: Redteaming DeepSeek R1 and the Strongest Fine-Tunable Models of OpenAI, Anthropic, and Google
ChengCheng (ccstan99) · 2025-02-07T03:57:30.904Z · comments (0)
BIDA Calendar iCal Feed
jefftk (jkaufman) · 2025-02-06T01:30:07.887Z · comments (0)
[link] Understanding Benchmarks and motivating Evaluations
markov (markovial) · 2025-02-06T01:32:49.331Z · comments (0)
When you downvote, explain why
KvmanThinking (avery-liu) · 2025-02-07T01:03:44.097Z · comments (5)
[link] Medical Windfall Prizes
PeterMcCluskey · 2025-02-06T23:33:27.263Z · comments (0)
Do No Harm? Navigating and Nudging AI Moral Choices
Sinem (sinem-erisken) · 2025-02-06T19:18:31.065Z · comments (0)
[link] AISN #47: Reasoning Models
Corin Katzke (corin-katzke) · 2025-02-06T18:52:29.843Z · comments (0)
[question] hypnosis question
KvmanThinking (avery-liu) · 2025-02-06T02:41:53.314Z · answers+comments (5)
[link] Biology, Ideology and Violence
Zero Contradictions · 2025-02-06T11:26:02.845Z · comments (1)
next page (older posts) →