LessWrong 2.0 Reader

[link] How AI Takeover Might Happen in 2 Years
joshc (joshua-clymer) · 2025-02-07T17:10:10.530Z · comments (2)
So You Want To Make Marginal Progress...
johnswentworth · 2025-02-07T23:22:19.825Z · comments (12)
Racing Towards Fusion and AI
Jeffrey Heninger (jeffrey-heninger) · 2025-02-07T20:40:56.798Z · comments (6)
Illusory Safety: Redteaming DeepSeek R1 and the Strongest Fine-Tunable Models of OpenAI, Anthropic, and Google
ChengCheng (ccstan99) · 2025-02-07T03:57:30.904Z · comments (0)
On the Meta and DeepMind Safety Frameworks
Zvi · 2025-02-07T13:10:08.449Z · comments (1)
A Problem to Solve Before Building a Deception Detector
Eleni Angelou (ea-1) · 2025-02-07T19:35:23.307Z · comments (0)
[link] Research directions Open Phil wants to fund in technical AI safety
jake_mendel · 2025-02-08T01:40:00.968Z · comments (0)
Reasons-based choice and cluelessness
JesseClifton · 2025-02-07T22:21:47.232Z · comments (0)
'High-Level Machine Intelligence' and 'Full Automation of Labor' in the AI Impacts Surveys
Jeffrey Heninger (jeffrey-heninger) · 2025-02-07T20:40:52.388Z · comments (0)
[link] Request for Information for a new US AI Action Plan (OSTP RFI)
agucova · 2025-02-07T20:40:36.034Z · comments (0)
[Translation] In the Age of AI don't Look for Unicorns
mushroomsoup · 2025-02-07T21:06:24.198Z · comments (0)
When you downvote, explain why
KvmanThinking (avery-liu) · 2025-02-07T01:03:44.097Z · comments (21)
[link] Request for proposals: improving capability evaluations
cb · 2025-02-07T18:51:34.926Z · comments (0)
Introducing SyDFAIS: A Systemic Design Framework for AI Safety Field-Building
Moneer Moukaddem (moneer-moukaddem) · 2025-02-07T18:51:24.067Z · comments (0)
the devil's ontology
lostinwilliamsburg · 2025-02-07T14:18:52.516Z · comments (3)