LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

next page (older posts) →

Three Months In, Evaluating Three Rationalist Cases for Trump
Arjun Panickssery (arjun-panickssery) · 2025-04-18T08:27:27.257Z · comments (20)
Training AGI in Secret would be Unsafe and Unethical
Daniel Kokotajlo (daniel-kokotajlo) · 2025-04-18T12:27:35.795Z · comments (7)
What Makes an AI Startup "Net Positive" for Safety?
jacquesthibs (jacques-thibodeau) · 2025-04-18T20:33:22.682Z · comments (9)
Handling schemers if shutdown is not an option
Buck · 2025-04-18T14:39:18.609Z · comments (0)
Scaffolding Skills
Screwtape · 2025-04-18T17:39:25.634Z · comments (1)
o3 Will Use Its Tools For You
Zvi · 2025-04-18T21:20:02.566Z · comments (3)
[link] Inside OpenAI's Controversial Plan to Abandon its Nonprofit Roots
garrison · 2025-04-18T18:46:57.310Z · comments (0)
[question] Comprehensive up-to-date resources on the Chinese Communist Party's AI strategy, etc?
Mateusz Bagiński (mateusz-baginski) · 2025-04-18T04:58:32.037Z · answers+comments (3)
British and American Connotations
jefftk (jkaufman) · 2025-04-18T13:00:09.440Z · comments (2)
Emotional Theory for a Technical Manual on How Not to Freeze Completely
P. João (gabriel-brito) · 2025-04-19T09:12:56.615Z · comments (0)
AI Advances and Detection Strategy
jefftk (jkaufman) · 2025-04-19T11:40:07.264Z · comments (0)
[link] Conditional Forecasting as Model Parameterization
Molly (hickman-santini) · 2025-04-18T02:35:42.110Z · comments (0)
[Rockville] Rationalist Shabbat
maia · 2025-04-18T15:38:30.650Z · comments (0)
0 Motivation Mapping through Information Theory
P. João (gabriel-brito) · 2025-04-18T00:53:34.360Z · comments (0)
[link] The System Didn’t, and Doesn’t Need to be This Way ~ Thomas Paine on Economic Justice
James Stephen Brown (james-brown) · 2025-04-19T05:16:05.682Z · comments (0)
Karma Tests in Logical Counterfactual Simulations motivates strong agents to protect weak agents
Knight Lee (Max Lee) · 2025-04-18T11:11:23.239Z · comments (0)
One Night in Delphi
Eggs (donald-sampson) · 2025-04-18T02:17:04.957Z · comments (2)
The Case for White Box Control
J Rosser (j-rosser-uk) · 2025-04-18T16:10:57.823Z · comments (0)
Consequentialists should have a comprehensive set of deontological beliefs they adhere to
Jay95 · 2025-04-18T20:50:27.064Z · comments (2)
Towards Understanding the Representation of Belief State Geometry in Transformers
Karthik Viswanathan (vkarthik095) · 2025-04-18T12:39:01.251Z · comments (0)
[link] SecureDrop review
samuelshadrach (xpostah) · 2025-04-19T04:29:32.270Z · comments (0)
AI Control Methods Literature Review
Ram Potham (ram-potham) · 2025-04-18T21:15:34.682Z · comments (0)
Evaluating Collaborative AI Performance Subject to Sabotage
Matthew Khoriaty (matthew-khoriaty) · 2025-04-18T19:33:41.547Z · comments (0)
Could LLMs Learn to Detect Bias Autonomously, Like Tesla’s Self-Driving Cars?
Omnipheasant · 2025-04-18T18:45:36.242Z · comments (0)
Alignment Does Not Need to Be Opaque! An Introduction to Feature Steering with Reinforcement Learning
Jeremias Ferrao (jeremias-ferrao) · 2025-04-18T19:34:49.357Z · comments (0)
Measuring Beliefs of Language Models During Chain-of-Thought Reasoning
Baram Sosis (baram-sosis) · 2025-04-18T22:56:28.727Z · comments (0)
AI, Alignment & the Art of Relationship Design
Priyanka Bharadwaj (priyanka-bharadwaj) · 2025-04-19T00:47:02.591Z · comments (2)
I’m headed to DC this week. any tips?
Wes R · 2025-04-19T02:33:18.584Z · comments (0)
LLM-based Fact Checking for Popular Posts?
azergante · 2025-04-18T21:26:25.230Z · comments (1)
What If Galaxies Are Alive and Atoms Have Minds? A Thought Experiment on Life Across Scales
Saif Khan (saif-khan) · 2025-04-18T10:01:18.783Z · comments (4)
next page (older posts) →