LessWrong 2.0 Reader

View: New · Old · Top

next page (older posts) →

[link] AI Safety: A Climb To Armageddon?
kmenou · 2024-06-01T06:02:14.968Z · comments (3)
[question] What do coherence arguments actually prove about agentic behavior?
sunwillrise (andrei-alexandru-parfeni) · 2024-06-01T09:37:28.451Z · answers+comments (35)
Links for May
Kaj_Sotala · 2024-06-01T10:20:02.005Z · comments (16)
How do you know you are right when debating? Calculate your AmIRight score.
MrThink (ViktorThink) · 2024-06-01T15:55:10.722Z · comments (5)
[question] Turning latexed notes into blog posts
notfnofn · 2024-06-01T18:03:18.039Z · answers+comments (2)
Scanning your Brain with 100,000,000,000 wires?
Johannes C. Mayer (johannes-c-mayer) · 2024-06-01T18:37:25.548Z · comments (6)
Simulations and Altruism
FateGrinder (nicolo-moretti) · 2024-06-02T02:45:49.783Z · comments (2)
How it All Went Down: The Puzzle Hunt that took us way, way Less Online
A* (agendra) · 2024-06-02T08:01:40.109Z · comments (5)
Why write down the basics of logic if they are so evident?
Crazy philosopher (commissar Yarrick) · 2024-06-02T12:02:44.722Z · comments (9)
[link] Origins of the Lab Mouse
Niko_McCarty (niko-2) · 2024-06-02T15:40:32.932Z · comments (0)
[question] List of arguments for Bayesianism
Aryeh Englander (alenglander) · 2024-06-02T19:06:47.474Z · answers+comments (3)
How to Better Report Sparse Autoencoder Performance
J Bostock (Jemist) · 2024-06-02T19:34:22.803Z · comments (4)
[question] How do you shut down an escaped model?
quetzal_rainbow · 2024-06-02T19:51:58.880Z · answers+comments (8)
Politics is the mind-killer, but maybe we should talk about it anyway
Chris_Leong · 2024-06-03T06:37:57.037Z · comments (33)
Comments on Anthropic's Scaling Monosemanticity
Robert_AIZI · 2024-06-03T12:15:44.708Z · comments (8)
ACX Meetup
svfritz · 2024-06-03T13:02:49.935Z · comments (0)
Companies' safety plans neglect risks from scheming AI
Zach Stein-Perlman · 2024-06-03T15:00:20.236Z · comments (4)
AI catastrophes and rogue deployments
Buck · 2024-06-03T17:04:51.206Z · comments (16)
[question] How was Less Online for you?
Gordon Seidoh Worley (gworley) · 2024-06-03T17:10:33.766Z · answers+comments (4)
The Standard Analogy
Zack_M_Davis · 2024-06-03T17:15:42.327Z · comments (28)
Searching Magic Cards
jefftk (jkaufman) · 2024-06-03T17:40:02.207Z · comments (2)
Finding the estimate of the value of a state in RL agents
Clément Dumas (butanium) · 2024-06-03T20:26:59.385Z · comments (4)
[link] in defense of Linus Pauling
bhauth · 2024-06-03T21:27:43.962Z · comments (8)
ACI#8: Value as a Function of Possible Worlds
Akira Pyinya · 2024-06-03T21:49:02.345Z · comments (2)
Philosophers wrestling with evil, as a social media feed
David Gross (David_Gross) · 2024-06-03T22:25:22.507Z · comments (2)
[link] Masculinity—A Case For Courage
James Stephen Brown (james-brown) · 2024-06-04T00:04:48.411Z · comments (0)
(Not) Derailing the LessOnline Puzzle Hunt
Error · 2024-06-04T01:28:31.688Z · comments (2)
Just admit that you’ve zoned out
joec · 2024-06-04T02:51:27.594Z · comments (22)
Smartphone Etiquette: Suggestions for Social Interactions
Declan Molony (declan-molony) · 2024-06-04T06:01:03.336Z · comments (4)
Is Wittgenstein's Language Game used when helping Ai understand language?
[deleted] · 2024-06-04T07:41:53.725Z · comments (6)
[question] Has anyone here written about religious fictionalism?
SpectrumDT · 2024-06-04T12:10:21.987Z · answers+comments (4)
Circuit Board Ordering
jefftk (jkaufman) · 2024-06-04T14:00:02.084Z · comments (0)
[link] [Paper] Stress-testing capability elicitation with password-locked models
Fabien Roger (Fabien) · 2024-06-04T14:52:50.204Z · comments (10)
Is This Lie Detector Really Just a Lie Detector? An Investigation of LLM Probe Specificity.
Josh Levy (josh-levy) · 2024-06-04T15:45:54.399Z · comments (0)
Evidence of Learned Look-Ahead in a Chess-Playing Neural Network
Erik Jenner (ejenner) · 2024-06-04T15:50:47.475Z · comments (14)
Ideas for Next-Generation Writing Platforms, using LLMs
ozziegooen · 2024-06-04T18:40:24.636Z · comments (4)
Here's Why Indefinite Life Extension Will Never Work, Even Though it Does.
HomingHamster (hominghamster) · 2024-06-04T18:48:07.781Z · comments (5)
A Semiotic Critique of the Orthogonality Thesis
Nicolas Villarreal (nicolas-villarreal) · 2024-06-04T18:52:58.642Z · comments (10)
[link] A Reflection on Richard Hamming's "You and Your Research": Striving for Greatness
aysajan · 2024-06-04T20:07:01.422Z · comments (5)
Takeoff speeds presentation at Anthropic
Tom Davidson (tom-davidson-1) · 2024-06-04T22:46:35.448Z · comments (0)
On “first critical tries” in AI alignment
Joe Carlsmith (joekc) · 2024-06-05T00:19:02.814Z · comments (8)
[link] Former OpenAI Superalignment Researcher: Superintelligence by 2030
Julian Bradshaw · 2024-06-05T03:35:19.251Z · comments (30)
Second-Order Rationality, System Rationality, and a feature suggestion for LessWrong
Mati_Roy (MathieuRoy) · 2024-06-05T07:20:10.178Z · comments (2)
Announcing ILIAD — Theoretical AI Alignment Conference
Nora_Ammann · 2024-06-05T09:37:39.546Z · comments (18)
What and how much makes a difference?
Marius Adrian Nicoară · 2024-06-05T10:30:53.493Z · comments (0)
Aggregative Principles of Social Justice
Cleo Nardo (strawberry calm) · 2024-06-05T13:44:47.499Z · comments (10)
[link] Startup Stock Options: the Shortest Complete Guide for Employees
Boris T (Euphetar) · 2024-06-05T15:03:49.777Z · comments (2)
graphpatch: a Python Library for Activation Patching
Occam's Laser (evan-lloyd) · 2024-06-05T15:08:47.416Z · comments (2)
Nonreactivity: a simple model of meditation
cesiumquail · 2024-06-05T16:26:51.167Z · comments (4)
next page (older posts) →