LessWrong 2.0 Reader

View: New · Old · Top

next page (older posts) →

Distillation Of DeepSeek-Prover V1.5
IvanLin (matthewshing) · 2024-10-15T18:53:11.199Z · comments (0)
Improving Model-Written Evals for AI Safety Benchmarking
Sunishchal Dev (sunishchal-dev) · 2024-10-15T18:25:08.179Z · comments (0)
[link] Taking nonlogical concepts seriously
Kris Brown (kris-brown) · 2024-10-15T18:16:01.226Z · comments (0)
Rashomon - A newsbetting site
ideasthete · 2024-10-15T18:15:02.476Z · comments (0)
On the Practical Applications of Interpretability
Nick Jiang (nick-jiang) · 2024-10-15T17:18:25.280Z · comments (0)
[link] Anthropic's updated Responsible Scaling Policy
Zac Hatfield-Dodds (zac-hatfield-dodds) · 2024-10-15T16:46:48.727Z · comments (0)
[question] When is reward ever the optimization target?
Noosphere89 (sharmake-farah) · 2024-10-15T15:09:20.912Z · answers+comments (2)
[link] An Opinionated Evals Reading List
Marius Hobbhahn (marius-hobbhahn) · 2024-10-15T14:38:58.778Z · comments (0)
Anthropic's first RSP update
Zach Stein-Perlman · 2024-10-15T14:25:12.518Z · comments (7)
[Intuitive self-models] 5. Dissociative Identity (Multiple Personality) Disorder
Steven Byrnes (steve2152) · 2024-10-15T13:31:46.157Z · comments (2)
Economics Roundup #4
Zvi · 2024-10-15T13:20:06.923Z · comments (2)
[question] Is School of Thought related to the Rationality Community?
Shoshannah Tekofsky (DarkSym) · 2024-10-15T12:41:33.224Z · answers+comments (6)
Inverse Problems In Everyday Life
silentbob · 2024-10-15T11:42:30.276Z · comments (1)
[link] Thinking LLMs: General Instruction Following with Thought Generation
Bogdan Ionut Cirstea (bogdan-ionut-cirstea) · 2024-10-15T09:21:22.583Z · comments (0)
Thoughts On the Nature of Capability Elicitation via Fine-tuning
Theodore Chapman · 2024-10-15T08:39:19.909Z · comments (0)
Minimal Motivation of Natural Latents
johnswentworth · 2024-10-14T22:51:58.125Z · comments (1)
[link] How long should political (and other) terms be?
ohmurphy · 2024-10-14T21:38:43.050Z · comments (0)
Examples of How I Use LLMs
jefftk (jkaufman) · 2024-10-14T17:10:04.597Z · comments (2)
[link] Mechanistic Exploration of Gemma 2 List Generation
Gerard Boxo (gerard-boxo) · 2024-10-14T17:04:57.010Z · comments (0)
[question] LW resources on childhood experiences?
nahir91595 · 2024-10-14T17:04:07.810Z · answers+comments (7)
Free Will, Neurotypical Dominance, and the Path to ASI and Neuralinks: Evolving Beyond Scarcity
j_passeri · 2024-10-14T16:54:16.661Z · comments (0)
Breakthroughs, Neurodivergence, and Working Outside the System
j_passeri · 2024-10-14T16:54:16.617Z · comments (2)
The case for unlearning that removes information from LLM weights
Fabien Roger (Fabien) · 2024-10-14T14:08:04.775Z · comments (1)
Circuits in Superposition: Compressing many small neural networks into one
jake_mendel · 2024-10-14T13:06:14.596Z · comments (7)
Beyond Defensive Technology
ejk64 · 2024-10-14T11:34:24.595Z · comments (1)
[link] Why Stop AI is barricading OpenAI
Remmelt (remmelt-ellen) · 2024-10-14T07:12:43.049Z · comments (26)
[link] The Explore vs. Exploit Dilemma
nathanjzhao · 2024-10-14T06:20:25.526Z · comments (0)
AI Alignment via Slow Substrates: Early Empirical Results With StarCraft II
Lester Leong (lester-leong) · 2024-10-14T04:05:05.096Z · comments (9)
[link] some questionable space launch guns
bhauth · 2024-10-13T22:52:26.418Z · comments (0)
[question] What are your favorite books or blogs that are out of print, or whose domains have expired (especially if they also aren't on LibGen/Wayback/etc, or on Amazon)?
Arjun Panickssery (arjun-panickssery) · 2024-10-13T20:21:04.540Z · answers+comments (1)
The Hopium Wars: the AGI Entente Delusion
Max Tegmark (MaxTegmark) · 2024-10-13T17:00:29.033Z · comments (44)
Parental Writing Selection Bias
jefftk (jkaufman) · 2024-10-13T14:00:03.225Z · comments (3)
Personal Philosophy
Xor · 2024-10-13T03:01:59.324Z · comments (0)
[link] Contagious Beliefs—Simulating Political Alignment
James Stephen Brown (james-brown) · 2024-10-13T00:27:08.084Z · comments (0)
Binary encoding as a simple explicit construction for superposition
tailcalled · 2024-10-12T21:18:31.731Z · comments (0)
[question] How Should We Use Limited Time to Maximize Long-Term Impact?
queelius · 2024-10-12T20:02:46.801Z · answers+comments (3)
[link] A Percentage Model of a Person
Sable · 2024-10-12T17:55:07.560Z · comments (3)
Linguistic Price Tags: The Cost of Non-English LLM Prompting
Shantanu Darveshi (shantanu-darveshi-1) · 2024-10-12T17:37:04.337Z · comments (0)
AI Compute governance: Verifying AI chip location
Farhan · 2024-10-12T17:36:45.942Z · comments (0)
Geoffrey Hinton on the Past, Present, and Future of AI
Stephen McAleese (stephen-mcaleese) · 2024-10-12T16:41:56.796Z · comments (5)
[question] I = W/T?
HNX · 2024-10-12T15:15:36.806Z · answers+comments (3)
AI research assistants competition 2024Q3: Tie between Elicit and You.com
Elizabeth (pktechgirl) · 2024-10-12T15:10:05.417Z · comments (2)
SAE features for refusal and sycophancy steering vectors
neverix · 2024-10-12T14:54:48.022Z · comments (3)
[link] Prices are Bounties
Maxwell Tabarrok (maxwell-tabarrok) · 2024-10-12T14:51:40.689Z · comments (12)
Differential knowledge interconnection
Roman Leventov · 2024-10-12T12:52:36.267Z · comments (0)
Most arguments for AI Doom are either bad or weak
Logan Zoellner (logan-zoellner) · 2024-10-12T11:57:50.840Z · comments (90)
Kassel ACX/LW Meetup
Fernand0 · 2024-10-12T07:47:59.960Z · comments (0)
Neural Network And Newton's Second Law
Max Ma (max-ma) · 2024-10-12T06:25:59.072Z · comments (0)
[question] If the DoJ goes through with the Google breakup,where does Deepmind end up?
O O (o-o) · 2024-10-12T05:06:50.996Z · answers+comments (1)
My theory of change for working in AI healthtech
Andrew_Critch · 2024-10-12T00:36:30.925Z · comments (30)
next page (older posts) →