LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

← previous page (newer posts) · next page (older posts) →

[link] AI Safety Newsletter #40: California AI Legislation Plus, NVIDIA Delays Chip Production, and Do AI Safety Benchmarks Actually Measure Safety?
Corin Katzke (corin-katzke) · 2024-08-21T18:09:33.284Z · comments (0)
On epistemic autonomy
sanyer (santeri-koivula) · 2024-08-31T18:50:43.377Z · comments (0)
[question] Has Anyone Here Consciously Changed Their Passions?
Spade · 2024-09-09T01:36:26.197Z · answers+comments (12)
[link] Intention-to-Treat (Re: How harmful is music, really?)
kqr · 2024-09-18T18:44:41.128Z · comments (0)
[link] AISafety.info: What are Inductive Biases?
Algon · 2024-09-19T17:26:24.581Z · comments (4)
Switching to a 4GB SD
jefftk (jkaufman) · 2024-09-23T11:20:05.432Z · comments (1)
[link] A primer on ML in antibody engineering
Abhishaike Mahajan (abhishaike-mahajan) · 2024-09-23T17:03:07.628Z · comments (0)
[link] Comparing Forecasting Track Records for AI Benchmarking and Beyond
ChristianWilliams · 2024-09-25T21:01:15.975Z · comments (0)
Self location for LLMs by LLMs: Self-Assessment Checklist.
weightt an (weightt-an) · 2024-09-26T19:57:31.707Z · comments (0)
What is Randomness?
martinkunev · 2024-09-27T17:49:42.704Z · comments (2)
Beyond Defensive Technology
ejk64 · 2024-10-14T11:34:24.595Z · comments (1)
[link] The Computational Complexity of Circuit Discovery for Inner Interpretability
Bogdan Ionut Cirstea (bogdan-ionut-cirstea) · 2024-10-17T13:18:46.378Z · comments (2)
Conversational Signposts—An Antidote to Dull Social Interactions
Declan Molony (declan-molony) · 2024-10-22T05:37:56.175Z · comments (6)
Motte-and-Bailey: a Short Explanation
Lorec · 2024-10-23T22:29:55.074Z · comments (0)
[link] OpenAI’s cybersecurity is probably regulated by NIS Regulations
Adam Jones (domdomegg) · 2024-10-25T11:06:38.392Z · comments (2)
Substituting Talkbox for Breath Controller
jefftk (jkaufman) · 2024-10-27T19:10:03.768Z · comments (0)
Updating the NAO Simulator
jefftk (jkaufman) · 2024-10-30T13:50:06.908Z · comments (0)
Spooky Recommendation System Scaling
phdead · 2024-10-31T22:00:51.728Z · comments (0)
[link] [Linkpost] Building Altruistic and Moral AI Agent with Brain-inspired Affective Empathy Mechanisms
Gunnar_Zarncke · 2024-11-04T10:15:35.550Z · comments (0)
[link] Anthropic - The case for targeted regulation
anaguma · 2024-11-05T07:07:48.174Z · comments (0)
Fundamental Uncertainty: Chapter 9 - How do we live with uncertainty?
Gordon Seidoh Worley (gworley) · 2024-11-07T18:15:45.049Z · comments (2)
Using Dangerous AI, But Safely?
habryka (habryka4) · 2024-11-16T04:29:20.914Z · comments (2)
Switching to a Yamaha P-121 Keyboard
jefftk (jkaufman) · 2024-10-02T02:20:02.284Z · comments (0)
A Policy Proposal
phdead · 2024-09-29T20:45:34.745Z · comments (4)
[question] Where should I look for information on gut health?
FinalFormal2 · 2024-08-20T19:44:30.632Z · answers+comments (10)
Summer Tour Stops
jefftk (jkaufman) · 2024-07-09T19:10:05.659Z · comments (0)
Book Review: Safe Enough? A History of Nuclear Power and Accident Risk
ErickBall · 2024-07-09T01:12:28.730Z · comments (0)
[link] Underrated Proverbs
Arjun Panickssery (arjun-panickssery) · 2024-06-13T12:30:01.676Z · comments (9)
[link] Book Review: Replacing Guilt - On Having Something to Fight For
Cole Killian (cole-killian) · 2024-11-03T19:47:35.093Z · comments (0)
Review of METR’s public evaluation protocol
nahoj · 2024-06-30T22:03:08.945Z · comments (0)
A “Scaling Monosemanticity” Explainer
latterframe · 2024-06-29T17:50:49.855Z · comments (0)
[question] Using hex to get murder advice from GPT-4o
Laurence Freeman (laurence-freeman) · 2024-11-13T18:30:23.475Z · answers+comments (5)
[link] When to join a respectability cascade
B Jacobs (Bob Jacobs) · 2024-09-24T07:54:16.051Z · comments (1)
[question] How Should We Use Limited Time to Maximize Long-Term Impact?
queelius · 2024-10-12T20:02:46.801Z · answers+comments (3)
[question] I want a good multi-LLM API-powered chatbot
rotatingpaguro · 2024-09-08T09:40:52.736Z · answers+comments (3)
[question] LW resources on childhood experiences?
nahir91595 · 2024-10-14T17:04:07.810Z · answers+comments (7)
Pleasure and suffering are not conceptual opposites
MichaelStJules · 2024-08-11T18:32:30.359Z · comments (0)
On agentic generalist models: we're essentially using existing technology the weakest and worst way you can use it
Yuli_Ban · 2024-08-28T01:57:17.387Z · comments (2)
Analysis of key AI analogies
Kevin Kohler (KevinKohler) · 2024-06-29T10:55:21.925Z · comments (2)
Compute Governance Literature Review
sijarvis · 2024-06-25T22:41:46.694Z · comments (0)
Keyboard Gremlins
jefftk (jkaufman) · 2024-09-20T02:30:07.140Z · comments (0)
Request for advice: Research for Conversational Game Theory for LLMs
Rome Viharo (rome-viharo) · 2024-10-16T17:53:30.243Z · comments (0)
Just How Good Are Modern Chess Computers?
nem · 2024-09-19T18:57:21.254Z · comments (1)
Festival Stats 2024
jefftk (jkaufman) · 2024-11-12T02:00:04.831Z · comments (0)
[question] What's a good book for a technically-minded 11-year old?
Martin Sustrik (sustrik) · 2024-10-19T06:05:12.178Z · answers+comments (32)
Crafting Polysemantic Transformer Benchmarks with Known Circuits
Evan Anders (evan-anders) · 2024-08-23T22:03:15.288Z · comments (0)
[question] Has Eliezer publicly and satisfactorily responded to attempted rebuttals of the analogy to evolution?
kaler · 2024-07-28T12:23:40.671Z · answers+comments (13)
[link] How harmful is music, really?
dkl9 · 2024-09-17T14:53:25.426Z · comments (6)
aimless ace analyzes active amateur: a micro-aaaaalignment proposal
lukehmiles (lcmgcd) · 2024-07-21T12:37:39.925Z · comments (0)
[question] Does life actually locally *increase* entropy?
tailcalled · 2024-09-16T20:30:33.148Z · answers+comments (27)
← previous page (newer posts) · next page (older posts) →