LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

← previous page (newer posts) · next page (older posts) →

A bet for Samo Burja
Nathan Helm-Burger (nathan-helm-burger) · 2024-09-05T16:01:35.440Z · comments (2)
[link] Diffusion Guided NLP: better steering, mostly a good thing
Nathan Helm-Burger (nathan-helm-burger) · 2024-08-10T19:49:50.963Z · comments (0)
What can we learn from insecure domains?
Logan Zoellner (logan-zoellner) · 2024-11-01T23:53:30.066Z · comments (1)
[question] Looking for intuitions to extend bargaining notions
ProgramCrafter (programcrafter) · 2024-08-24T05:00:13.995Z · answers+comments (0)
[link] Metaculus Is Open Source
ChristianWilliams · 2024-10-07T19:55:31.035Z · comments (0)
Chevy Bolt Review
jefftk (jkaufman) · 2024-09-26T13:40:05.456Z · comments (2)
A Case for Conscious Significance rather than Free Will.
James Stephen Brown (james-brown) · 2024-10-25T23:20:30.834Z · comments (2)
Editing at the Take Level
jefftk (jkaufman) · 2024-09-24T11:30:04.914Z · comments (1)
Shutting down all competing AI projects might not buy a lot of time due to Internal Time Pressure
ThomasCederborg · 2024-10-03T00:01:34.011Z · comments (7)
SAE Probing: What is it good for? Absolutely something!
Subhash Kantamneni (subhashk) · 2024-11-01T19:23:55.418Z · comments (0)
Apartment Price Map Discontinuity
jefftk (jkaufman) · 2024-08-19T15:30:05.386Z · comments (0)
[question] If AI is in a bubble and the bubble bursts, what would you do?
Remmelt (remmelt-ellen) · 2024-08-19T10:56:03.948Z · answers+comments (12)
LLM Psychometrics and Prompt-Induced Psychopathy
Korbinian K. (korbinian-koch) · 2024-10-18T18:11:24.256Z · comments (2)
Critique of 'Many People Fear A.I. They Shouldn't' by David Brooks.
Axel Ahlqvist (axelahlqvist1995@gmail.com) · 2024-08-15T18:38:13.437Z · comments (8)
Amoeba roles in tech
Sindhu Shivaprasad (sindhu-shivaprasad) · 2024-10-04T17:25:46.568Z · comments (0)
Live Machinery: Interface Design Workshop for AI Safety @ EA Hotel
Sahil · 2024-11-01T17:24:09.957Z · comments (0)
Source Control for Prototyping and Analysis
jefftk (jkaufman) · 2024-09-26T01:50:04.145Z · comments (0)
[link] How to Fake Decryption
ohmurphy · 2024-09-05T09:18:41.586Z · comments (0)
Do you want to do a debate on youtube? I'm looking for polite, truth-seeking participants.
Nathan Young · 2024-10-10T09:32:59.162Z · comments (0)
Lenses of Control
WillPetillo · 2024-10-22T07:51:06.355Z · comments (0)
Contextual Constitutional AI
aksh-n · 2024-09-28T23:24:43.529Z · comments (1)
Binary encoding as a simple explicit construction for superposition
tailcalled · 2024-10-12T21:18:31.731Z · comments (0)
[link] AI & wisdom 3: AI effects on amortised optimisation
L Rudolf L (LRudL) · 2024-10-28T21:08:56.604Z · comments (0)
[link] AI & wisdom 2: growth and amortised optimisation
L Rudolf L (LRudL) · 2024-10-28T21:07:39.449Z · comments (0)
[link] AI Prejudices: Practical Implications
PeterMcCluskey · 2024-10-19T02:19:58.695Z · comments (0)
How I'd like alignment to get done (as of 2024-10-18)
TristanTrim · 2024-10-18T23:39:03.107Z · comments (2)
[Cross-post] Book Review: Bureaucracy, by James Q Wilson
davekasten · 2024-08-19T13:57:10.872Z · comments (0)
SYSTEMA ROBOTICA
Ali Ahmed (roboticali) · 2024-08-12T20:34:45.879Z · comments (2)
Clarifying Alignment Fundamentals Through the Lens of Ontology
eternal/ephemera · 2024-10-07T20:57:33.238Z · comments (4)
[link] GPT-4o Guardrails Gone: Data Poisoning & Jailbreak-Tuning
ChengCheng (ccstan99) · 2024-11-01T00:10:50.718Z · comments (0)
Beyond Defensive Technology
ejk64 · 2024-10-14T11:34:24.595Z · comments (1)
[link] Intention-to-Treat (Re: How harmful is music, really?)
kqr · 2024-09-18T18:44:41.128Z · comments (0)
Conversational Signposts—An Antidote to Dull Social Interactions
Declan Molony (declan-molony) · 2024-10-22T05:37:56.175Z · comments (6)
What You Can Give Instead of Advice
Karl Faulks (karl-faulks) · 2024-10-24T23:10:48.014Z · comments (2)
Restructuring Pop Songs for Contra
jefftk (jkaufman) · 2024-08-18T14:10:04.029Z · comments (0)
[unlisted] Beneficial applications for current-level AI in human information systems? More likely than you'd think!
mako yass (MakoYass) · 2024-08-16T20:49:57.582Z · comments (0)
Motte-and-Bailey: a Short Explanation
Lorec · 2024-10-23T22:29:55.074Z · comments (0)
[link] AI Safety Newsletter #40: California AI Legislation Plus, NVIDIA Delays Chip Production, and Do AI Safety Benchmarks Actually Measure Safety?
Corin Katzke (corin-katzke) · 2024-08-21T18:09:33.284Z · comments (0)
Substituting Talkbox for Breath Controller
jefftk (jkaufman) · 2024-10-27T19:10:03.768Z · comments (0)
Tall tales and long odds
Solenoid_Entity · 2024-08-10T15:22:16.958Z · comments (0)
Switching to a Yamaha P-121 Keyboard
jefftk (jkaufman) · 2024-10-02T02:20:02.284Z · comments (0)
Palisade is hiring: Exec Assistant, Content Lead, Ops Lead, and Policy Lead
Charlie Rogers-Smith (charlie.rs) · 2024-10-09T00:04:03.837Z · comments (0)
We Don't Just Let People Die—So What Next?
James Stephen Brown (james-brown) · 2024-08-03T01:04:49.756Z · comments (8)
Four Phases of AGI
Gabe M (gabe-mukobi) · 2024-08-05T13:15:23.406Z · comments (3)
[link] Mechanistic Anomaly Detection Research Update
Nora Belrose (nora-belrose) · 2024-08-06T10:33:26.031Z · comments (0)
Updating the NAO Simulator
jefftk (jkaufman) · 2024-10-30T13:50:06.908Z · comments (0)
What is Randomness?
martinkunev · 2024-09-27T17:49:42.704Z · comments (2)
Can startups be impactful in AI safety?
Esben Kran (esben-kran) · 2024-09-13T19:00:33.306Z · comments (0)
[link] AISafety.info: What are Inductive Biases?
Algon · 2024-09-19T17:26:24.581Z · comments (4)
Self location for LLMs by LLMs: Self-Assessment Checklist.
weightt an (weightt-an) · 2024-09-26T19:57:31.707Z · comments (0)
← previous page (newer posts) · next page (older posts) →