LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

← previous page (newer posts) · next page (older posts) →

[link] How to Fake Decryption
ohmurphy · 2024-09-05T09:18:41.586Z · comments (0)
Can startups be impactful in AI safety?
Esben Kran (esben-kran) · 2024-09-13T19:00:33.306Z · comments (0)
Editing at the Take Level
jefftk (jkaufman) · 2024-09-24T11:30:04.914Z · comments (1)
Source Control for Prototyping and Analysis
jefftk (jkaufman) · 2024-09-26T01:50:04.145Z · comments (0)
Contextual Constitutional AI
aksh-n · 2024-09-28T23:24:43.529Z · comments (1)
Shutting down all competing AI projects might not buy a lot of time due to Internal Time Pressure
ThomasCederborg · 2024-10-03T00:01:34.011Z · comments (7)
Amoeba roles in tech
Sindhu Shivaprasad (sindhu-shivaprasad) · 2024-10-04T17:25:46.568Z · comments (0)
Clarifying Alignment Fundamentals Through the Lens of Ontology
eternal/ephemera · 2024-10-07T20:57:33.238Z · comments (4)
Binary encoding as a simple explicit construction for superposition
tailcalled · 2024-10-12T21:18:31.731Z · comments (0)
LLM Psychometrics and Prompt-Induced Psychopathy
Korbinian K. (korbinian-koch) · 2024-10-18T18:11:24.256Z · comments (2)
How I'd like alignment to get done (as of 2024-10-18)
TristanTrim · 2024-10-18T23:39:03.107Z · comments (2)
[link] AI Prejudices: Practical Implications
PeterMcCluskey · 2024-10-19T02:19:58.695Z · comments (0)
Lenses of Control
WillPetillo · 2024-10-22T07:51:06.355Z · comments (0)
A Case for Conscious Significance rather than Free Will.
James Stephen Brown (james-brown) · 2024-10-25T23:20:30.834Z · comments (2)
A Poem Is All You Need: Jailbreaking ChatGPT, Meta & More
Sharat Jacob Jacob (sharat-jacob-jacob) · 2024-10-29T12:41:30.337Z · comments (0)
Goal: Understand Intelligence
Johannes C. Mayer (johannes-c-mayer) · 2024-11-03T21:20:02.900Z · comments (19)
The current state of RSPs
Zach Stein-Perlman · 2024-11-04T16:00:42.630Z · comments (0)
ML4Good (AI Safety Bootcamp) - Experience report
JanEbbing · 2024-11-05T01:18:43.554Z · comments (0)
GPT-4o Can In Some Cases Solve Moderately Complicated Captchas
dirk (abandon) · 2024-11-09T04:04:37.782Z · comments (2)
AXRP Episode 38.0 - Zhijing Jin on LLMs, Causality, and Multi-Agent Systems
DanielFilan · 2024-11-14T07:00:06.977Z · comments (0)
Evolutionary prompt optimization for SAE feature visualization
neverix · 2024-11-14T13:06:49.728Z · comments (0)
Dance Differentiation
jefftk (jkaufman) · 2024-11-15T02:30:07.694Z · comments (0)
[unlisted] Beneficial applications for current-level AI in human information systems? More likely than you'd think!
mako yass (MakoYass) · 2024-08-16T20:49:57.582Z · comments (0)
Restructuring Pop Songs for Contra
jefftk (jkaufman) · 2024-08-18T14:10:04.029Z · comments (0)
[link] [Linkpost] Building Altruistic and Moral AI Agent with Brain-inspired Affective Empathy Mechanisms
Gunnar_Zarncke · 2024-11-04T10:15:35.550Z · comments (0)
Switching to a 4GB SD
jefftk (jkaufman) · 2024-09-23T11:20:05.432Z · comments (1)
What is Randomness?
martinkunev · 2024-09-27T17:49:42.704Z · comments (2)
[link] AISafety.info: What are Inductive Biases?
Algon · 2024-09-19T17:26:24.581Z · comments (4)
[link] Anthropic - The case for targeted regulation
anaguma · 2024-11-05T07:07:48.174Z · comments (0)
Palisade is hiring: Exec Assistant, Content Lead, Ops Lead, and Policy Lead
Charlie Rogers-Smith (charlie.rs) · 2024-10-09T00:04:03.837Z · comments (0)
[link] The Computational Complexity of Circuit Discovery for Inner Interpretability
Bogdan Ionut Cirstea (bogdan-ionut-cirstea) · 2024-10-17T13:18:46.378Z · comments (2)
Spooky Recommendation System Scaling
phdead · 2024-10-31T22:00:51.728Z · comments (0)
[link] Comparing Forecasting Track Records for AI Benchmarking and Beyond
ChristianWilliams · 2024-09-25T21:01:15.975Z · comments (0)
[link] AI Safety Newsletter #40: California AI Legislation Plus, NVIDIA Delays Chip Production, and Do AI Safety Benchmarks Actually Measure Safety?
Corin Katzke (corin-katzke) · 2024-08-21T18:09:33.284Z · comments (0)
[link] OpenAI’s cybersecurity is probably regulated by NIS Regulations
Adam Jones (domdomegg) · 2024-10-25T11:06:38.392Z · comments (2)
[question] Has Anyone Here Consciously Changed Their Passions?
Spade · 2024-09-09T01:36:26.197Z · answers+comments (12)
Conversational Signposts—An Antidote to Dull Social Interactions
Declan Molony (declan-molony) · 2024-10-22T05:37:56.175Z · comments (6)
Beyond Defensive Technology
ejk64 · 2024-10-14T11:34:24.595Z · comments (1)
On epistemic autonomy
sanyer (santeri-koivula) · 2024-08-31T18:50:43.377Z · comments (0)
Substituting Talkbox for Breath Controller
jefftk (jkaufman) · 2024-10-27T19:10:03.768Z · comments (0)
[link] A Theory of Equilibrium in the Offense-Defense Balance
Maxwell Tabarrok (maxwell-tabarrok) · 2024-11-15T13:51:33.376Z · comments (2)
[link] A primer on ML in antibody engineering
Abhishaike Mahajan (abhishaike-mahajan) · 2024-09-23T17:03:07.628Z · comments (0)
Switching to a Yamaha P-121 Keyboard
jefftk (jkaufman) · 2024-10-02T02:20:02.284Z · comments (0)
Motte-and-Bailey: a Short Explanation
Lorec · 2024-10-23T22:29:55.074Z · comments (0)
Updating the NAO Simulator
jefftk (jkaufman) · 2024-10-30T13:50:06.908Z · comments (0)
[link] Intention-to-Treat (Re: How harmful is music, really?)
kqr · 2024-09-18T18:44:41.128Z · comments (0)
Self location for LLMs by LLMs: Self-Assessment Checklist.
weightt an (weightt-an) · 2024-09-26T19:57:31.707Z · comments (0)
A Policy Proposal
phdead · 2024-09-29T20:45:34.745Z · comments (4)
Request for advice: Research for Conversational Game Theory for LLMs
Rome Viharo (rome-viharo) · 2024-10-16T17:53:30.243Z · comments (0)
[question] I want a good multi-LLM API-powered chatbot
rotatingpaguro · 2024-09-08T09:40:52.736Z · answers+comments (3)
← previous page (newer posts) · next page (older posts) →