LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

← previous page (newer posts) · next page (older posts) →

Self location for LLMs by LLMs: Self-Assessment Checklist.
weightt an (weightt-an) · 2024-09-26T19:57:31.707Z · comments (0)
[link] A primer on ML in antibody engineering
Abhishaike Mahajan (abhishaike-mahajan) · 2024-09-23T17:03:07.628Z · comments (0)
Updating the NAO Simulator
jefftk (jkaufman) · 2024-10-30T13:50:06.908Z · comments (0)
[link] Intention-to-Treat (Re: How harmful is music, really?)
kqr · 2024-09-18T18:44:41.128Z · comments (0)
Switching to a 4GB SD
jefftk (jkaufman) · 2024-09-23T11:20:05.432Z · comments (1)
Conversational Signposts—An Antidote to Dull Social Interactions
Declan Molony (declan-molony) · 2024-10-22T05:37:56.175Z · comments (6)
Spooky Recommendation System Scaling
phdead · 2024-10-31T22:00:51.728Z · comments (0)
Sample Prevalence vs Global Prevalence
jefftk (jkaufman) · 2024-07-08T21:00:03.809Z · comments (0)
Motte-and-Bailey: a Short Explanation
Lorec · 2024-10-23T22:29:55.074Z · comments (0)
[link] The Computational Complexity of Circuit Discovery for Inner Interpretability
Bogdan Ionut Cirstea (bogdan-ionut-cirstea) · 2024-10-17T13:18:46.378Z · comments (2)
Beyond Defensive Technology
ejk64 · 2024-10-14T11:34:24.595Z · comments (1)
Tall tales and long odds
Solenoid_Entity · 2024-08-10T15:22:16.958Z · comments (0)
Using Dangerous AI, But Safely?
habryka (habryka4) · 2024-11-16T04:29:20.914Z · comments (2)
[link] Mechanistic Anomaly Detection Research Update
Nora Belrose (nora-belrose) · 2024-08-06T10:33:26.031Z · comments (0)
Palisade is hiring: Exec Assistant, Content Lead, Ops Lead, and Policy Lead
Charlie Rogers-Smith (charlie.rs) · 2024-10-09T00:04:03.837Z · comments (0)
Controlled Creative Destruction
Martin Sustrik (sustrik) · 2024-07-08T04:36:52.274Z · comments (0)
[question] Has Anyone Here Consciously Changed Their Passions?
Spade · 2024-09-09T01:36:26.197Z · answers+comments (12)
Switching to a Yamaha P-121 Keyboard
jefftk (jkaufman) · 2024-10-02T02:20:02.284Z · comments (0)
[question] Pondering how good or bad things will be in the AGI future
Sherrinford · 2024-07-09T22:46:31.874Z · answers+comments (9)
[link] Comparing Forecasting Track Records for AI Benchmarking and Beyond
ChristianWilliams · 2024-09-25T21:01:15.975Z · comments (0)
Organisation for Program Equilibrium reading group
Smaug123 · 2024-07-25T19:11:02.332Z · comments (14)
On passing Complete and Honest Ideological Turing Tests (CHITTs)
Aryeh Englander (alenglander) · 2024-07-10T04:01:33.567Z · comments (2)
[link] AI Safety Newsletter #40: California AI Legislation Plus, NVIDIA Delays Chip Production, and Do AI Safety Benchmarks Actually Measure Safety?
Corin Katzke (corin-katzke) · 2024-08-21T18:09:33.284Z · comments (0)
Which evals resources would be good?
Marius Hobbhahn (marius-hobbhahn) · 2024-11-16T14:24:48.012Z · comments (0)
Substituting Talkbox for Breath Controller
jefftk (jkaufman) · 2024-10-27T19:10:03.768Z · comments (0)
[link] AISafety.info: What are Inductive Biases?
Algon · 2024-09-19T17:26:24.581Z · comments (4)
We Don't Just Let People Die—So What Next?
James Stephen Brown (james-brown) · 2024-08-03T01:04:49.756Z · comments (8)
Restructuring Pop Songs for Contra
jefftk (jkaufman) · 2024-08-18T14:10:04.029Z · comments (0)
[link] OpenAI’s cybersecurity is probably regulated by NIS Regulations
Adam Jones (domdomegg) · 2024-10-25T11:06:38.392Z · comments (2)
On epistemic autonomy
sanyer (santeri-koivula) · 2024-08-31T18:50:43.377Z · comments (0)
[link] [Linkpost] Building Altruistic and Moral AI Agent with Brain-inspired Affective Empathy Mechanisms
Gunnar_Zarncke · 2024-11-04T10:15:35.550Z · comments (0)
A Policy Proposal
phdead · 2024-09-29T20:45:34.745Z · comments (4)
Krona Compare
jefftk (jkaufman) · 2024-07-20T01:10:03.994Z · comments (0)
Request for advice: Research for Conversational Game Theory for LLMs
Rome Viharo (rome-viharo) · 2024-10-16T17:53:30.243Z · comments (0)
Apply now: Get "unstuck" with the New IFS Self-Care Fellowship Program
Inga G. (inga-g) · 2024-07-16T08:18:11.436Z · comments (3)
[question] What's a good book for a technically-minded 11-year old?
Martin Sustrik (sustrik) · 2024-10-19T06:05:12.178Z · answers+comments (32)
Analysis of key AI analogies
Kevin Kohler (KevinKohler) · 2024-06-29T10:55:21.925Z · comments (2)
A “Scaling Monosemanticity” Explainer
latterframe · 2024-06-29T17:50:49.855Z · comments (0)
[question] Using hex to get murder advice from GPT-4o
Laurence Freeman (laurence-freeman) · 2024-11-13T18:30:23.475Z · answers+comments (5)
Festival Stats 2024
jefftk (jkaufman) · 2024-11-12T02:00:04.831Z · comments (0)
Review of METR’s public evaluation protocol
nahoj · 2024-06-30T22:03:08.945Z · comments (0)
Crafting Polysemantic Transformer Benchmarks with Known Circuits
Evan Anders (evan-anders) · 2024-08-23T22:03:15.288Z · comments (0)
[link] Book Review: Replacing Guilt - On Having Something to Fight For
Cole Killian (cole-killian) · 2024-11-03T19:47:35.093Z · comments (0)
On agentic generalist models: we're essentially using existing technology the weakest and worst way you can use it
Yuli_Ban · 2024-08-28T01:57:17.387Z · comments (2)
[question] Where should I look for information on gut health?
FinalFormal2 · 2024-08-20T19:44:30.632Z · answers+comments (10)
Book Review: Safe Enough? A History of Nuclear Power and Accident Risk
ErickBall · 2024-07-09T01:12:28.730Z · comments (0)
[question] I want a good multi-LLM API-powered chatbot
rotatingpaguro · 2024-09-08T09:40:52.736Z · answers+comments (3)
Summer Tour Stops
jefftk (jkaufman) · 2024-07-09T19:10:05.659Z · comments (0)
Pleasure and suffering are not conceptual opposites
MichaelStJules · 2024-08-11T18:32:30.359Z · comments (0)
[question] Does life actually locally *increase* entropy?
tailcalled · 2024-09-16T20:30:33.148Z · answers+comments (27)
← previous page (newer posts) · next page (older posts) →