LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

← previous page (newer posts) · next page (older posts) →

The Quantum Mars Teleporter: An Empirical Test Of Personal Identity Theories
avturchin · 2025-01-22T11:48:46.071Z · comments (18)
Outlaw Code
scarcegreengrass · 2025-01-30T23:41:57.239Z · comments (1)
[link] Training Data Attribution: Examining Its Adoption & Use Cases
Deric Cheng (deric-cheng) · 2025-01-22T15:41:19.744Z · comments (0)
[link] Gradual Disempowerment: Simplified
Annapurna (jorge-velez) · 2025-02-22T16:59:39.072Z · comments (1)
[link] Metaculus Q4 AI Benchmarking: Bots Are Closing The Gap
Molly (hickman-santini) · 2025-02-19T22:42:39.055Z · comments (0)
[link] Teaching AI to reason: this year's most important story
Benjamin_Todd · 2025-02-13T17:40:02.869Z · comments (0)
Dovetail's agent foundations fellowship talks & discussion
Alex_Altair · 2025-02-13T00:49:48.854Z · comments (0)
DeepSeek-R1 for Beginners
Anton Razzhigaev (anton-razzhigaev) · 2025-02-05T18:58:16.282Z · comments (0)
The Human Alignment Problem for AIs
rife (edgar-muniz) · 2025-01-22T04:06:10.872Z · comments (5)
[question] A Simulation of Automation economics?
qbolec · 2025-02-10T08:11:04.424Z · answers+comments (1)
[link] What are the differences between AGI, transformative AI, and superintelligence?
Vishakha (vishakha-agrawal) · 2025-01-23T10:03:31.886Z · comments (3)
[link] Predation as Payment for Criticism
Benquo · 2025-01-30T01:06:27.591Z · comments (6)
AXRP Episode 38.6 - Joel Lehman on Positive Visions of AI
DanielFilan · 2025-01-24T23:00:07.562Z · comments (0)
[link] OpenAI lied about SFT vs. RLHF
sanxiyn · 2025-02-10T03:24:16.625Z · comments (2)
[link] DeepSeek Made it Even Harder for US AI Companies to Ever Reach Profitability
garrison · 2025-02-19T21:02:42.879Z · comments (1)
[link] what an efficient market feels from inside
DMMF · 2025-02-25T02:38:40.129Z · comments (0)
Transformer Dynamics: a neuro-inspired approach to MechInterp
guitchounts · 2025-02-22T21:33:23.855Z · comments (0)
Call for Applications: XLab Summer Research Fellowship
JoNeedsSleep (joanna-j-1) · 2025-02-18T19:19:20.155Z · comments (0)
Revealing alignment faking with a single prompt
Florian_Dietz · 2025-01-29T21:01:15.000Z · comments (5)
BIDA Calendar iCal Feed
jefftk (jkaufman) · 2025-02-06T01:30:07.887Z · comments (0)
SWE Automation Is Coming: Consider Selling Your Crypto
A_donor · 2025-02-13T20:17:59.227Z · comments (8)
Some Theses on Motivational and Directional Feedback
abstractapplic · 2025-02-02T22:50:04.270Z · comments (3)
[link] Introduction to Expected Value Fanaticism
Petra Kosonen · 2025-02-14T19:05:26.556Z · comments (8)
What We Can Do to Prevent Extinction by AI
Joe Rogero · 2025-02-24T17:15:07.109Z · comments (0)
Machine Unlearning in Large Language Models: A Comprehensive Survey with Empirical Insights from the Qwen 1.5 1.8B Model
Saketh Baddam (saketh-baddam) · 2025-02-01T21:26:58.171Z · comments (2)
Liron Shapira vs Ken Stanley on Doom Debates. A review
TheManxLoiner · 2025-01-24T18:01:56.646Z · comments (0)
Recursive Self-Modeling as a Plausible Mechanism for Real-time Introspection in Current Language Models
rife (edgar-muniz) · 2025-01-22T18:36:45.226Z · comments (5)
If Neuroscientists Succeed
Mordechai Rorvig (mordechai-rorvig) · 2025-02-11T15:33:09.098Z · comments (6)
[link] AISN #46: The Transition
Corin Katzke (corin-katzke) · 2025-01-23T18:09:36.858Z · comments (0)
[link] Links and short notes, 2025-01-26: Atlas Shrugged and the irreplaceable founder, pumping stations and civic pride, and thoughts on the eve of AGI
jasoncrawford · 2025-01-26T20:52:51.416Z · comments (1)
[link] Understanding Benchmarks and motivating Evaluations
markov (markovial) · 2025-02-06T01:32:49.331Z · comments (0)
How *exactly* can AI take your job in the next few years?
Ansh Juneja (ansh-juneja) · 2025-01-30T02:33:13.475Z · comments (0)
A fable on AI x-risk
bgaesop · 2025-02-18T20:15:24.933Z · comments (4)
Starting an Egan High School
Chris Wintergreen · 2025-01-26T19:02:17.658Z · comments (2)
Talking to laymen about AI development
David Steel · 2025-02-17T18:42:23.289Z · comments (0)
'High-Level Machine Intelligence' and 'Full Automation of Labor' in the AI Impacts Surveys
Jeffrey Heninger (jeffrey-heninger) · 2025-02-07T20:40:52.388Z · comments (0)
Reconceptualizing the Nothingness and Existence
Htarlov (htarlov) · 2025-01-28T20:29:44.390Z · comments (1)
[link] Are SAE features from the Base Model still meaningful to LLaVA?
Shan23Chen (shan-chen) · 2025-02-18T22:16:14.449Z · comments (2)
The Structure of Professional Revolutions
SebastianG (JohnBuridan) · 2025-02-09T13:23:01.059Z · comments (0)
[link] Progress links and short notes, 2025-02-17
jasoncrawford · 2025-02-17T19:18:29.422Z · comments (0)
Sleeping Beauty: an Accuracy-based Approach
glauberdebona · 2025-02-10T15:40:29.619Z · comments (2)
THE ARCHIVE
Jason Reid (jason-reid) · 2025-02-17T01:12:41.486Z · comments (0)
[link] Cooperation for AI safety must transcend geopolitical interference
Matrice Jacobine · 2025-02-16T18:18:01.539Z · comments (6)
[question] Supposing that the "Dead Internet Theory" is true or largely true, how can we act on that information?
SpectrumDT · 2025-01-27T16:47:01.338Z · answers+comments (5)
Exploring how OthelloGPT computes its world model
JMaar (jim-maar) · 2025-02-02T21:29:09.433Z · comments (0)
What working on AI safety taught me about B2B SaaS sales
purple fire (jack-edwards) · 2025-02-04T20:50:19.990Z · comments (12)
Conditional Importance in Toy Models of Superposition
james__p · 2025-02-02T20:35:38.655Z · comments (2)
Post-hoc reasoning in chain of thought
Kyle Cox (klye) · 2025-02-05T18:58:29.802Z · comments (0)
Make Superintelligence Loving
Davey Morse (davey-morse) · 2025-02-21T06:07:17.235Z · comments (9)
Goals don't necesserily start to crystallize the moment AI is capable enough to fake alignment
Mikhail Samin (mikhail-samin) · 2025-02-08T23:44:46.081Z · comments (0)
← previous page (newer posts) · next page (older posts) →