LessWrong 2.0 Reader

[link] The Best Bits From Build, Baby, Build
Maxwell Tabarrok (maxwell-tabarrok) · 2024-07-11T14:09:10.131Z · comments (0)
Inducing human-like biases in moral reasoning LMs
Artyom Karpov (artkpv) · 2024-02-20T16:28:11.424Z · comments (3)
AXRP Episode 34 - AI Evaluations with Beth Barnes
DanielFilan · 2024-07-28T03:30:07.192Z · comments (0)
The Garden of Eden
Alexander Turok · 2024-07-22T16:07:42.509Z · comments (2)
2024 Unofficial LW Community Census, Request for Comments
Screwtape · 2024-11-01T16:34:14.758Z · comments (32)
[link] Letter from an Alien Mind
Shoshannah Tekofsky (DarkSym) · 2024-12-27T13:20:49.277Z · comments (7)
Don’t Legalize Drugs
Declan Molony (declan-molony) · 2025-01-14T06:51:14.005Z · comments (7)
[link] NAO Updates, January 2025
jefftk (jkaufman) · 2025-01-10T03:37:36.698Z · comments (0)
Complete Feedback
abramdemski · 2024-11-01T16:58:50.183Z · comments (7)
Evolution's selection target depends on your weighting
tailcalled · 2024-11-19T18:24:53.117Z · comments (22)
[link] Human-AI Complementarity: A Goal for Amplified Oversight
rishubjain · 2024-12-24T09:57:55.111Z · comments (3)
The current state of RSPs
Zach Stein-Perlman · 2024-11-04T16:00:42.630Z · comments (2)
Improving Our Safety Cases Using Upper and Lower Bounds
Yonatan Cale (yonatan-cale-1) · 2025-01-16T00:01:49.043Z · comments (0)
From the outside, American schooling is weird
Jacob G-W (g-w1) · 2024-03-28T22:45:30.485Z · comments (4)
[link] Public computers can make addictive tools safe
dkl9 · 2024-12-11T19:55:22.818Z · comments (0)
[link] A Defense of Peer Review
Niko_McCarty (niko-2) · 2024-10-22T16:16:49.982Z · comments (1)
[link] [EA xpost] The Rationale-Shaped Hole At The Heart Of Forecasting
dschwarz · 2024-04-02T17:40:44.278Z · comments (2)
Less Anti-Dakka
Mateusz Bagiński (mateusz-baginski) · 2024-05-31T09:07:10.450Z · comments (5)
[link] Foundations - Why Britain has stagnated [crosspost]
Nathan Young · 2024-09-23T10:43:20.411Z · comments (1)
Would you benefit from, or object to, a page with LW users' reacts?
Raemon · 2024-08-20T16:35:47.568Z · comments (6)
Launching Adjacent News
Lucas Kohorst (lucas-kohorst) · 2024-10-16T17:58:10.289Z · comments (0)
[link] Increasing IQ by 10 Points is Possible
George3d6 · 2024-03-19T20:48:41.277Z · comments (51)
Rashomon - A newsbetting site
ideasthete · 2024-10-15T18:15:02.476Z · comments (8)
Apply to the Cooperative AI PhD Fellowship by October 14th!
Lewis Hammond (lewis-hammond-1) · 2024-10-05T12:41:24.093Z · comments (0)
[question] Money Pump Arguments assume Memoryless Agents. Isn't this Unrealistic?
Dalcy (Darcy) · 2024-08-16T04:16:23.159Z · answers+comments (6)
Disentangling Competence and Intelligence
Robert Kralisch (nonmali-1) · 2024-04-29T00:12:50.779Z · comments (7)
[link] The unreasonable effectiveness of plasmid sequencing as a service
Abhishaike Mahajan (abhishaike-mahajan) · 2024-10-08T02:02:55.352Z · comments (2)
Deception and Jailbreak Sequence: 1. Iterative Refinement Stages of Deception in LLMs
Winnie Yang (winnie-yang) · 2024-08-22T07:32:07.600Z · comments (1)
[link] The Offense-Defense Balance of Gene Drives
Maxwell Tabarrok (maxwell-tabarrok) · 2024-09-27T16:47:25.976Z · comments (1)
[link] [Talk transcript] What “structure” is and why it matters
Alex_Altair · 2024-07-25T15:49:00.844Z · comments (0)
[link] Should I Finish My Bachelor's Degree?
Zack_M_Davis · 2024-05-11T05:17:40.067Z · comments (13)
LessWrong audio: help us choose the new voice
PeterH · 2024-12-11T02:24:37.026Z · comments (0)
[link] Being Present is Not a Skill
Chipmonk · 2024-12-18T01:11:04.715Z · comments (8)
The Second Gemini
Zvi · 2024-12-17T15:50:06.373Z · comments (0)
Partitioned Book Club
jenn (pixx) · 2024-05-12T18:38:53.315Z · comments (6)
Gizmo Watch Review
jefftk (jkaufman) · 2024-06-18T20:00:02.247Z · comments (3)
New paper on aligning AI with human values
ryan.lowe · 2024-03-30T23:39:20.288Z · comments (3)
Offering service as a sensayer for simulationist-adjacent beliefs.
mako yass (MakoYass) · 2024-05-22T18:52:05.576Z · comments (0)
[link] social lemon markets
bhauth · 2024-04-25T02:18:04.480Z · comments (6)
Geoffrey Hinton on the Past, Present, and Future of AI
Stephen McAleese (stephen-mcaleese) · 2024-10-12T16:41:56.796Z · comments (5)
[link] Miles Brundage: Finding Ways to Credibly Signal the Benignness of AI Development and Deployment is an Urgent Priority
Zach Stein-Perlman · 2024-10-28T17:00:18.660Z · comments (4)
3a. Towards Formal Corrigibility
Max Harms (max-harms) · 2024-06-09T16:53:45.386Z · comments (2)
AI Safety Evaluations: A Regulatory Review
Elliot Mckernon (elliot) · 2024-03-19T15:05:23.769Z · comments (1)
[link] How to choose what to work on
jasoncrawford · 2024-09-18T20:39:12.316Z · comments (6)
Interpretability: Integrated Gradients is a decent attribution method
Lucius Bushnaq (Lblack) · 2024-05-20T17:55:22.893Z · comments (7)
[question] How was Less Online for you?
Gordon Seidoh Worley (gworley) · 2024-06-03T17:10:33.766Z · answers+comments (4)
Why Isn't Tesla Level 3?
jefftk (jkaufman) · 2024-12-11T14:50:01.159Z · comments (7)
"The Singularity Is Nearer" by Ray Kurzweil - Review
Lavender (Kevin92) · 2024-07-08T21:32:27.307Z · comments (0)
Why I'm bearish on mechanistic interpretability: the shards are not in the network
tailcalled · 2024-09-13T17:09:25.407Z · comments (40)
[link] Day Zero Antivirals for Future Pandemics
Niko_McCarty (niko-2) · 2024-08-26T15:18:33.858Z · comments (2)