LessWrong 2.0 Reader

[link] Reinforcement Learning by AI Punishment
Abhishaike Mahajan (abhishaike-mahajan) · 2025-01-28T00:57:51.715Z · comments (0)
[link] You Have Two Brains
Eneasz · 2025-01-23T00:52:43.063Z · comments (5)
The Upcoming PEPFAR Cut Will Kill Millions, Many of Them Children
omnizoid · 2025-01-27T16:03:51.214Z · comments (2)
[link] You should read Hobbes, Locke, Hume, and Mill via EarlyModernTexts.com
Arjun Panickssery (arjun-panickssery) · 2025-01-30T12:35:03.564Z · comments (0)
[link] When does capability elicitation bound risk?
joshc (joshua-clymer) · 2025-01-22T03:42:36.289Z · comments (0)
[link] Gradual Disempowerment: Systemic Existential Risks from Incremental AI Development
Jan_Kulveit · 2025-01-30T17:03:45.545Z · comments (0)
so you have a chronic health issue
agencypilled · 2025-01-26T19:00:29.972Z · comments (9)
[link] Are we trying to figure out if AI is conscious?
Kristaps Zilgalvis (kristaps-zilgalvis-1) · 2025-01-27T01:05:07.001Z · comments (6)
A hierarchy of disagreement
Adam Zerner (adamzerner) · 2025-01-23T03:17:59.051Z · comments (4)
AI Strategy Updates that You Should Make
Alice Blair (Diatom) · 2025-01-27T21:10:41.838Z · comments (2)
QFT and neural nets: the basic idea
Dmitry Vaintrob (dmitry-vaintrob) · 2025-01-24T13:54:45.099Z · comments (0)
SAE regularization produces more interpretable models
Peter Lai (peter-lai) · 2025-01-28T20:02:56.662Z · comments (2)
Efficiency spectra and “bucket of circuits” cartoons
Dmitry Vaintrob (dmitry-vaintrob) · 2025-01-29T15:06:50.768Z · comments (0)
Monet: Mixture of Monosemantic Experts for Transformers Explained
CalebMaresca (caleb-maresca) · 2025-01-25T19:37:09.078Z · comments (2)
The memorization-generalization spectrum and learning coefficients
Dmitry Vaintrob (dmitry-vaintrob) · 2025-01-28T16:53:24.628Z · comments (0)
[link] Notes on Argentina
Annapurna (jorge-velez) · 2025-01-26T03:51:15.393Z · comments (5)
[question] How useful would alien alignment research be?
Donald Hobson (donald-hobson) · 2025-01-23T10:59:22.330Z · answers+comments (5)
The present perfect tense is ruining your life
PatrickDFarley · 2025-01-27T16:14:48.843Z · comments (7)
ARENA 5.0 - Call for Applicants
JamesH (AtlasOfCharts) · 2025-01-30T13:18:27.052Z · comments (0)
[link] Training Data Attribution (TDA): Examining Its Adoption & Use Cases
Deric Cheng (deric-cheng) · 2025-01-22T15:40:13.393Z · comments (0)
November-December 2024 Progress in Guaranteed Safe AI
Quinn (quinn-dougherty) · 2025-01-22T01:20:00.868Z · comments (0)
How different LLMs answered PhilPapers 2020 survey
Satron · 2025-01-27T21:41:12.334Z · comments (1)
[question] Do you consider perfect surveillance inevitable?
samuelshadrach (xpostah) · 2025-01-24T04:57:48.266Z · answers+comments (25)
[link] Is there such a thing as an impossible protein?
Abhishaike Mahajan (abhishaike-mahajan) · 2025-01-24T17:12:01.174Z · comments (3)
[link] Lazy Hasselback Pommes Anna
Brendan Long (korin43) · 2025-01-26T21:30:36.587Z · comments (18)
The Functionalist Case for Machine Consciousness: Evidence from Large Language Models
James Diacoumis (james-diacoumis) · 2025-01-22T17:43:41.215Z · comments (22)
Nvidia doesn’t just sell shovels
winstonBosan · 2025-01-28T04:56:38.720Z · comments (4)
[question] Those of you with lots of meditation experience: How did it influence your understanding of philosophy of mind and topics such as qualia?
SpectrumDT · 2025-01-28T14:29:47.034Z · answers+comments (14)
Why I'm Pouring Cold Water in My Left Ear, and You Should Too
Maloew (maloew-valenar) · 2025-01-24T23:13:52.340Z · comments (0)
[question] Should you publish solutions to corrigibility?
rvnnt · 2025-01-30T11:52:05.983Z · answers+comments (8)
[link] Anatomy of a Dance Class: A step by step guide
Nathan Young · 2025-01-26T18:02:04.974Z · comments (0)
My Mental Model of AI Optimist Opinions
tailcalled · 2025-01-29T18:44:36.485Z · comments (2)
What does success look like?
Raymond D · 2025-01-23T17:48:35.618Z · comments (0)
Contra Dances Getting Shorter and Earlier
jefftk (jkaufman) · 2025-01-23T23:30:03.595Z · comments (0)
[link] Uncontrollable: A Surprisingly Good Introduction to AI Risk
PeterMcCluskey · 2025-01-24T04:30:37.499Z · comments (0)
Learn to Develop Your Advantage
ReverendBayes (vedernikov-andrei) · 2025-01-29T22:06:00.641Z · comments (0)
[question] Recommendations for Recent Posts/Sequences on Instrumental Rationality?
Benjamin Hendricks (benjamin-hendricks) · 2025-01-26T00:41:08.577Z · answers+comments (3)
AXRP Episode 38.6 - Joel Lehman on Positive Visions of AI
DanielFilan · 2025-01-24T23:00:07.562Z · comments (0)
The Human Alignment Problem for AIs
rife (edgar-muniz) · 2025-01-22T04:06:10.872Z · comments (5)
The Quantum Mars Teleporter: An Empirical Test Of Personal Identity Theories
avturchin · 2025-01-22T11:48:46.071Z · comments (18)
[link] Training Data Attribution: Examining Its Adoption & Use Cases
Deric Cheng (deric-cheng) · 2025-01-22T15:41:19.744Z · comments (0)
[link] What are the differences between AGI, transformative AI, and superintelligence?
Vishakha (vishakha-agrawal) · 2025-01-23T10:03:31.886Z · comments (3)
Detecting out of distribution text with surprisal and entropy
Sandy Fraser (alex-fraser) · 2025-01-28T18:46:46.977Z · comments (3)
Revealing alignment faking with a single prompt
Florian_Dietz · 2025-01-29T21:01:15.000Z · comments (4)
Liron Shapira vs Ken Stanley on Doom Debates. A review
TheManxLoiner · 2025-01-24T18:01:56.646Z · comments (0)
Reconceptualizing the Nothingness and Existence
Htarlov (htarlov) · 2025-01-28T20:29:44.390Z · comments (1)
Recursive Self-Modeling as a Plausible Mechanism for Real-time Introspection in Current Language Models
rife (edgar-muniz) · 2025-01-22T18:36:45.226Z · comments (5)
[link] Links and short notes, 2025-01-26: Atlas Shrugged and the irreplaceable founder, pumping stations and civic pride, and thoughts on the eve of AGI
jasoncrawford · 2025-01-26T20:52:51.416Z · comments (1)
[link] AISN #46: The Transition
Corin Katzke (corin-katzke) · 2025-01-23T18:09:36.858Z · comments (0)