LessWrong 2.0 Reader

View: New · Old · Top

next page (older posts) →

Experiment on repeating choices
KatjaGrace · 2024-04-19T04:20:03.992Z · comments (0)
[link] Effective Altruists and Rationalists Views & The case for using marketing to highlight AI risks.
gilch · 2024-04-19T04:16:15.016Z · comments (1)
Cohesion and business problems
Adam Zerner (adamzerner) · 2024-04-19T00:45:00.269Z · comments (1)
The Thermodynamics of Death
Peter lawless · 2024-04-19T00:36:23.762Z · comments (0)
Backyard Office
jefftk (jkaufman) · 2024-04-19T00:31:01.924Z · comments (0)
[link] hydrogen tube transport
bhauth · 2024-04-18T22:47:08.790Z · comments (2)
LessOnline Festival Updates Thread
Ben Pace (Benito) · 2024-04-18T21:55:08.003Z · comments (10)
A Review of In-Context Learning Hypotheses for Automated AI Alignment Research
alamerton · 2024-04-18T18:29:33.892Z · comments (1)
I'm open for projects (sort of)
cousin_it · 2024-04-18T18:05:01.395Z · comments (5)
Blessed information, garbage information, cursed information
tailcalled · 2024-04-18T16:56:17.370Z · comments (2)
[link] [Fiction] A Confession
Arjun Panickssery (arjun-panickssery) · 2024-04-18T16:28:48.194Z · comments (3)
Discriminating Behaviorally Identical Classifiers: a model problem for applying interpretability to scalable oversight
Sam Marks (samuel-marks) · 2024-04-18T16:17:39.136Z · comments (0)
[link] Cooperation is optimal, with weaker agents too  -  tldr
Ryo (Flewrint Ophiuni) · 2024-04-18T15:03:47.245Z · comments (14)
[link] How to coordinate despite our biases? - tldr
Ryo (Flewrint Ophiuni) · 2024-04-18T15:03:18.908Z · comments (2)
Knowledge Base 7: Long-tail knowledge and collective intelligence
iwis · 2024-04-18T14:21:03.293Z · comments (0)
AI #60: Oh the Humanity
Zvi · 2024-04-18T14:10:02.281Z · comments (8)
UDT1.01: Logical Inductors and Implicit Beliefs (5/10)
Diffractor · 2024-04-18T08:39:13.368Z · comments (1)
An examination of GPT-2's boring yet effective glitch
MiguelDev (whitehatStoic) · 2024-04-18T05:26:35.898Z · comments (3)
[question] What if Ethics is Provably Self-Contradictory?
Yitz (yitz) · 2024-04-18T05:12:09.981Z · answers+comments (5)
The Mom Test: Summary and Thoughts
Adam Zerner (adamzerner) · 2024-04-18T03:34:21.020Z · comments (1)
Express interest in an "FHI of the West"
habryka (habryka4) · 2024-04-18T03:32:58.592Z · comments (10)
Why Would Belief-States Have A Fractal Structure, And Why Would That Matter For Interpretability? An Explainer
johnswentworth · 2024-04-18T00:27:43.451Z · comments (13)
AXRP Episode 28 - Suing Labs for AI Risk with Gabriel Weil
DanielFilan · 2024-04-17T21:42:46.992Z · comments (0)
[link] LLM Evaluators Recognize and Favor Their Own Generations
Arjun Panickssery (arjun-panickssery) · 2024-04-17T21:09:12.007Z · comments (1)
SFS: Foundations of Forecasting
MAD2 (mohamed-elmustafa-hammad) · 2024-04-17T17:46:31.172Z · comments (0)
An ethical framework to supersede Utilitarianism
metalcrow · 2024-04-17T17:18:17.493Z · comments (4)
[link] Moving on from community living
Vika · 2024-04-17T17:02:11.357Z · comments (6)
Staged release
Zach Stein-Perlman · 2024-04-17T16:00:19.402Z · comments (4)
[question] Discomfort Stacking
Lewis O’Brien (lewis-o-brien) · 2024-04-17T14:49:25.835Z · answers+comments (11)
[link] FHI (Future of Humanity Institute) has shut down (2005–2024)
gwern · 2024-04-17T13:54:16.791Z · comments (21)
Childhood and Education Roundup #5
Zvi · 2024-04-17T13:00:03.015Z · comments (3)
Should we maximize the Geometric Expectation of Utility?
A.H. (AlfredHarwood) · 2024-04-17T10:37:24.759Z · comments (12)
[link] Claude 3 Opus can operate as a Turing machine
Gunnar_Zarncke · 2024-04-17T08:41:57.209Z · comments (2)
When is a mind me?
Rob Bensinger (RobbBB) · 2024-04-17T05:56:38.482Z · comments (45)
Mid-conditional love
KatjaGrace · 2024-04-17T04:00:08.341Z · comments (13)
Spending Update 2024
jefftk (jkaufman) · 2024-04-17T02:30:02.285Z · comments (0)
Anti MMAcevedo Protocol
Logan Zoellner (logan-zoellner) · 2024-04-16T22:32:28.629Z · comments (1)
Transformers Represent Belief State Geometry in their Residual Stream
Adam Shai (adam-shai) · 2024-04-16T21:16:11.377Z · comments (43)
[link] Tinker
Richard_Ngo (ricraz) · 2024-04-16T18:26:38.679Z · comments (0)
[link] Paul Christiano named as US AI Safety Institute Head of AI Safety
Joel Burget (joel-burget) · 2024-04-16T16:22:06.937Z · comments (37)
Creating unrestricted AI Agents with Command R+
Simon Lermen (dalasnoin) · 2024-04-16T14:52:50.917Z · comments (10)
[link] What should the EA community learn from the FTX / SBF disaster? An in-depth discussion with Will MacAskill on the Clearer Thinking podcast
spencerg · 2024-04-16T13:11:30.562Z · comments (0)
{Book Summary} The Art of Gathering
Tristan Williams (tristan-williams) · 2024-04-16T10:48:41.528Z · comments (0)
[link] Essay competition on the Automation of Wisdom and Philosophy — $25k in prizes
owencb · 2024-04-16T10:10:13.338Z · comments (4)
Announcing SPAR Summer 2024!
laurenmarie12 · 2024-04-16T08:30:31.339Z · comments (1)
[link] The argument for near-term human disempowerment through AI
Chris_Leong · 2024-04-16T04:50:53.828Z · comments (2)
My experience using financial commitments to overcome akrasia
William Howard (william-howard) · 2024-04-15T22:57:32.574Z · comments (16)
A New Response To Newcomb's Paradox
Daniel Birnbaum (daniel-birnbaum) · 2024-04-15T20:38:24.909Z · comments (2)
An evaluation of circuit evaluation metrics
Iván Arcuschin (arcus) · 2024-04-15T19:38:53.457Z · comments (0)
Experiments with an alternative method to promote sparsity in sparse autoencoders
Eoin Farrell · 2024-04-15T18:21:48.771Z · comments (7)
next page (older posts) →