LessWrong 2.0 Reader

View: New · Old · Top

next page (older posts) →

Why Would Belief-States Have A Fractal Structure, And Why Would That Matter For Interpretability? An Explainer
johnswentworth · 2024-04-18T00:27:43.451Z · comments (0)
AXRP Episode 28 - Tort Law for AI Risk with Gabriel Weil
DanielFilan · 2024-04-17T21:42:46.992Z · comments (0)
[link] LLM Evaluators Recognize and Favor Their Own Generations
Arjun Panickssery (arjun-panickssery) · 2024-04-17T21:09:12.007Z · comments (0)
SFS: Foundations of Forecasting
MAD2 (mohamed-elmustafa-hammad) · 2024-04-17T17:46:31.172Z · comments (0)
An ethical framework to supersede Utilitarianism
metalcrow · 2024-04-17T17:18:17.493Z · comments (4)
[link] Moving on from community living
Vika · 2024-04-17T17:02:11.357Z · comments (6)
Staged release
Zach Stein-Perlman · 2024-04-17T16:00:19.402Z · comments (2)
[question] Discomfort Stacking
Lewis O’Brien (lewis-o-brien) · 2024-04-17T14:49:25.835Z · answers+comments (3)
[link] FHI (Future of Humanity Institute) has shut down (2005–2024)
gwern · 2024-04-17T13:54:16.791Z · comments (11)
Childhood and Education Roundup #5
Zvi · 2024-04-17T13:00:03.015Z · comments (2)
Should we maximize the Geometric Expectation of Utility?
A.H. (AlfredHarwood) · 2024-04-17T10:37:24.759Z · comments (11)
[link] Claude 3 Opus can operate as a Turing machine
Gunnar_Zarncke · 2024-04-17T08:41:57.209Z · comments (2)
When is a mind me?
Rob Bensinger (RobbBB) · 2024-04-17T05:56:38.482Z · comments (22)
Mid-conditional love
KatjaGrace · 2024-04-17T04:00:08.341Z · comments (10)
Spending Update 2024
jefftk (jkaufman) · 2024-04-17T02:30:02.285Z · comments (0)
Anti MMAcevedo Protocol
Logan Zoellner (logan-zoellner) · 2024-04-16T22:32:28.629Z · comments (1)
Transformers Represent Belief State Geometry in their Residual Stream
Adam Shai (adam-shai) · 2024-04-16T21:16:11.377Z · comments (30)
[link] Tinker
Richard_Ngo (ricraz) · 2024-04-16T18:26:38.679Z · comments (0)
[link] Paul Christiano named as US AI Safety Institute Head of AI Safety
Joel Burget (joel-burget) · 2024-04-16T16:22:06.937Z · comments (25)
Creating unrestricted AI Agents with Command R+
Simon Lermen (dalasnoin) · 2024-04-16T14:52:50.917Z · comments (7)
[link] What should the EA community learn from the FTX / SBF disaster? An in-depth discussion with Will MacAskill on the Clearer Thinking podcast
spencerg · 2024-04-16T13:11:30.562Z · comments (0)
{Book Summary} The Art of Gathering
Tristan Williams (tristan-williams) · 2024-04-16T10:48:41.528Z · comments (0)
[link] Essay competition on the Automation of Wisdom and Philosophy — $25k in prizes
owencb · 2024-04-16T10:10:13.338Z · comments (4)
Announcing SPAR Summer 2024!
laurenmarie12 · 2024-04-16T08:30:31.339Z · comments (1)
[link] The argument for near-term human disempowerment through AI
Chris_Leong · 2024-04-16T04:50:53.828Z · comments (2)
My experience using financial commitments to overcome akrasia
William Howard (william-howard) · 2024-04-15T22:57:32.574Z · comments (16)
A New Response To Newcomb's Paradox
Daniel Birnbaum (daniel-birnbaum) · 2024-04-15T20:38:24.909Z · comments (2)
An evaluation of circuit evaluation metrics
Iván Arcuschin (arcus) · 2024-04-15T19:38:53.457Z · comments (0)
Experiments with an alternative method to promote sparsity in sparse autoencoders
Eoin Farrell · 2024-04-15T18:21:48.771Z · comments (3)
Effectively Handling Disagreements - Introducing a New Workshop
Camille Berger (Camille Berger) · 2024-04-15T16:33:50.339Z · comments (1)
Four Local Gigs
jefftk (jkaufman) · 2024-04-15T16:00:02.389Z · comments (0)
Taking into account preferences of past selves
g-w1 · 2024-04-15T13:15:10.545Z · comments (7)
Monthly Roundup #17: April 2024
Zvi · 2024-04-15T12:10:03.126Z · comments (4)
Reconsider the anti-cavity bacteria if you are Asian
Lao Mein (derpherpize) · 2024-04-15T07:02:02.655Z · comments (29)
Anthropic AI made the right call
bhauth · 2024-04-15T00:39:27.078Z · comments (19)
May 2024 Newton meetup???
duck_master · 2024-04-14T22:28:00.161Z · comments (0)
Clipboard Filtering
jefftk (jkaufman) · 2024-04-14T20:50:02.256Z · comments (0)
[link] A High Decoupling Failure
Maxwell Tabarrok (maxwell-tabarrok) · 2024-04-14T19:46:09.552Z · comments (5)
ACX Zwolle meetup
Shaedys · 2024-04-14T13:09:11.376Z · comments (0)
A quick experiment on LMs’ inductive biases in performing search
Alex Mallen (alex-mallen) · 2024-04-14T03:41:08.671Z · comments (2)
UDT1.01 Essential Miscellanea (4/10)
Diffractor · 2024-04-14T02:23:38.755Z · comments (0)
[link] [Cosmology Talks] New Probability Axioms Could Fix Cosmology's Multiverse (Partially) - Sylvia Wenmackers
mako yass (MakoYass) · 2024-04-14T01:26:38.515Z · comments (1)
Speedrun ruiner research idea
lukehmiles (lcmgcd) · 2024-04-13T23:42:29.479Z · comments (11)
Text Posts from the Kids Group: 2020
jefftk (jkaufman) · 2024-04-13T22:30:05.326Z · comments (2)
[question] What convincing warning shot could help prevent extinction from AI?
Charbel-Raphaël (charbel-raphael-segerie) · 2024-04-13T18:09:29.096Z · answers+comments (17)
My experience at ML4Good AI Safety Bootcamp
TheManxLoiner · 2024-04-13T10:55:46.621Z · comments (0)
Consequentialism is a compass, not a judge
Neil (neil-warren) · 2024-04-13T10:47:44.980Z · comments (6)
[link] Carl Sagan, nuking the moon, and not nuking the moon
eukaryote · 2024-04-13T04:08:50.166Z · comments (6)
[question] Barcoding LLM Training Data Subsets. Anyone trying this for interpretability?
right..enough? (howwrongcanitbe) · 2024-04-13T03:09:23.436Z · answers+comments (0)
Prompts for Big-Picture Planning
Raemon · 2024-04-13T03:04:24.523Z · comments (0)
next page (older posts) →