LessWrong 2.0 Reader

View: New · Old · Top

next page (older posts) →

Gradient hacking: definitions and examples
Richard_Ngo (ricraz) · 2022-06-29T21:35:37.166Z · comments (2)
[link] Progress links and tweets, 2022-06-29
jasoncrawford · 2022-06-29T21:33:40.285Z · comments (0)
[question] Correcting human error vs doing exactly what you're told - is there literature on this in context of general system design?
Jan Czechowski (przemyslaw-czechowski) · 2022-06-29T21:30:05.753Z · answers+comments (0)
Latent Adversarial Training
Adam Jermyn (adam-jermyn) · 2022-06-29T20:04:00.249Z · comments (13)
Game Review: This Merchant Life
Zvi · 2022-06-29T18:30:00.816Z · comments (0)
Limits to Legibility
Jan_Kulveit · 2022-06-29T17:42:19.338Z · comments (11)
Will Capabilities Generalise More?
Ramana Kumar (ramana-kumar) · 2022-06-29T17:12:56.255Z · comments (39)
Kevin Kelly's "103 Bits of Advice," Expanded
Dalton Mabery (dalton-mabery) · 2022-06-29T13:36:13.160Z · comments (0)
The table of different sampling assumptions in anthropics
avturchin · 2022-06-29T10:41:18.872Z · comments (5)
Can We Align AI by Having It Learn Human Preferences? I’m Scared (summary of last third of Human Compatible)
apollonianblues · 2022-06-29T04:09:06.213Z · comments (3)
[link] Kurzgesagt – The Last Human (Youtube)
habryka (habryka4) · 2022-06-29T03:28:44.213Z · comments (7)
[question] Literature on How to Maximize Preferences
josh (soren-d) · 2022-06-28T22:41:38.152Z · answers+comments (0)
Challenge: A Much More Alien Message
kman · 2022-06-28T21:50:59.877Z · comments (7)
It’s Probably Not Lithium
Natália (Natália Mendonça) · 2022-06-28T21:24:10.246Z · comments (187)
Reflections on Living in "Guess Culture"
Dalton Mabery (dalton-mabery) · 2022-06-28T21:00:39.680Z · comments (1)
[question] What is the LessWrong Logo(?) Supposed to Represent?
DragonGod · 2022-06-28T20:20:52.321Z · answers+comments (6)
What Are You Tracking In Your Head?
johnswentworth · 2022-06-28T19:30:06.164Z · comments (83)
[link] Why is so much political commentary misleading?
contrarianbrit · 2022-06-28T17:10:58.743Z · comments (5)
CFAR Handbook: Introduction
CFAR!Duncan (CFAR 2017) · 2022-06-28T16:53:53.312Z · comments (12)
Units of Exchange
CFAR!Duncan (CFAR 2017) · 2022-06-28T16:53:53.069Z · comments (28)
[link] Scott Aaronson and Steven Pinker Debate AI Scaling
Liron · 2022-06-28T16:04:58.515Z · comments (7)
A physicist's approach to Origins of Life
pchvykov · 2022-06-28T15:23:23.310Z · comments (6)
[link] What success looks like
Marius Hobbhahn (marius-hobbhahn) · 2022-06-28T14:38:42.758Z · comments (4)
Four reasons I find AI safety emotionally compelling
KatWoods (ea247) · 2022-06-28T14:10:35.216Z · comments (3)
Some alternative AI safety research projects
Michele Campolo · 2022-06-28T14:09:27.661Z · comments (0)
Doom doubts - is inner alignment a likely problem?
Crissman · 2022-06-28T12:42:16.197Z · comments (7)
Low-Friction MBTA Predictions
jefftk (jkaufman) · 2022-06-28T12:30:01.714Z · comments (0)
What Diet Books Don't Teach: A book review and a request for more reading
Lone Pine (conor-sullivan) · 2022-06-28T12:27:04.847Z · comments (34)
Assessing AlephAlphas Multimodal Model
p.b. · 2022-06-28T09:28:10.921Z · comments (5)
[question] Is there any way someone could post about public policy relating to abortion access (or another sensitive subject) on LessWrong without getting super downvoted?
Evan_Gaensbauer · 2022-06-28T05:45:17.831Z · answers+comments (20)
[Test Post Please Ignore] Testing polling features
Lone Pine (conor-sullivan) · 2022-06-28T04:35:09.467Z · comments (5)
Yann LeCun, A Path Towards Autonomous Machine Intelligence [link]
Bill Benzon (bill-benzon) · 2022-06-27T23:29:55.384Z · comments (1)
Limits of Bodily Autonomy
jefftk (jkaufman) · 2022-06-27T19:50:01.813Z · comments (18)
[question] Systems Biology for self study
Ulisse Mini (ulisse-mini) · 2022-06-27T19:36:32.707Z · answers+comments (2)
[link] [Yann Lecun] A Path Towards Autonomous Machine Intelligence
DragonGod · 2022-06-27T19:24:50.543Z · comments (13)
Exploring Mild Behaviour in Embedded Agents
Megan Kinniment (megan-kinniment) · 2022-06-27T18:56:34.794Z · comments (4)
Epistemic modesty and how I think about AI risk
Aryeh Englander (alenglander) · 2022-06-27T18:47:35.827Z · comments (4)
Deliberation Everywhere: Simple Examples
Oliver Sourbut · 2022-06-27T17:26:20.848Z · comments (3)
Deliberation, Reactions, and Control: Tentative Definitions and a Restatement of Instrumental Convergence
Oliver Sourbut · 2022-06-27T17:25:45.986Z · comments (0)
[question] Are long-form dating profiles productive?
AABoyles · 2022-06-27T17:03:35.266Z · answers+comments (32)
[link] Custom iPhone Widget to Encourage Less Wrong Use
Will Payne (will-payne) · 2022-06-27T16:14:50.141Z · comments (2)
Announcing the Inverse Scaling Prize ($250k Prize Pool)
Ethan Perez (ethan-perez) · 2022-06-27T15:58:19.135Z · comments (14)
[link] Announcing Epoch: A research organization investigating the road to Transformative AI
Jsevillamol · 2022-06-27T13:55:51.451Z · comments (2)
Air Conditioner Repair
Zvi · 2022-06-27T12:40:01.514Z · comments (34)
[question] Why Are Posts in the Sequences Tagged [Personal Blog] Instead of [Frontpage]?
DragonGod · 2022-06-27T09:35:26.778Z · answers+comments (2)
Contest: An Alien Message
DaemonicSigil · 2022-06-27T05:54:54.144Z · comments (100)
[link] Robin Hanson asks "Why Not Wait On AI Risk?"
Gunnar_Zarncke · 2022-06-26T23:32:19.436Z · comments (4)
Sex Fairy Lore
pchvykov · 2022-06-26T20:42:38.636Z · comments (10)
[link] King David's %: Establishing a new symbol for Bayesian probability.
Paul Logan (paul-logan) · 2022-06-26T19:47:57.047Z · comments (1)
Training Trace Priors and Speed Priors
Adam Jermyn (adam-jermyn) · 2022-06-26T18:07:08.746Z · comments (0)
next page (older posts) →