LessWrong 2.0 Reader

View: New · Old · Top

next page (older posts) →

Learning the smooth prior
Geoffrey Irving · 2022-04-29T21:10:18.064Z · comments (0)
[question] Do FDT (or similar) recommend reparations?
David Scott Krueger (formerly: capybaralet) (capybaralet) · 2022-04-29T17:34:48.479Z · answers+comments (3)
Saying no to the Appleman
Johannes C. Mayer (johannes-c-mayer) · 2022-04-29T10:39:48.693Z · comments (12)
Prize for Alignment Research Tasks
stuhlmueller · 2022-04-29T08:57:04.290Z · comments (38)
Increasing Demandingness in EA
jefftk (jkaufman) · 2022-04-29T01:20:01.507Z · comments (22)
[question] What is a training "step" vs. "episode" in machine learning?
Evan R. Murphy · 2022-04-28T21:53:24.785Z · answers+comments (4)
Facts Matter
mrdlm (mridul.mohan.m@gmail.com) · 2022-04-28T21:19:38.599Z · comments (2)
[question] Is alignment possible?
Shay · 2022-04-28T21:18:25.891Z · answers+comments (5)
Two Prosocial Rejection Norms
Emrik (Emrik North) · 2022-04-28T20:53:15.850Z · comments (21)
Dath Ilan vs. Sid Meier's Alpha Centauri: Pareto Improvements
David Udell · 2022-04-28T19:26:26.664Z · comments (16)
[link] A Parable Of Explainability
George3d6 · 2022-04-28T16:46:24.280Z · comments (5)
[link] Keep your protos in one repo
RobertM (T3t) · 2022-04-28T15:53:26.803Z · comments (4)
Covid 4/28/22: Take My Paxlovid, Please
Zvi · 2022-04-28T15:20:01.378Z · comments (14)
3-bit filters
iivonen · 2022-04-28T11:55:46.403Z · comments (0)
[link] Jaan Tallinn's 2021 Philanthropy Overview
jaan · 2022-04-28T09:55:50.789Z · comments (2)
Doom sooner
Flaglandbase · 2022-04-28T07:24:10.276Z · comments (0)
How Might an Alignment Attractor Look like?
Shmi (shminux) · 2022-04-28T06:46:11.139Z · comments (15)
Virtue signaling is sometimes the best or the only metric we have
Holly_Elmore · 2022-04-28T04:52:53.884Z · comments (43)
The Gospel of Martin Luther
lsusr · 2022-04-28T04:29:58.601Z · comments (2)
Letter to my Squire
lsusr · 2022-04-28T04:16:38.905Z · comments (0)
Slides: Potential Risks From Advanced AI
Aryeh Englander (alenglander) · 2022-04-28T02:15:20.040Z · comments (0)
Naive comments on AGIlignment
Ericf · 2022-04-28T01:08:02.507Z · comments (4)
AI Alternative Futures: Scenario Mapping Artificial Intelligence Risk - Request for Participation (*Closed*)
Kakili (Greenboat88) · 2022-04-27T22:07:57.906Z · comments (2)
The Speed + Simplicity Prior is probably anti-deceptive
[deleted] · 2022-04-27T19:30:20.173Z · comments (28)
If you’re very optimistic about ELK then you should be optimistic about outer alignment
Sam Marks (samuel-marks) · 2022-04-27T19:30:11.785Z · comments (8)
[link] The Game of Masks
Slimepriestess (Hivewired) · 2022-04-27T18:03:12.423Z · comments (18)
Law-Following AI 3: Lawless AI Agents Undermine Stabilizing Agreements
Cullen (Cullen_OKeefe) · 2022-04-27T17:30:25.915Z · comments (2)
Law-Following AI 2: Intent Alignment + Superintelligence → Lawless AI (By Default)
Cullen (Cullen_OKeefe) · 2022-04-27T17:27:24.210Z · comments (2)
Law-Following AI 1: Sequence Introduction and Structure
Cullen (Cullen_OKeefe) · 2022-04-27T17:26:57.004Z · comments (10)
[Intro to brain-like-AGI safety] 13. Symbol grounding & human social instincts
Steven Byrnes (steve2152) · 2022-04-27T13:30:33.773Z · comments (15)
The case for turning glowfic into Sequences
Thomas Kwa (thomas-kwa) · 2022-04-27T06:58:57.395Z · comments (29)
[Link] Evidence of Fabricated Data in a Vitamin C trial by Paul E Marik et al in CHEST
Kenny · 2022-04-27T06:48:06.597Z · comments (1)
SERI ML Alignment Theory Scholars Program 2022
Ryan Kidd (ryankidd44) · 2022-04-27T00:43:38.221Z · comments (6)
EU Maximizing in a Gloomy World
David Udell · 2022-04-27T00:28:58.494Z · comments (2)
Why Copilot Accelerates Timelines
Michaël Trazzi (mtrazzi) · 2022-04-26T22:06:19.507Z · comments (14)
[link] Universals of Morality: Toward Human-Centric Communication Platforms
scafaria · 2022-04-26T21:15:50.520Z · comments (3)
[$20K in Prizes] AI Safety Arguments Competition
Dan H (dan-hendrycks) · 2022-04-26T16:13:16.351Z · comments (518)
[link] Continental Philosophy as Undergraduate Mathematics
Jan (jan-2) · 2022-04-26T08:05:17.433Z · comments (3)
dalle2 comments
nostalgebraist · 2022-04-26T05:30:07.748Z · comments (14)
[link] Make a neural network in ~10 minutes
Arjun Yadav · 2022-04-26T05:24:57.507Z · comments (0)
Framings of Deceptive Alignment
peterbarnett · 2022-04-26T04:25:56.115Z · comments (7)
[link] Why pessimism sounds smart
jasoncrawford · 2022-04-25T20:10:31.344Z · comments (15)
[question] What is being improved in recursive self improvement?
Lone Pine (conor-sullivan) · 2022-04-25T18:30:47.848Z · answers+comments (6)
21 on 21
Amir Bolous (amir-gamil) · 2022-04-25T18:22:23.110Z · comments (5)
[question] Rationalist Inspired Coming-of-age Rituals
iceplant · 2022-04-25T17:22:35.789Z · answers+comments (3)
[Request for Distillation] Coherence of Distributed Decisions With Different Inputs Implies Conditioning
johnswentworth · 2022-04-25T17:01:08.767Z · comments (14)
[question] Quadratic voting with automatic collusion?
SarahSrinivasan (GuySrinivasan) · 2022-04-25T16:15:49.117Z · answers+comments (5)
Intuitions about solving hard problems
Richard_Ngo (ricraz) · 2022-04-25T15:29:04.253Z · comments (23)
Ukraine Post #11: Longer Term Predictions
Zvi · 2022-04-25T14:10:01.119Z · comments (6)
Key questions about artificial sentience: an opinionated guide
Robbo · 2022-04-25T12:09:39.322Z · comments (31)
next page (older posts) →