LessWrong 2.0 Reader

Safety Implications of LeCun's path to machine intelligence
Ivan Vendrov (ivan-vendrov) · 2022-07-15T21:47:44.411Z · comments (18)
The Maker of MIND
Tomás B. (Bjartur Tómas) · 2021-11-20T16:28:56.327Z · comments (19)
Omicron Post #5
Zvi · 2021-12-09T21:10:00.469Z · comments (18)
Trauma, Meditation, and a Cool Scar
Logan Riggs (elriggs) · 2019-08-06T16:17:39.912Z · comments (17)
[link] A Chess-GPT Linear Emergent World Representation
Adam Karvonen (karvonenadam) · 2024-02-08T04:25:15.222Z · comments (14)
[link] Poker is a bad game for teaching epistemics. Figgie is a better one.
rossry · 2024-07-08T06:05:20.459Z · comments (47)
[link] Explaining grokking through circuit efficiency
Vikrant Varma (amrav) · 2023-09-08T14:39:23.910Z · comments (11)
Oversight Misses 100% of Thoughts The AI Does Not Think
johnswentworth · 2022-08-12T16:30:24.060Z · comments (50)
Contra Yudkowsky on Doom from Foom #2
jacob_cannell · 2023-04-27T00:07:20.360Z · comments (76)
The LessWrong 2018 Review
Raemon · 2019-11-21T02:50:58.262Z · comments (91)
Coase's "Nature of the Firm" on Polyamory
1a3orn · 2021-08-13T13:15:47.709Z · comments (34)
Open Thread With Experimental Feature: Reactions
jimrandomh · 2023-05-24T16:46:39.367Z · comments (189)
On Dwarkesh’s Podcast with Leopold Aschenbrenner
Zvi · 2024-06-10T12:40:03.348Z · comments (7)
Parasitic Language Games: maintaining ambiguity to hide conflict while burning the commons
Hazard · 2023-03-12T05:25:26.496Z · comments (16)
AI #4: Introducing GPT-4
Zvi · 2023-03-21T14:00:01.161Z · comments (32)
Finding gliders in the game of life
paulfchristiano · 2022-12-01T20:40:04.230Z · comments (7)
LLM Applications I Want To See
sarahconstantin · 2024-08-19T21:10:03.101Z · comments (5)
Sam Altman's sister, Annie Altman, claims Sam has severely abused her
prometheus5015 (pl5015) · 2023-10-07T21:06:49.396Z · comments (107)
[link] Book review: WEIRDest People
PeterMcCluskey · 2020-11-30T03:33:17.510Z · comments (57)
[link] The Big Picture Of Alignment (Talk Part 1)
johnswentworth · 2022-02-21T05:49:34.962Z · comments (35)
Two (very different) kinds of donors
Duncan Sabien (Deactivated) (Duncan_Sabien) · 2021-12-22T01:43:52.498Z · comments (19)
Me, Myself, and AI: the Situational Awareness Dataset (SAD) for LLMs
L Rudolf L (LRudL) · 2024-07-08T22:24:38.441Z · comments (28)
Maze-solving agents: Add a top-right vector, make the agent go to the top-right
TurnTrout · 2023-03-31T19:20:48.658Z · comments (17)
[link] Some Lessons Learned from Studying Indirect Object Identification in GPT-2 small
KevinRoWang · 2022-10-28T23:55:44.755Z · comments (9)
LessWrong readers are invited to apply to the Lurkshop
Jonas V (Jonas Vollmer) · 2022-11-22T09:19:05.412Z · comments (41)
"Win First" vs "Chill First"
lionhearted (Sebastian Marshall) (lionhearted) · 2020-09-28T06:48:21.511Z · comments (20)
Kelly Bet on Everything
Jacob Falkovich (Jacobian) · 2020-07-10T02:48:12.868Z · comments (20)
10 reasons why lists of 10 reasons might be a winning strategy
trevor (TrevorWiesinger) · 2023-04-06T21:24:17.896Z · comments (7)
Covid 3/12: New CDC Guidelines Available
Zvi · 2021-03-12T17:20:01.392Z · comments (28)
Search versus design
Alex Flint (alexflint) · 2020-08-16T16:53:18.923Z · comments (40)
Covid 2/11: As Expected
Zvi · 2021-02-11T18:30:01.438Z · comments (79)
Behavioral red-teaming is unlikely to produce clear, strong evidence that models aren't scheming
Buck · 2024-10-10T13:36:53.810Z · comments (3)
Don't accelerate problems you're trying to solve
Andrea_Miotti (AndreaM) · 2023-02-15T18:11:30.595Z · comments (27)
[link] Knowledge Neurons in Pretrained Transformers
evhub · 2021-05-17T22:54:50.494Z · comments (7)
Bad at Arithmetic, Promising at Math
cohenmacaulay · 2022-12-18T05:40:37.088Z · comments (19)
A Case for the Least Forgiving Take On Alignment
Thane Ruthenis · 2023-05-02T21:34:49.832Z · comments (84)
A simple model of math skill
Alex_Altair · 2024-07-21T18:57:33.697Z · comments (16)
[link] Discovering Language Model Behaviors with Model-Written Evaluations
evhub · 2022-12-20T20:08:12.063Z · comments (34)
Productive Mistakes, Not Perfect Answers
adamShimi · 2022-04-07T16:41:50.290Z · comments (11)
Instead of technical research, more people should focus on buying time
Akash (akash-wasil) · 2022-11-05T20:43:45.215Z · comments (45)
AI #8: People Can Do Reasonable Things
Zvi · 2023-04-20T15:50:00.826Z · comments (16)
The Power to Demolish Bad Arguments
Liron · 2019-09-02T12:57:23.341Z · comments (83)
Concrete Reasons for Hope about AI
Zac Hatfield-Dodds (zac-hatfield-dodds) · 2023-01-14T01:22:18.723Z · comments (13)
Reflections on the cryonics sequence
mingyuan · 2021-02-03T01:17:13.556Z · comments (11)
Chaos Induces Abstractions
johnswentworth · 2021-03-18T20:08:21.739Z · comments (13)
Covid 11/12: The Winds of Winter
Zvi · 2020-11-12T14:30:01.387Z · comments (64)
wrapper-minds are the enemy
nostalgebraist · 2022-06-17T01:58:04.919Z · comments (41)
[link] LessOnline (May 31—June 2, Berkeley, CA)
Ben Pace (Benito) · 2024-03-26T02:34:00.000Z · comments (24)
I'm still mystified by the Born rule
So8res · 2021-03-04T02:35:32.301Z · comments (44)
AI Alignment Research Engineer Accelerator (ARENA): call for applicants
CallumMcDougall (TheMcDouglas) · 2023-04-17T20:30:03.965Z · comments (9)