LessWrong 2.0 Reader

The Incredible Fentanyl-Detecting Machine
sarahconstantin · 2024-06-28T22:10:01.223Z · comments (26)
Ironing Out the Squiggles
Zack_M_Davis · 2024-04-29T16:13:00.371Z · comments (36)
OpenAI: Exodus
Zvi · 2024-05-20T13:10:03.543Z · comments (26)
Apologizing is a Core Rationalist Skill
johnswentworth · 2024-01-02T17:47:35.950Z · comments (42)
[question] things that confuse me about the current AI market.
DMMF · 2024-08-28T13:46:56.908Z · answers+comments (28)
Tips for Empirical Alignment Research
Ethan Perez (ethan-perez) · 2024-02-29T06:04:54.481Z · comments (4)
My takes on SB-1047
leogao · 2024-09-09T18:38:37.799Z · comments (8)
Meta Questions about Metaphilosophy
Wei Dai (Wei_Dai) · 2023-09-01T01:17:57.578Z · comments (78)
2023 Survey Results
Screwtape · 2024-02-16T22:24:28.132Z · comments (26)
[link] Using axis lines for good or evil
dynomight · 2024-03-06T14:47:10.989Z · comments (39)
[link] Daniel Dennett has died (1942-2024)
kave · 2024-04-19T16:17:04.742Z · comments (5)
[link] Will no one rid me of this turbulent pest?
Metacelsus · 2023-10-14T15:27:21.497Z · comments (23)
On Devin
Zvi · 2024-03-18T13:20:04.779Z · comments (34)
LoRA Fine-tuning Efficiently Undoes Safety Training from Llama 2-Chat 70B
Simon Lermen (dalasnoin) · 2023-10-12T19:58:02.119Z · comments (29)
[link] Vernor Vinge, who coined the term "Technological Singularity", dies at 79
Kaj_Sotala · 2024-03-21T22:14:14.699Z · comments (24)
Discussion: Challenges with Unsupervised LLM Knowledge Discovery
Seb Farquhar · 2023-12-18T11:58:39.379Z · comments (21)
The U.S. is becoming less stable
lc · 2023-08-18T21:13:11.909Z · comments (67)
Priors and Prejudice
MathiasKB (MathiasKirkBonde) · 2024-04-22T15:00:41.782Z · comments (31)
Liability regimes for AI
Ege Erdil (ege-erdil) · 2024-08-19T01:25:01.006Z · comments (34)
Some (problematic) aesthetics of what constitutes good work in academia
Steven Byrnes (steve2152) · 2024-03-11T17:47:28.835Z · comments (12)
The Plan - 2023 Version
johnswentworth · 2023-12-29T23:34:19.651Z · comments (39)
Does davidad's uploading moonshot work?
jacobjacob · 2023-11-03T02:21:51.720Z · comments (35)
The "public debate" about AI is confusing for the general public and for policymakers because it is a three-sided debate
Adam David Long (adam-david-long-1) · 2023-08-01T00:08:30.908Z · comments (30)
Leading The Parade
johnswentworth · 2024-01-31T22:39:56.499Z · comments (31)
The Hopium Wars: the AGI Entente Delusion
Max Tegmark (MaxTegmark) · 2024-10-13T17:00:29.033Z · comments (50)
[link] Moral Reality Check (a short story)
jessicata (jessica.liu.taylor) · 2023-11-26T05:03:18.254Z · comments (44)
6 non-obvious mental health issues specific to AI safety
Igor Ivanov (igor-ivanov) · 2023-08-18T15:46:09.938Z · comments (24)
OpenAI o1
Zach Stein-Perlman · 2024-09-12T17:30:31.958Z · comments (41)
Deep atheism and AI risk
Joe Carlsmith (joekc) · 2024-01-04T18:58:47.745Z · comments (22)
[link] Nursing doubts
dynomight · 2024-08-30T02:25:36.826Z · comments (20)
The Information: OpenAI shows 'Strawberry' to feds, races to launch it
Martín Soto (martinsq) · 2024-08-27T23:10:18.155Z · comments (15)
[link] If you weren't such an idiot...
kave · 2024-03-02T00:01:37.314Z · comments (74)
[link] Stanislav Petrov Quarterly Performance Review
Ricki Heicklen (bayesshammai) · 2024-09-26T21:20:11.646Z · comments (3)
LLMs for Alignment Research: a safety priority?
abramdemski · 2024-04-04T20:03:22.484Z · comments (24)
My theory of change for working in AI healthtech
Andrew_Critch · 2024-10-12T00:36:30.925Z · comments (35)
[link] That Alien Message - The Animation
Writer · 2024-09-07T14:53:30.604Z · comments (9)
Value Claims (In Particular) Are Usually Bullshit
johnswentworth · 2024-05-30T06:26:21.151Z · comments (18)
Responses to apparent rationalist confusions about game / decision theory
Anthony DiGiovanni (antimonyanthony) · 2023-08-30T22:02:12.218Z · comments (14)
[link] The Checklist: What Succeeding at AI Safety Will Involve
Sam Bowman (sbowman) · 2024-09-03T18:18:34.230Z · comments (49)
Loudly Give Up, Don't Quietly Fade
Screwtape · 2023-11-13T23:30:25.308Z · comments (11)
AI Views Snapshots
Rob Bensinger (RobbBB) · 2023-12-13T00:45:50.016Z · comments (61)
Survey: How Do Elite Chinese Students Feel About the Risks of AI?
Nick Corvino (nick-corvino) · 2024-09-02T18:11:11.867Z · comments (13)
[link] Decomposing Agency — capabilities without desires
owencb · 2024-07-11T09:38:48.509Z · comments (32)
At 87, Pearl is still able to change his mind
rotatingpaguro · 2023-10-18T04:46:29.339Z · comments (15)
0. CAST: Corrigibility as Singular Target
Max Harms (max-harms) · 2024-06-07T22:29:12.934Z · comments (12)
[link] Why I’m not a Bayesian
Richard_Ngo (ricraz) · 2024-10-06T15:22:45.644Z · comments (60)
What good is G-factor if you're dumped in the woods? A field report from a camp counselor.
Hastings (hastings-greer) · 2024-01-12T13:17:23.829Z · comments (22)
Comparing Anthropic's Dictionary Learning to Ours
Robert_AIZI · 2023-10-07T23:30:32.402Z · comments (8)
My experience using financial commitments to overcome akrasia
William Howard (william-howard) · 2024-04-15T22:57:32.574Z · comments (31)
[link] Fields that I reference when thinking about AI takeover prevention
Buck · 2024-08-13T23:08:54.950Z · comments (15)