LessWrong 2.0 Reader

Fucking Goddamn Basics of Rationalist Discourse
LoganStrohl (BrienneYudkowsky) · 2023-02-04T01:47:32.578Z · comments (103)
Sharing Information About Nonlinear
Ben Pace (Benito) · 2023-09-07T06:51:11.846Z · comments (323)
Feature Selection
Zack_M_Davis · 2021-11-01T00:22:29.993Z · comments (24)
You don't know how bad most things are nor precisely how they're bad.
Solenoid_Entity · 2024-08-04T14:12:54.136Z · comments (48)
[link] EA Vegan Advocacy is not truthseeking, and it’s everyone’s problem
Elizabeth (pktechgirl) · 2023-09-28T23:30:03.390Z · comments (250)
Model Organisms of Misalignment: The Case for a New Pillar of Alignment Research
evhub · 2023-08-08T01:30:10.847Z · comments (29)
“PR” is corrosive; “reputation” is not.
AnnaSalamon · 2021-02-14T03:32:24.985Z · comments (95)
[link] When do "brains beat brawn" in Chess? An experiment
titotal (lombertini) · 2023-06-28T13:33:23.854Z · comments (103)
[link] I got dysentery so you don’t have to
eukaryote · 2024-10-22T04:55:58.422Z · comments (4)
What Goes Without Saying
sarahconstantin · 2024-12-20T18:00:06.363Z · comments (27)
The Case Against AI Control Research
johnswentworth · 2025-01-21T16:03:10.143Z · comments (74)
Models Don't "Get Reward"
Sam Ringer · 2022-12-30T10:37:11.798Z · comments (61)
Alignment Grantmaking is Funding-Limited Right Now
johnswentworth · 2023-07-19T16:49:08.811Z · comments (68)
Book Review: How Minds Change
bc4026bd4aaa5b7fe (bc4026bd4aaa5b7fe0bdcd47da7a22b453953f990d35286b9d315a619b23667a) · 2023-05-25T17:55:32.218Z · comments (52)
Six Dimensions of Operational Adequacy in AGI Projects
Eliezer Yudkowsky (Eliezer_Yudkowsky) · 2022-05-30T17:00:30.833Z · comments (66)
The Parable of the King and the Random Process
moridinamael · 2023-03-01T22:18:59.734Z · comments (26)
Epistemic Legibility
Elizabeth (pktechgirl) · 2022-02-09T18:10:06.591Z · comments (30)
[link] Industrial literacy
jasoncrawford · 2020-09-30T16:39:06.520Z · comments (130)
Guide to rationalist interior decorating
mingyuan · 2023-06-19T06:47:13.704Z · comments (49)
Universal Basic Income and Poverty
Eliezer Yudkowsky (Eliezer_Yudkowsky) · 2024-07-26T07:23:50.151Z · comments (136)
Leaky Delegation: You are not a Commodity
Darmani · 2021-01-25T02:04:55.942Z · comments (38)
[link] Great minds might not think alike
Eric Neyman (UnexpectedValues) · 2020-12-26T19:51:05.978Z · comments (45)
On not getting contaminated by the wrong obesity ideas
Natália (Natália Mendonça) · 2023-01-28T20:18:21.322Z · comments (69)
On how various plans miss the hard bits of the alignment challenge
So8res · 2022-07-12T02:49:50.454Z · comments (89)
[link] Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training
evhub · 2024-01-12T19:51:01.021Z · comments (95)
Would catching your AIs trying to escape convince AI developers to slow down or undeploy?
Buck · 2024-08-26T16:46:18.872Z · comments (77)
LW Team is adjusting moderation policy
Raemon · 2023-04-04T20:41:07.603Z · comments (185)
Why Agent Foundations? An Overly Abstract Explanation
johnswentworth · 2022-03-25T23:17:10.324Z · comments (56)
Speaking to Congressional staffers about AI risk
[deleted] · 2023-12-04T23:08:52.055Z · comments (25)
A challenge for AGI organizations, and a challenge for readers
Rob Bensinger (RobbBB) · 2022-12-01T23:11:44.279Z · comments (33)
An Unexpected Victory: Container Stacking at the Port of Long Beach
Zvi · 2021-10-28T14:40:00.497Z · comments (41)
Heads I Win, Tails?—Never Heard of Her; Or, Selective Reporting and the Tragedy of the Green Rationalists
Zack_M_Davis · 2019-09-24T04:12:07.560Z · comments (40)
Lies, Damn Lies, and Fabricated Options
Duncan Sabien (Deactivated) (Duncan_Sabien) · 2021-10-17T02:47:24.909Z · comments (132)
The Field of AI Alignment: A Postmortem, and What To Do About It
johnswentworth · 2024-12-26T18:48:07.614Z · comments (158)
EfficientZero: How It Works
1a3orn · 2021-11-26T15:17:08.321Z · comments (50)
Science in a High-Dimensional World
johnswentworth · 2021-01-08T17:52:02.261Z · comments (53)
LessWrong is providing feedback and proofreading on drafts as a service
Ruby · 2021-09-07T01:33:10.666Z · comments (53)
Gentleness and the artificial Other
Joe Carlsmith (joekc) · 2024-01-02T18:21:34.746Z · comments (33)
Two-year update on my personal AI timelines
Ajeya Cotra (ajeya-cotra) · 2022-08-02T23:07:48.698Z · comments (60)
[link] Is Success the Enemy of Freedom? (Full)
alkjash · 2020-10-26T20:25:50.503Z · comments (69)
AI Timelines
habryka (habryka4) · 2023-11-10T05:28:24.841Z · comments (133)
Predictable updating about AI risk
Joe Carlsmith (joekc) · 2023-05-08T21:53:34.730Z · comments (25)
Study Guide
johnswentworth · 2021-11-06T01:23:09.552Z · comments (48)
[link] Pausing AI Developments Isn't Enough. We Need to Shut it All Down by Eliezer Yudkowsky
jacquesthibs (jacques-thibodeau) · 2023-03-29T23:16:19.431Z · comments (297)
Politics is way too meta
Rob Bensinger (RobbBB) · 2021-03-17T07:04:42.187Z · comments (46)
[link] Intentionally Making Close Friends
Neel Nanda (neel-nanda-1) · 2021-06-27T23:06:49.269Z · comments (35)
[link] Towards Monosemanticity: Decomposing Language Models With Dictionary Learning
Zac Hatfield-Dodds (zac-hatfield-dodds) · 2023-10-05T21:01:39.767Z · comments (22)
Non-Disparagement Canaries for OpenAI
aysja · 2024-05-30T19:20:13.022Z · comments (51)
[link] Scale Was All We Needed, At First
Gabe M (gabe-mukobi) · 2024-02-14T01:49:16.184Z · comments (33)
Accidentally Load Bearing
jefftk (jkaufman) · 2023-07-13T16:10:00.806Z · comments (17)