LessWrong 2.0 Reader

Geometric Exploration, Arithmetic Exploitation
Scott Garrabrant · 2022-11-24T15:36:30.334Z · comments (4)
Taking the parameters which seem to matter and rotating them until they don't
Garrett Baker (D0TheMath) · 2022-08-26T18:26:47.667Z · comments (48)
Cup-Stacking Skills (or, Reflexive Involuntary Mental Motions)
Duncan Sabien (Deactivated) (Duncan_Sabien) · 2021-10-11T07:16:45.950Z · comments (36)
Utilitarianism Meets Egalitarianism
Scott Garrabrant · 2022-11-21T19:00:12.168Z · comments (16)
My Understanding of Paul Christiano's Iterated Amplification AI Safety Research Agenda
Chi Nguyen · 2020-08-15T20:02:00.205Z · comments (20)
[link] Matt Levine on "Fraud is no fun without friends."
Raemon · 2021-01-19T18:23:20.614Z · comments (24)
[link] DontDoxScottAlexander.com - A Petition
Ben Pace (Benito) · 2020-06-25T05:44:50.050Z · comments (32)
[link] On hiding the source of knowledge
jessicata (jessica.liu.taylor) · 2020-01-26T02:48:51.310Z · comments (40)
Quintin's alignment papers roundup - week 1
Quintin Pope (quintin-pope) · 2022-09-10T06:39:01.773Z · comments (6)
How to Bounded Distrust
Zvi · 2023-01-09T13:10:00.942Z · comments (17)
Evidence of Learned Look-Ahead in a Chess-Playing Neural Network
Erik Jenner (ejenner) · 2024-06-04T15:50:47.475Z · comments (14)
AI Alignment Metastrategy
Vanessa Kosoy (vanessa-kosoy) · 2023-12-31T12:06:11.433Z · comments (13)
Stampy's AI Safety Info soft launch
steven0461 · 2023-10-05T22:13:04.632Z · comments (9)
Compendium of problems with RLHF
Charbel-Raphaël (charbel-raphael-segerie) · 2023-01-29T11:40:53.147Z · comments (16)
"Zero Sum" is a misnomer.
abramdemski · 2020-09-30T18:25:30.603Z · comments (34)
Land Ho!
Zvi · 2022-01-20T13:30:01.262Z · comments (4)
[link] The Alignment Problem: Machine Learning and Human Values
Rohin Shah (rohinmshah) · 2020-10-06T17:41:21.138Z · comments (7)
Omicron Variant Post #2
Zvi · 2021-11-29T16:30:01.368Z · comments (34)
[link] Paper: LLMs trained on “A is B” fail to learn “B is A”
[deleted] · 2023-09-23T19:55:53.427Z · comments (74)
Convincing All Capability Researchers
Logan Riggs (elriggs) · 2022-04-08T17:40:25.488Z · comments (70)
[question] Which skincare products are evidence-based?
Vanessa Kosoy (vanessa-kosoy) · 2024-05-02T15:22:12.597Z · answers+comments (48)
Future ML Systems Will Be Qualitatively Different
jsteinhardt · 2022-01-11T19:50:11.377Z · comments (10)
Views on when AGI comes and on strategy to reduce existential risk
TsviBT · 2023-07-08T09:00:19.735Z · comments (55)
Problem relaxation as a tactic
TurnTrout · 2020-04-22T23:44:42.398Z · comments (8)
Late 2021 MIRI Conversations: AMA / Discussion
Rob Bensinger (RobbBB) · 2022-02-28T20:03:05.318Z · comments (199)
Delta Strain: Fact Dump and Some Policy Takeaways
Connor_Flexman · 2021-07-28T03:38:34.455Z · comments (60)
[link] The Failed Strategy of Artificial Intelligence Doomers
Ben Pace (Benito) · 2025-01-31T18:56:06.784Z · comments (69)
Perpetual Dickensian Poverty?
jefftk (jkaufman) · 2021-12-21T13:30:03.543Z · comments (18)
Revealing Intentionality In Language Models Through AdaVAE Guided Sampling
jdp · 2023-10-20T07:32:28.749Z · comments (15)
FHI paper published in Science: interventions against COVID-19
SoerenMind · 2020-12-16T21:19:00.441Z · comments (0)
[link] The Dangers of Mirrored Life
Niko_McCarty (niko-2) · 2024-12-12T20:58:32.750Z · comments (7)
RTFB: On the New Proposed CAIP AI Bill
Zvi · 2024-04-10T18:30:08.410Z · comments (14)
Christiano, Cotra, and Yudkowsky on AI progress
Eliezer Yudkowsky (Eliezer_Yudkowsky) · 2021-11-25T16:45:32.482Z · comments (95)
Preventing Language Models from hiding their reasoning
Fabien Roger (Fabien) · 2023-10-31T14:34:04.633Z · comments (15)
Unwitting cult leaders
Kaj_Sotala · 2021-02-11T11:10:04.504Z · comments (9)
AI catastrophes and rogue deployments
Buck · 2024-06-03T17:04:51.206Z · comments (16)
A bird's eye view of ARC's research
Jacob_Hilton · 2024-10-23T15:50:06.123Z · comments (12)
A Significant Portion of COVID-19 Transmission Is Presymptomatic
jimrandomh · 2020-03-14T05:52:33.734Z · comments (22)
Solving the Mechanistic Interpretability challenges: EIS VII Challenge 1
StefanHex (Stefan42) · 2023-05-09T19:41:10.528Z · comments (1)
[question] What are your greatest one-shot life improvements?
Mark Xu (mark-xu) · 2020-05-16T16:53:40.608Z · answers+comments (171)
Parable of the Dammed
johnswentworth · 2020-12-10T00:08:44.493Z · comments (29)
Full-time AGI Safety!
Steven Byrnes (steve2152) · 2021-03-01T12:42:14.813Z · comments (3)
[question] How do we prepare for final crunch time?
Eli Tyre (elityre) · 2021-03-30T05:47:54.654Z · answers+comments (30)
Why I'm joining Anthropic
evhub · 2023-01-05T01:12:13.822Z · comments (4)
AGI and the EMH: markets are not expecting aligned or unaligned AI in the next 30 years
basil.halperin (bhalperin) · 2023-01-10T16:06:52.329Z · comments (44)
[question] Why The Focus on Expected Utility Maximisers?
DragonGod · 2022-12-27T15:49:36.536Z · answers+comments (84)
Scissors Statements for President?
AnnaSalamon · 2024-11-06T10:38:21.230Z · comments (32)
The Standard Analogy
Zack_M_Davis · 2024-06-03T17:15:42.327Z · comments (28)
Mental health benefits and downsides of psychedelic use in ACX readers: survey results
RationalElf · 2021-10-25T22:55:09.522Z · comments (18)
A List of 45+ Mech Interp Project Ideas from Apollo Research’s Interpretability Team
Lee Sharkey (Lee_Sharkey) · 2024-07-18T14:15:50.248Z · comments (18)