LessWrong 2.0 Reader

View: New · Old · Top

next page (older posts) →

Learning the smooth prior
Geoffrey Irving · 2022-04-29T21:10:18.064Z · comments (0)

[question] Do FDT (or similar) recommend reparations?
David Scott Krueger (formerly: capybaralet) (capybaralet) · 2022-04-29T17:34:48.479Z · answers+comments (3)

Saying no to the Appleman
Johannes C. Mayer (johannes-c-mayer) · 2022-04-29T10:39:48.693Z · comments (12)

Prize for Alignment Research Tasks
stuhlmueller · 2022-04-29T08:57:04.290Z · comments (38)

Increasing Demandingness in EA
jefftk (jkaufman) · 2022-04-29T01:20:01.507Z · comments (22)

[question] What is a training "step" vs. "episode" in machine learning?
Evan R. Murphy · 2022-04-28T21:53:24.785Z · answers+comments (4)

Facts Matter
mrdlm (mridul.mohan.m@gmail.com) · 2022-04-28T21:19:38.599Z · comments (2)

[question] Is alignment possible?
Shay · 2022-04-28T21:18:25.891Z · answers+comments (5)

Two Prosocial Rejection Norms
Emrik (Emrik North) · 2022-04-28T20:53:15.850Z · comments (21)

Dath Ilan vs. Sid Meier's Alpha Centauri: Pareto Improvements
David Udell · 2022-04-28T19:26:26.664Z · comments (16)

[link] A Parable Of Explainability
George3d6 · 2022-04-28T16:46:24.280Z · comments (5)

[link] Keep your protos in one repo
RobertM (T3t) · 2022-04-28T15:53:26.803Z · comments (4)

Covid 4/28/22: Take My Paxlovid, Please
Zvi · 2022-04-28T15:20:01.378Z · comments (14)

3-bit filters
iivonen · 2022-04-28T11:55:46.403Z · comments (0)

[link] Jaan Tallinn's 2021 Philanthropy Overview
jaan · 2022-04-28T09:55:50.789Z · comments (2)

Doom sooner
Flaglandbase · 2022-04-28T07:24:10.276Z · comments (0)

How Might an Alignment Attractor Look like?
Shmi (shminux) · 2022-04-28T06:46:11.139Z · comments (15)

Virtue signaling is sometimes the best or the only metric we have
Holly_Elmore · 2022-04-28T04:52:53.884Z · comments (43)

The Gospel of Martin Luther
lsusr · 2022-04-28T04:29:58.601Z · comments (2)

Letter to my Squire
lsusr · 2022-04-28T04:16:38.905Z · comments (0)

Slides: Potential Risks From Advanced AI
Aryeh Englander (alenglander) · 2022-04-28T02:15:20.040Z · comments (0)

Naive comments on AGIlignment
Ericf · 2022-04-28T01:08:02.507Z · comments (4)

AI Alternative Futures: Scenario Mapping Artificial Intelligence Risk - Request for Participation (*Closed*)
Kakili (Greenboat88) · 2022-04-27T22:07:57.906Z · comments (2)

The Speed + Simplicity Prior is probably anti-deceptive
[deleted] · 2022-04-27T19:30:20.173Z · comments (28)

If you’re very optimistic about ELK then you should be optimistic about outer alignment
Sam Marks (samuel-marks) · 2022-04-27T19:30:11.785Z · comments (8)

[link] The Game of Masks
Slimepriestess (Hivewired) · 2022-04-27T18:03:12.423Z · comments (18)

Law-Following AI 3: Lawless AI Agents Undermine Stabilizing Agreements
Cullen (Cullen_OKeefe) · 2022-04-27T17:30:25.915Z · comments (2)

Law-Following AI 2: Intent Alignment + Superintelligence → Lawless AI (By Default)
Cullen (Cullen_OKeefe) · 2022-04-27T17:27:24.210Z · comments (2)

Law-Following AI 1: Sequence Introduction and Structure
Cullen (Cullen_OKeefe) · 2022-04-27T17:26:57.004Z · comments (10)

[Intro to brain-like-AGI safety] 13. Symbol grounding & human social instincts
Steven Byrnes (steve2152) · 2022-04-27T13:30:33.773Z · comments (15)

The case for turning glowfic into Sequences
Thomas Kwa (thomas-kwa) · 2022-04-27T06:58:57.395Z · comments (29)

[Link] Evidence of Fabricated Data in a Vitamin C trial by Paul E Marik et al in CHEST
Kenny · 2022-04-27T06:48:06.597Z · comments (1)

SERI ML Alignment Theory Scholars Program 2022
Ryan Kidd (ryankidd44) · 2022-04-27T00:43:38.221Z · comments (6)

EU Maximizing in a Gloomy World
David Udell · 2022-04-27T00:28:58.494Z · comments (2)

Why Copilot Accelerates Timelines
Michaël Trazzi (mtrazzi) · 2022-04-26T22:06:19.507Z · comments (14)

[link] Universals of Morality: Toward Human-Centric Communication Platforms
scafaria · 2022-04-26T21:15:50.520Z · comments (3)

[$20K in Prizes] AI Safety Arguments Competition
Dan H (dan-hendrycks) · 2022-04-26T16:13:16.351Z · comments (518)

[link] Continental Philosophy as Undergraduate Mathematics
Jan (jan-2) · 2022-04-26T08:05:17.433Z · comments (3)

dalle2 comments
nostalgebraist · 2022-04-26T05:30:07.748Z · comments (14)

[link] Make a neural network in ~10 minutes
Arjun Yadav · 2022-04-26T05:24:57.507Z · comments (0)

Framings of Deceptive Alignment
peterbarnett · 2022-04-26T04:25:56.115Z · comments (7)

[link] Why pessimism sounds smart
jasoncrawford · 2022-04-25T20:10:31.344Z · comments (15)

[question] What is being improved in recursive self improvement?
Lone Pine (conor-sullivan) · 2022-04-25T18:30:47.848Z · answers+comments (6)

21 on 21
Amir Bolous (amir-gamil) · 2022-04-25T18:22:23.110Z · comments (5)

[question] Rationalist Inspired Coming-of-age Rituals
iceplant · 2022-04-25T17:22:35.789Z · answers+comments (3)

[Request for Distillation] Coherence of Distributed Decisions With Different Inputs Implies Conditioning
johnswentworth · 2022-04-25T17:01:08.767Z · comments (14)

[question] Quadratic voting with automatic collusion?
SarahNibs (GuySrinivasan) · 2022-04-25T16:15:49.117Z · answers+comments (5)

Intuitions about solving hard problems
Richard_Ngo (ricraz) · 2022-04-25T15:29:04.253Z · comments (23)

Ukraine Post #11: Longer Term Predictions
Zvi · 2022-04-25T14:10:01.119Z · comments (6)

Key questions about artificial sentience: an opinionated guide
Robbo · 2022-04-25T12:09:39.322Z · comments (31)

next page (older posts) →

Archive

2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
- January
- February
- March
- April
- May
- June
- July
- August
- September
- October
- November
- December
2023
2024
2025

Recent comments

willpetillo on Explaining the Joke: Pausing is The Way

Thanks for the link! It's important to distinguish here between:

(1) support for the movement,
(2) support for the cause, and
(3) active support for the movement (i.e. attracting other activists to show up at future demonstrations)

Most of the paper focuses on 1, and also on activist's beliefs about the impact of their actions. I am more interested in 2 and 3. To be fair, the paper gives some evidence for detrimental impacts on 2 in the Trump example. It's not clear, however, whether the nature of the cause matters here. Support for Trump is highly polarized and entangled with culture, whereas global warming (Hallam's cause) and AI risk (PauseAI's) have relatively broad but frustratingly lukewarm public support. There are also many other factors when looking past short-term onlooker sentiment to the larger question of affecting social change, which the paper readily admits in the Discussion section. I'd list these points, but they largely overlap with the points I made in my post...though it was interesting to see how much was speculative. More research is needed.

In any case, I bring up the extreme case to illustrate that the issue is far more nuanced than "regular people get squeamish--net negative!" This is actually somewhat irrelevant to PauseAI in particular, because most of our actions are around public education and lobbying, and even the protests are legal and non-disruptive. I've been in two myself and have seen nothing but positive sentiment from onlookers (with the exception of the occasional "good luck with that!" snark). The hard part with all of these is getting people to show up. (This last paragraph is not a rebuttal to anything you have said, it's a reminder of context)

satron on Untrusted monitoring insights from watching ChatGPT play coordination games

This discussion is very interesting to read and I look forward to hearing @Fabien Roger [LW · GW]'s thoughts on your latest comment.

lsusr on How AI Takeover Might Happen in 2 Years

My story was posted before James_Miller's. Does this mean I invented a (sub-sub-)genre of science fiction?

stephen-fowler on Stephen Fowler's Shortform

Thinking of trying the latest Gemini model? Be aware that it is almost impossible to disable the "Gemini in Docs" and "Gemini in Gmail" services once you have purchased a Google One AI Premium plan.

knight-lee on A collection of approaches to confronting doom, and my thoughts on them

Life is an insane gift and death is merely its absence.

An argument for afterlife

If believing in doom is too painful, I have a religion to sell to you. I might be able to convince you of an afterlife (for you and those you love).

My afterlife argument starts with a thought experiment. Suppose a teleportation machine destroyed you, but created an identical copy of you somewhere else. Would that copy be you? Should you anticipate the experiences of that new copy of you? I think most people would say yes. After all, your brain state continues to exist in the new copy.

Now suppose the teleportation machine doesn't create a copy of you right now, but a copy of you from 1 second ago. Would you still anticipate the experiences of that new copy? I think most people will still say yes. What's wrong with one second ago?

But what if it's a copy of you from one year ago? Or a copy of you when you were a baby? At some point, the new copy will deviate from you so much that it won't be you anymore, and you should not anticipate his/her future experiences, but anticipate death, the completely cessation of any experiences.

The fuzzy transition

For me, it starts to feel fuzzy, in between life and death, if the new copy deviates from me 10 years ago. I don't know the exactly time period which feels the fuzziest to you, but try to imagine the time period where the copy of you is partially similar to you, and where you half anticipate experiencing his/her experiences.

Doesn't that feel weird? "This person would be me, but only kind of me. If he/she has a happy life, I would kind of anticipate me having a happy life as him/her, but I would also kind of anticipate that's just someone else having a happy life, and I meanwhile will be destroyed by the teleportation machine and experience nothing."

So. What do you anticipate seeing after you walk into the teleportation machine? Dark nothingness? Or walking out the other side of the machine a little younger, with your memories erased, and very confused how you got there?

It's fuzzy.

Looking for an objective answer

Given this fuzziness, you decide that before you walk into the machine, you will consult Reason to see if she will give you an objective answer for whether you will keep living, or become nothing.

But Reason is completely silent, and says not a word. Given the hypothesis where you keep existing, and keep experiencing life and all its joys as this new person, the configuration of atoms in the universe is exactly the same as the hypothesis where you cease to exist, and experience pure nothingness. The two hypotheses make the exact same predictions about the world, and Reason tells you that they are in fact the same hypothesis.

Reason might further tell you, that there is no such thing as "you-ness." It is a meaningless attribute which exists only in your map and not the territory. Whether an entity has the attribute of "being you," does not affect its behaviour in any way.

Whether an entity "is you," only affects what experiences you anticipate. But there is no objectively correct answer for "what experience you should anticipate." ...which is insane if you think about it!

Anticipating experiences

After you absorb the shocking revelation and admit there is no objectively correct answer for "what experience you should anticipate," Reason lets you observe the old Hermit of Immortality. The Hermit of Immortality lives in a cabin in the woods, and has never seen another soul. Every 100 years, he forgets all his memories, and gets a random personality change. The only way to recall his past, is to read his journal about his past life.

Reason tells you that his next transition is about to happen. You watch the Hermit grumble while writing on his journal. "Annoyingly, the time to forget my memories is soon approaching. It is a major annoyance, and my journal isn't very organized this time, so after I forget my memories I will have a hard time studying it. Oh well, I'll eventually figure it out. My life will eventually get simple and happy again after this brief confusing period, just like last time."

The Hermit walks to a designated square outside his cabin, and you watch in horror as a massive box falls down from the sky and crushes him. A door opens on the side of a box, and a young man walks out.

Reason tells you that you may think the Hermit dies, while the Hermit thinks he merely forgets everything and gets a random personality change. But there is no objective law of nature to settle the dispute and prove who is right. The anticipation of experiences is a purely subjective matter.

Your choice

Reason tells you that it is completely your choice whether you anticipate pure nothingness after you die, or whether you anticipate someone else's experiences just like the Hermit. The anticipation of experiences exists only in your map, not the territory. It is not even a belief which can be right or wrong, but a belief about belief, (or something akin to that).

Reason asks you, what do you choose?

You tell Reason, "I would rather choose nothingness, than to anticipate existence without my family who I love so much!"

Well, it seems you see them as a fundamental part of you. But why not anticipate your whole family, becoming some other whole family? That too, is allowed.

But don't get too greedy. If you try to anticipate the experiences of the very happiest people, your intuition will find it less credible, and you will actually anticipate very little. Try to anticipate something a little bit more average.

Fin

What do you think about my pseudoreligion? :)

cubefox on xpostah's Shortform

Yeah. I proposed a while ago that all the AI content was becoming so dominant that it should be hived off to the Alignment Forum while LessWrong is for all the rest. This was rejected.

samuelshadrach on xpostah's Shortform

Yes but then it becomes a forum within a forum kinda thing. You need a critical mass of users who all agree to filter out the AI tag, and not have to preface their every post with "I dont buy your short timelines worldview, I am here to discuss something different".

Building critical mass is difficult unless the forum is conducive to it. There's is ultimately only one upvote button and one front-page so the forum will get taken over by the top few topics that its members are paying attention to.

I don't think there's anything wrong with a forum that's mostly focussed on AI xrisk and transhumanist stuff. Better to do one thing well than half ass ten things. But it also means I may need to go elsewhere.

tslarm on NormanPerlmutter's Shortform

The Krome thing is all rumor

I don’t have evidence against

If the truth is hard to determine, I think that in itself is very worrying. When you have vulnerable people imprisoned and credible fears that they are being mistreated, any response from those in power other than transparency is a bad sign. Giving them the benefit of the doubt as long as they can prevent definitive evidence from coming out is bad epistemics and IMO even worse politics (not in a party-political sense; just in a 'how to disincentivise human rights abuses' sense).

drake-morrison on A Slow Guide to Confronting Doom, v1

This is my favorite guide to confronting doom yet

vladimir_nesov on Milan W's Shortform

A power seeker is ambitious without an ambition, which is not an implication of being agentic.