LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

← previous page (newer posts) · next page (older posts) →

The "Think It Faster" Exercise
Raemon · 2024-12-11T19:14:10.427Z · comments (35)

The Most Forbidden Technique
Zvi · 2025-03-12T13:20:04.732Z · comments (9)

Momentum of Light in Glass
Ben (ben-lang) · 2024-10-09T20:19:42.088Z · comments (44)

Applying traditional economic thinking to AGI: a trilemma
Steven Byrnes (steve2152) · 2025-01-13T01:23:00.397Z · comments (32)

What o3 Becomes by 2028
Vladimir_Nesov · 2024-12-22T12:37:20.929Z · comments (15)

Why Have Sentence Lengths Decreased?
Arjun Panickssery (arjun-panickssery) · 2025-04-03T17:50:29.962Z · comments (32)

Survey: How Do Elite Chinese Students Feel About the Risks of AI?
Nick Corvino (nick-corvino) · 2024-09-02T18:11:11.867Z · comments (13)

Why Don't We Just... Shoggoth+Face+Paraphraser?
Daniel Kokotajlo (daniel-kokotajlo) · 2024-11-19T20:53:52.084Z · comments (58)

[link] A computational no-coincidence principle
Eric Neyman (UnexpectedValues) · 2025-02-14T21:39:39.277Z · comments (38)

Planning for Extreme AI Risks
joshc (joshua-clymer) · 2025-01-29T18:33:14.844Z · comments (4)

Passages I Highlighted in The Letters of J.R.R.Tolkien
Ivan Vendrov (ivan-vendrov) · 2024-11-25T01:47:59.071Z · comments (38)

Auditing language models for hidden objectives
Sam Marks (samuel-marks) · 2025-03-13T19:18:32.638Z · comments (14)

What Indicators Should We Watch to Disambiguate AGI Timelines?
snewman · 2025-01-06T19:57:43.398Z · comments (57)

[link] The Hidden Cost of Our Lies to AI
Nicholas Andresen (nicholas-andresen) · 2025-03-06T05:03:47.239Z · comments (17)

[Fiction] [Comic] Effective Altruism and Rationality meet at a Secular Solstice afterparty
tandem · 2025-01-07T19:11:21.238Z · comments (5)

My experience using financial commitments to overcome akrasia
William Howard (william-howard) · 2024-04-15T22:57:32.574Z · comments (33)

Anomalous Tokens in DeepSeek-V3 and r1
henry (henry-bass) · 2025-01-25T22:55:41.232Z · comments (2)

Hire (or Become) a Thinking Assistant
Raemon · 2024-12-23T03:58:42.061Z · comments (49)

The Milton Friedman Model of Policy Change
JohnofCharleston · 2025-03-04T00:38:56.778Z · comments (17)

[link] The Failed Strategy of Artificial Intelligence Doomers
Ben Pace (Benito) · 2025-01-31T18:56:06.784Z · comments (78)

On saying "Thank you" instead of "I'm Sorry"
Michael Cohn (michael-cohn) · 2024-07-08T03:13:50.663Z · comments (16)

[Completed] The 2024 Petrov Day Scenario
Ben Pace (Benito) · 2024-09-26T08:08:32.495Z · comments (114)

An Extremely Opinionated Annotated List of My Favourite Mechanistic Interpretability Papers v2
Neel Nanda (neel-nanda-1) · 2024-07-07T17:39:35.064Z · comments (16)

Loving a world you don’t trust
Joe Carlsmith (joekc) · 2024-06-18T19:31:36.581Z · comments (13)

How it All Went Down: The Puzzle Hunt that took us way, way Less Online
A* (agendra) · 2024-06-02T08:01:40.109Z · comments (5)

[question] Which things were you surprised to learn are not metaphors?
Eric Neyman (UnexpectedValues) · 2024-11-21T18:56:18.025Z · answers+comments (88)

Ten people on the inside
Buck · 2025-01-28T16:41:22.990Z · comments (28)

Limitations on Formal Verification for AI Safety
Andrew Dickson · 2024-08-19T23:03:52.706Z · comments (60)

Why I don't believe in the placebo effect
transhumanist_atom_understander · 2024-06-10T02:37:07.776Z · comments (22)

[link] Simple probes can catch sleeper agents
Monte M (montemac) · 2024-04-23T21:10:47.784Z · comments (21)

[question] How Much Are LLMs Actually Boosting Real-World Programmer Productivity?
Thane Ruthenis · 2025-03-04T16:23:39.296Z · answers+comments (51)

OpenAI #12: Battle of the Board Redux
Zvi · 2025-03-31T15:50:02.156Z · comments (0)

Parasites (not a metaphor)
lemonhope (lcmgcd) · 2024-08-08T20:07:13.593Z · comments (19)

[link] "AI achieves silver-medal standard solving International Mathematical Olympiad problems"
gjm · 2024-07-25T15:58:57.638Z · comments (38)

A Dozen Ways to Get More Dakka
Davidmanheim · 2024-04-08T04:45:19.427Z · comments (11)

[link] Training on Documents About Reward Hacking Induces Reward Hacking
evhub · 2025-01-21T21:32:24.691Z · comments (14)

Circuits in Superposition: Compressing many small neural networks into one
Lucius Bushnaq (Lblack) · 2024-10-14T13:06:14.596Z · comments (9)

Tell me about yourself: LLMs are aware of their learned behaviors
Martín Soto (martinsq) · 2025-01-22T00:47:15.023Z · comments (5)

[link] "Can AI Scaling Continue Through 2030?", Epoch AI (yes)
gwern · 2024-08-24T01:40:32.929Z · comments (4)

Some articles in “International Security” that I enjoyed
Buck · 2025-01-31T16:23:27.061Z · comments (10)

The Paris AI Anti-Safety Summit
Zvi · 2025-02-12T14:00:07.383Z · comments (21)

Building AI Research Fleets
Ben Goldhaber (bgold) · 2025-01-12T18:23:09.682Z · comments (11)

How I started believing religion might actually matter for rationality and moral philosophy
zhukeepa · 2024-08-23T17:40:47.341Z · comments (41)

Near-mode thinking on AI
Olli Järviniemi (jarviniemi) · 2024-08-04T20:47:28.085Z · comments (9)

Human takeover might be worse than AI takeover
Tom Davidson (tom-davidson-1) · 2025-01-10T16:53:27.043Z · comments (54)

Anthropic, and taking "technical philosophy" more seriously
Raemon · 2025-03-13T01:48:54.184Z · comments (29)

The Pearly Gates
lsusr · 2024-05-30T04:01:14.198Z · comments (6)

[link] Parkinson's Law and the Ideology of Statistics
Benquo · 2025-01-04T15:49:21.247Z · comments (7)

Pantheon Interface
NicholasKees (nick_kees) · 2024-07-08T19:03:51.681Z · comments (22)

The Pando Problem: Rethinking AI Individuality
Jan_Kulveit · 2025-03-28T21:03:28.374Z · comments (11)

← previous page (newer posts) · next page (older posts) →

Archive

Recent comments

stephen-fowler on Stephen Fowler's Shortform

Thinking of trying the latest Gemini model? Be aware that it is almost impossible to disable the "Gemini in Docs" and "Gemini in Gmail" services once you have purchased a Google One AI Premium plan.

knight-lee on A collection of approaches to confronting doom, and my thoughts on them

Life is an insane gift and death is merely its absence.

An argument for afterlife

If believing in doom is too painful, I have a religion to sell to you, and I might be able to convince you of an afterlife (for you and those you love).

My afterlife argument starts with a thought experiment. Suppose a teleportation machine destroyed you, but created an identical copy of you somewhere else. Would that copy be you? Should you anticipate the experiences of that new copy of you? I think most people would say yes. After all, your brain state continues to exist in the new copy.

Now suppose the teleportation machine doesn't create a copy of you right now, but a copy of you from 1 second ago. Would you still anticipate the experiences of that new copy? I think most people will still say yes. What's wrong with one second ago?

But what if it's a copy of you from one year ago? Or a copy of you when you were a baby? At some point, the new copy will deviate from you so much that it won't be you anymore, and you should not anticipate his future experiences, but anticipate death, the completely cessation of any experiences.

The fuzzy transition

For me, it starts to feel fuzzy, in between life and death, if the new copy deviates from me 10 years ago. I don't know the exactly time period which feels the fuzziest to you, but try to imagine the time period where the copy of you is partially similar to you, and where you half anticipate experiencing his experiences.

Doesn't that feel weird? "This person would be me, but only kind of me. If he has a happy life, I would kind of anticipate me having a happy life as him, but I would also kind of anticipate that's just someone else having a happy life, and I meanwhile will be destroyed by the teleportation machine and experience nothing."

So. What do you anticipate seeing after you walk into the teleportation machine? Dark nothingness? Or life from that earlier time period, with your memories erased?

It's fuzzy.

Looking for an objective answer

Given this fuzziness, you decide that before you walk into the machine, you will consult Reason to see if she will give you an objective answer for whether you will keep living, or become nothing.

But Reason is completely silent, and says not a word. Given the hypothesis where you keep existing, and keep experiencing life and all its joys as this new person, the configuration of atoms in the universe is exactly the same as the hypothesis where you cease to exist, and experience pure nothingness. The two hypotheses make the exact same predictions about the world, and Reason tells you that they are in fact the same hypothesis.

Reason might further tell you, that there is no such thing as "you-ness." It is a meaningless attribute which exists only in your map and not the territory. Whether an entity has the attribute of "being you," does not affect its behaviour in any way.

Whether an entity "is you," only affects what experiences you anticipate. But there is no objectively correct answer for "what experience you should anticipate." ...which is insane if you think about it!

Anticipating experiences

After you absorb the shocking revelation and admit there is no objectively correct answer for "what experience you should anticipate," Reason lets you observe the old Hermit of Immortality. The Hermit of Immortality lives in a cabin in the woods, and has never seen another soul. Every 100 years, he forgets all his memories, and gets a random personality change. The only way to recall his past, is to read his journal about his past life.

Reason tells you that his next transition is about to happen. You watch the Hermit grumble while writing on his journal. "Annoyingly, the time to forget my memories is soon approaching. It is a major annoyance, and my journal isn't very organized this time, so after I forget my memories I will have a hard time studying it. Oh well. I'll eventually figure it out, and my life will eventually get simple and happy again after this brief confusing time."

The Hermit walks to a designated square outside his cabin, and you watch in horror as a massive box falls down from the sky and crushes him. A door opens on the side of a box, and a young man walks out.

Reason tells you that you may think the Hermit dies, while the Hermit thinks he merely forgets everything and gets a random personality change. But there is no objective law of nature to settle the dispute and prove who is right. The anticipation of experiences is a purely subjective matter.

Your choice

Reason tells you that it is completely your choice whether you anticipate pure nothingness after you die, or whether you anticipate someone else's experiences just like the Hermit. The anticipation of experiences exists only in your map, not the territory. It is not even a belief which can be right or wrong, but a belief about belief, (or something akin to that).

Reason asks you, what do you choose?

You tell Reason, "I would rather choose nothingness, than to anticipate existence without my family who I love so much!"

Well, it seems you see them as a fundamental part of you. But why not anticipate your whole family, becoming some other whole family? That too, is allowed.

But don't get too greedy. If you try to anticipate the experiences of the very happiest people, your intuition will find it less credible, and you will actually anticipate very little. Try to anticipate something a little bit more average.

Fin

What do you think about my pseudoreligion? :)

cubefox on xpostah's Shortform

Yeah. I proposed a while ago that all the AI content was becoming so dominant that it should be hived off to the Alignment Forum while LessWrong is for all the rest. This was rejected.

samuelshadrach on xpostah's Shortform

Yes but then it becomes a forum within a forum kinda thing. You need a critical mass of users who all agree to filter out the AI tag, and not have to preface their every post with "I dont buy your short timelines worldview, I am here to discuss something different".

Building critical mass is difficult unless the forum is conducive to it. There's is ultimately only one upvote button and one front-page so the forum will get taken over by the top few topics that its members are paying attention to.

I don't think there's anything wrong with a forum that's mostly focussed on AI xrisk and transhumanist stuff. Better to do one thing well than half ass ten things. But it also means I may need to go elsewhere.

tslarm on NormanPerlmutter's Shortform

The Krome thing is all rumor

I don’t have evidence against

If the truth is hard to determine, I think that in itself is very worrying. When you have vulnerable people imprisoned and credible fears that they are being mistreated, any response from those in power other than transparency is a bad sign. Giving them the benefit of the doubt as long as they can prevent definitive evidence from coming out is bad epistemics and IMO even worse politics (not in a party-political sense; just in a 'how to disincentivise human rights abuses' sense).

drake-morrison on A Slow Guide to Confronting Doom, v1

This is my favorite guide to confronting doom yet

vladimir_nesov on Milan W's Shortform

A power seeker is ambitious without an ambition, which is not an implication of being agentic.

andrew-sauer on What is Evil about creating House Elves?

Now you can!

nosignalnonoise on How much progress actually happens in theoretical physics?

Take a fixed number of humans with a fixed intelligence (both average and outliers) then let mathematics advance. It will advance to the point that there is a vanishingly small number of people who can even understand the state of the art

This ignores the possibility of advances in the teaching of math (or physics, or any other discipline). If improved teaching methods lower the level of intelligence required to reach a given level of knowledge, then a field can advance considerably.

Not to mention that the human population has been growing, and average intelligence has been increasing.

Finally, there's specialization. It doesn't take much intelligence to know everything that was known about genetics when Darwin was alive, but probably nobody is smart enough to know everything that was known about it in 2000. But there have still been make advances since then thanks to people specialized in subfields like DNA sequencing.

o-o on O O's Shortform

The costs of capex go way up. It costs a lot more to build datacenters. It will cost a lot more to buy GPUs. It might cost less to buy energy? Lenders will be in poorer shape. AI companies will lose funding. I think it's already quite tenuous, given how little moat AI companies have. Costs are exploding and pretraining scaling seems too diminishing to be worth it. It's also not clear how AI labs will solve the reliability issue (at least to investors).

I also expect Taiwan to start ignoring export controls if our obscenely high tariffs on them remain.