LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

← previous page (newer posts) · next page (older posts) →

[question] How might language influence how an AI "thinks"?
bodry (plosique) · 2024-10-30T17:41:04.460Z · answers+comments (0)

[link] What is Confidence—in Game Theory and Life?
James Stephen Brown (james-brown) · 2024-12-10T23:06:24.072Z · comments (0)

Personal Philosophy
Xor · 2024-10-13T03:01:59.324Z · comments (0)

[question] How do we quantify non-philanthropic contributions from Buffet and Soros?
Philosophistry (philip-dhingra) · 2024-12-20T22:50:32.260Z · answers+comments (0)

[question] How do you decide to phrase predictions you ask of others? (and how do you make your own?)
CstineSublime · 2025-01-10T02:44:26.737Z · answers+comments (0)

[link] Both-Sidesism—When Fair & Balanced Goes Wrong
James Stephen Brown (james-brown) · 2024-11-02T03:04:03.820Z · comments (15)

5. Uphold Voluntarism: Digital Defense
Allison Duettmann (allison-duettmann) · 2025-01-02T19:05:33.963Z · comments (0)

The boat
RomanS · 2024-11-22T12:56:45.050Z · comments (0)

On the Practical Applications of Interpretability
Nick Jiang (nick-jiang) · 2024-10-15T17:18:25.280Z · comments (1)

[link] The Polite Coup
Charlie Sanders (charlie-sanders) · 2024-12-04T14:03:36.663Z · comments (0)

[link] AI Safety at the Frontier: Paper Highlights, December '24
gasteigerjo · 2025-01-11T22:54:02.625Z · comments (0)

3. Improve Cooperation: Better Technologies
Allison Duettmann (allison-duettmann) · 2025-01-02T19:03:16.588Z · comments (2)

AI Training Opt-Outs Reinforce Global Power Asymmetries
kushagra (kushagra-tiwari) · 2024-11-30T22:08:06.426Z · comments (0)

Your memory eventually drives confidence in each hypothesis to 1 or 0
Crazy philosopher (commissar Yarrick) · 2024-10-28T09:00:27.084Z · comments (6)

[link] Higher Order Signs, Hallucination and Schizophrenia
Nicolas Villarreal (nicolas-villarreal) · 2024-11-02T16:33:10.574Z · comments (0)

[link] Social Science in its epistemological context
Arturo Macias (arturo-macias) · 2024-12-05T16:12:29.034Z · comments (0)

Don't want Goodhart? — Specify the variables more
YanLyutnev (YanLutnev) · 2024-11-21T22:43:48.362Z · comments (2)

[question] Are Sparse Autoencoders a good idea for AI control?
Gerard Boxo (gerard-boxo) · 2024-12-26T17:34:55.617Z · answers+comments (2)

San Francisco ACX Meetup “First Saturday”
Nate Sternberg (nate-sternberg) · 2024-10-28T05:05:36.757Z · comments (0)

Should you increase AI alignment funding, or increase AI regulation?
Knight Lee (Max Lee) · 2024-11-26T09:17:01.809Z · comments (1)

How to Teach Your Brain to Hate Procrastination
10xyz (10xyz-coder) · 2024-10-21T20:12:40.809Z · comments (0)

[link] Solving Newcomb's Paradox In Real Life
Alice Wanderland (alice-wanderland) · 2024-12-11T19:48:44.486Z · comments (0)

[link] Technical Risks of (Lethal) Autonomous Weapons Systems
Heramb · 2024-10-23T20:41:13.238Z · comments (0)

The Technist Reformation: A Discussion with o1 About The Coming Economic Event Horizon
Yuli_Ban · 2024-12-11T02:34:22.329Z · comments (1)

[question] Have we seen any "ReLU instead of sigmoid-type improvements" recently
KvmanThinking (avery-liu) · 2024-11-23T03:51:52.984Z · answers+comments (4)

Not Just For Therapy Chatbots: The Case For Compassion In AI Moral Alignment Research
kenneth_diao · 2024-09-30T18:37:20.409Z · comments (0)

Which AI Safety Benchmark Do We Need Most in 2025?
Loïc Cabannes (loic-cabannes) · 2024-11-17T23:50:56.337Z · comments (2)

Truth Terminal: A reconstruction of events
crvr.fr (crdevio) · 2024-11-17T23:51:21.279Z · comments (1)

Morality as Cooperation Part II: Theory and Experiment
DeLesley Hutchins (delesley-hutchins) · 2024-12-05T09:04:12.167Z · comments (0)

I Recommend More Training Rationales
Gianluca Calcagni (gianluca-calcagni) · 2024-12-31T14:06:44.007Z · comments (0)

Gothenburg LW/ACX meetup
Stefan (stefan-1) · 2024-10-29T20:40:22.754Z · comments (0)

[link] The Golden Opportunity for American AI
Annapurna (jorge-velez) · 2025-01-04T10:26:05.430Z · comments (8)

Singular Learning Theory for Dummies
Rahul Chand (rahul-chand) · 2024-10-15T21:13:55.842Z · comments (0)

[question] Poll: what’s your impression of altruism?
David Gross (David_Gross) · 2024-11-09T20:28:15.418Z · answers+comments (4)

aspirational leadership
dhruvmethi · 2024-11-20T16:07:43.507Z · comments (0)

Advice on Communicating Concisely
EvolutionByDesign (bioluminescent-darkness) · 2024-10-20T16:45:41.053Z · comments (9)

Ambiguities or the issues we face with AI in medicine
Thehumanproject.ai · 2024-10-20T16:45:31.341Z · comments (0)

Introducing Avatarism: A Rational Framework for Building actual Heaven
ratiba ro (ratiba-ro) · 2024-12-15T17:17:45.440Z · comments (2)

The 'Road Not Taken' in the Multiverse
Jonah Wilberg (jrwilb@googlemail.com) · 2024-11-29T19:01:51.775Z · comments (0)

How Your Physiology Affects the Mind's Projection Fallacy
YanLyutnev (YanLutnev) · 2024-12-14T21:10:23.240Z · comments (0)

Reminder: AI Safety is Also a Behavioral Economics Problem
zoop · 2024-12-20T01:40:53.847Z · comments (0)

Towards a Unified Interpretability of Artificial and Biological Neural Networks
jan_bauer · 2024-12-21T23:10:45.842Z · comments (0)

[link] Expevolu, Part II: Buying land to create countries
Fernando · 2025-01-09T21:11:11.780Z · comments (0)

[link] The Economics & Practicality of Starting Mars Colonization
Zero Contradictions · 2024-12-26T10:56:26.019Z · comments (1)

[question] Most capable publicly available agents?
Gabe · 2024-09-30T00:04:24.480Z · answers+comments (0)

Can AI Quantity beat AI Quality?
Gianluca Calcagni (gianluca-calcagni) · 2024-10-02T15:21:45.711Z · comments (0)

A Meritocracy of Taste
Daniele De Nuntiis (daniele-de-nuntiis) · 2024-11-28T09:10:10.598Z · comments (11)

Launching Third Opinion: Anonymous Expert Consultation for AI Professionals
karl (oaisis) · 2024-12-19T19:06:15.355Z · comments (0)

A Systematic Approach to AI Risk Analysis Through Cognitive Capabilities
Tom DAVID (tom-david) · 2025-01-09T00:18:04.608Z · comments (0)

[link] Some Preliminary Notes on the Promise of a Wisdom Explosion
Chris_Leong · 2024-10-31T09:21:11.623Z · comments (0)

← previous page (newer posts) · next page (older posts) →

Archive

Recent comments

ektimo on ektimo's Shortform

Prompt: write a micro play that is both disturbing and comforting
--

Title: "The Silly Child"

Scene: A mother is putting to bed her six-year-old child

CHILD: Mommy, how many Boltzmann brains are there?

MOTHER: As many as are possible.

CHILD (smiling): Can we make another one?

MOTHER (smiling): Sure. And while we're at it, let's delete the number 374? I've never liked that one.

CHILD (excited): Oh! And let's make a new Fischer-Griess group element too! Can we do that Mommy?

MOTHER (bops nose) That's enough stalling. You need to get your sleep. Sweet dreams, little one. (kisses forehead)

End

sharmake-farah on On Dwarkesh Patel’s 4th Podcast With Tyler Cowen

Einstein was not an experimentalist, yet was perfectly capable of physics; his successors have largely not touched his unfinished work, and not for lack of data.

While it is interesting at first glance, some caveats are called for here.

One, Einstein's achievements were sort of overrated, see these comments for details:

https://www.lesswrong.com/posts/GSBCw94DsxLgDat6r/interpreting-yudkowsky-on-deep-vs-shallow-knowledge#6HPjxMvTnP9JeibXZ [LW(p) · GW(p)]

https://www.lesswrong.com/posts/GSBCw94DsxLgDat6r/interpreting-yudkowsky-on-deep-vs-shallow-knowledge#icmCewLmXnxgtmANP [LW(p) · GW(p)]

Two, the EPR paradox is resolvable in modern physics by allowing non-locality in entanglement, but having a no-communication theorem that prevents exploiting it to break special relativity.

jbash on In Defense of a Butlerian Jihad

Societies aren't the issue; they're mindless aggregates that don't experience anything and don't actually even have desires in anything like the way a human, or or even an animal or an AI, has desires. Individuals are the issue. Do individuals get to choose which of these societies they live in?

jbash on In Defense of a Butlerian Jihad

I’m pretty sure he doesn’t buy the Christian Paradise of "having no job, only leisure is good actually" either.

This (a) doesn't have anything in particular to do with Christianity, (b) has been the most widely held view among people in general since forever, and (c) seems obviously correct. If you want to rely on the contrary supposition, I'm afraid you're going to have to argue for it.

You can still have hobbies.

I also kinda notice that there are no meaningful place left for humans in that society.

There's that word "meaningful" that I keep hearing everywhere. I claim it's a meaningless word (or at least that it's being used here in a meaningless sense). Please define it in a succinct, relevant, and unambiguous way.

If you believe that the democratic consensus made mostly of normal people will allow you that [Glorious Transhumanist Future], I have a bridge to sell to you.

The democratic consensus also won't allow a Butlerian Jihad, and I don't think you're claiming that it will.

So apparently nobody arguing for either can claim to represent either the democratic consensus or the only alternative to it. What's your point?

If you don’t have a plan then don’t build AGI, pretty please ?

I agree there.

This is obviously wrong. I won’t argue for why it is wrong — too long post, and so on.

I'm actually not sure what you're arguing for or against in this whole section.

Obviously you're not going to "solve human values". Equally obviously, any future, AI or non-AI, is going to be better for some people's values than others. Some values have always won, and some values have always lost, and that will not change. What that has to do with justice destroying the world, I have absolutely no clue.

I think you're trying to take the view that any major change in the "human condition", or in what's "human", is equivalent to the destruction of the world, no matter what benefits it may have. This is obviously wrong. I won't argue for why it's wrong, but now that I've said those magic words, you're bound to accept all my conclusions.

I still can’t believe some of you would sided with the super-happies !

So you're siding with the guy who killed 15 billion non-consenting people because he personally couldn't handle the idea of giving up suffering?

Wrong answers will disempower humans forever at best, reducing them to passive leafs in the wind.

Just like they are now and always have been. The Heat Death of the Universe (TM) is gonna eat ya, regardless of what you do.

Slightly wrong answers won’t go as far as that, but will result in the permanent loss of vast chunks of Human Values — the parts we will decide to discard, consciously or not.

Human Values have been changing, for individuals and in the "average", for as long as there've been humans, including being discarded consciously or unconsciously. Mostly in a pretty aimless, drifting way. This is not new and neither AI nor anything else will fundamentally change it. At least not while the "humans" involved are recognizably like the humans we have now... and changing away from that would be a pretty big break in itself, no?

You build your ASI. You have that big Diverse Plural Assembly that is apparently plan A

I haven't actually heard many people suggesting that.

d0themath on Nathan Helm-Burger's Shortform

I think its this

viliam on Could my work, "Beyond HaHa" benefit the LessWrong community?

use of humor as a pedagogical tool in Cardiopulmonary Resuscitation (CPR) courses

I first understood that as resuscitating people by telling them jokes. Like, when you laugh hard enough, your heart starts beating again. :D

*

Yeah, I think it could be interesting. To me, this feels unsurprising -- memory is related to emotions, so you should use emotions while teaching. But negative emotions, such as fear, help people remember, but also discourage them from researching the topic on their own. They help memory, but hurt creativity. Positive emotions should be useful for both remembering and experimenting.

Now the question is which positive emotions. Also, how. I guess people will remember funny things, but can you produce jokes about every important thing you want your students to remember? (If you can, you should totally do an educational YouTube comedy channel.)

tripp-lyons on No, the Polymarket price does not mean we can immediately conclude what the probability of a bird flu pandemic is. We also need to know the interest rate!

I've thought about this before, and the solution I came up with is to denominate the bets in short term treasury bonds instead of dollars.

benito on On Eating the Sun

I am not sure what point you are making with "respect their preferences", I am not proposing one country go to war with other countries to take the sun. For instance, one way it might go down is someone will just offer to buy it from Earth, and the price will be many orders of magnitude more resources than Earth has, so Earth will accept, and replace it with an artificial source of light & heat.

I may be wrong about the estimates of the value of the energy, neither of us have specified how the rest of the stars in the universe will get distributed. For concreteness, I am here imagining something like: the universe is not a whole singleton but made of many separate enclaves that have their own governance and engage in trade with one another, and that Earth is a special one that keeps a lot of its lineage with present-day Earth, and is generally outcompeted by all the others ones that are smarter/faster and primarily run by computational-minds rather than biological ones.

tsvibt on Views on when AGI comes and on strategy to reduce existential risk

But like, I wouldn't be surprised if, say, someone trained something that performed comparably to LLMs on a wide variety of benchmarks, using much less "data"... and then when you look into it, you find that what they were doing was taking activations of the LLMs and training the smaller guy on the activations. And I'll be like, come on, that's not the point; you could just as well have "trained" the smaller guy by copy-pasting the weights from the LLM and claimed "trained with 0 data!!". And you'll be like "but we met your criterion!" and I'll just be like "well whatever, it's obviously not relevant to the point I was making, and if you can't see that then why are we even having this conversation". (Or maybe you wouldn't do that, IDK, but this sort of thing--followed by being accused of "moving the goal posts"--is why this question feels frustrating to answer.)

habryka4 on Nathan Helm-Burger's Shortform

Link?