LessWrong 2.0 Reader

[link] Managing Emotional Potential Energy
adamShimi · 2024-07-10T18:20:45.640Z · comments (4)

[LDSL#2] Latent variable models, network models, and linear diffusion of sparse lognormals
tailcalled · 2024-08-09T19:57:56.122Z · comments (0)

GPT-3.5 judges can supervise GPT-4o debaters in capability asymmetric debates
Charlie George (charlie-george) · 2024-08-27T20:44:08.683Z · comments (7)

The Garden of Eden
Alexander Turok · 2024-07-22T16:07:42.509Z · comments (2)

Whirlwind Tour of Chain of Thought Literature Relevant to Automating Alignment Research.
sevdeawesome · 2024-07-01T05:50:49.498Z · comments (0)

Would you benefit from, or object to, a page with LW users' reacts?
Raemon · 2024-08-20T16:35:47.568Z · comments (6)

Trying to be rational for the wrong reasons
Viliam · 2024-08-20T16:18:06.385Z · comments (8)

Model evals for dangerous capabilities
Zach Stein-Perlman · 2024-09-23T11:00:00.866Z · comments (4)

[link] My 5-step program for losing weight
Nikita Sokolsky (nikita-sokolsky) · 2024-06-30T01:05:40.408Z · comments (20)

[link] The Tech Industry is the Biggest Blocker to Meaningful AI Safety Regulations
garrison · 2024-08-16T19:37:28.416Z · comments (1)

Can We Predict Persuasiveness Better Than Anthropic?
Lennart Finke (l-f) · 2024-08-04T14:05:33.668Z · comments (5)

[link] Day Zero Antivirals for Future Pandemics
Niko_McCarty (niko-2) · 2024-08-26T15:18:33.858Z · comments (2)

Monthly Roundup #21: August 2024
Zvi · 2024-08-20T00:20:08.178Z · comments (6)

[LDSL#3] Information-orientation is in tension with magnitude-orientation
tailcalled · 2024-08-10T21:58:27.659Z · comments (0)

[link] Hyperpolation
Gunnar_Zarncke · 2024-09-15T21:37:00.002Z · comments (4)

[link] ML Safety Research Advice - GabeM
Gabe M (gabe-mukobi) · 2024-07-23T01:45:42.288Z · comments (2)

August 2024 Time Tracking
jefftk (jkaufman) · 2024-08-24T13:50:04.676Z · comments (0)

[link] on Science Beakers and DDT
bhauth · 2024-09-05T03:21:21.382Z · comments (12)

[link] Profit and Value
kwang · 2024-07-17T18:06:57.048Z · comments (3)

AXRP Episode 35 - Peter Hase on LLM Beliefs and Easy-to-Hard Generalization
DanielFilan · 2024-08-24T22:30:02.039Z · comments (0)

[link] An ML paper on data stealing provides a construction for "gradient hacking"
David Scott Krueger (formerly: capybaralet) (capybaralet) · 2024-07-30T21:44:37.310Z · comments (1)

Consider attending the AI Security Forum '24, a 1-day pre-DEFCON event
Charlie Rogers-Smith (charlie.rs) · 2024-07-12T23:01:46.370Z · comments (0)

Deception and Jailbreak Sequence: 1. Iterative Refinement Stages of Deception in LLMs
Winnie Yang (winnie-yang) · 2024-08-22T07:32:07.600Z · comments (0)

[link] To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning
Bogdan Ionut Cirstea (bogdan-ionut-cirstea) · 2024-09-19T16:13:55.835Z · comments (1)

Superintelligence Can't Solve the Problem of Deciding What You'll Do
Vladimir_Nesov · 2024-09-15T21:03:28.077Z · comments (10)

"The Singularity Is Nearer" by Ray Kurzweil - Review
Lavender (Kevin92) · 2024-07-08T21:32:27.307Z · comments (0)

[LDSL#5] Comparison and magnitude/diminishment
tailcalled · 2024-08-12T18:47:20.546Z · comments (0)

Failure Modes of Teaching AI Safety
Eleni Angelou (ea-1) · 2024-06-25T19:07:46.826Z · comments (0)

Instrumental vs Terminal Desiderata
Max Harms (max-harms) · 2024-06-26T20:57:17.584Z · comments (0)

How Often Does Taking Away Options Help?
niplav · 2024-09-21T21:52:40.822Z · comments (6)

Simon DeDeo on Explore vs Exploit in Science
Elizabeth (pktechgirl) · 2024-09-10T03:40:08.311Z · comments (0)

[link] Podcast: Elizabeth & Austin on "What Manifold was allowed to do"
Austin Chen (austin-chen) · 2024-06-28T22:10:41.607Z · comments (0)

[link] Compression Moves for Prediction
adamShimi · 2024-09-14T17:51:12.004Z · comments (0)

[link] The Great Organism Theory of Evolution
rogersbacon · 2024-08-10T12:26:02.434Z · comments (0)

[link] Podcast: "How the Smart Money teaches trading with Ricki Heicklen" (Patrick McKenzie interviewing)
rossry · 2024-07-11T22:49:06.633Z · comments (2)

[link] Four Randomized Control Trials In Economics
Maxwell Tabarrok (maxwell-tabarrok) · 2024-08-08T15:59:23.250Z · comments (1)

Ransomware Payments Should Require a Sin Tax
Brian Bien (brian-bien) · 2024-07-22T21:16:29.029Z · comments (10)

A necessary Membrane formalism feature
ThomasCederborg · 2024-09-10T21:33:09.508Z · comments (6)

[link] [Linkpost] 'The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery'
Bogdan Ionut Cirstea (bogdan-ionut-cirstea) · 2024-08-15T21:32:59.979Z · comments (1)

[question] Have people given up on iterated distillation and amplification?
Chris_Leong · 2024-07-19T12:23:04.625Z · answers+comments (1)

My decomposition of the alignment problem
Daniel C (harper-owen) · 2024-09-02T00:21:08.359Z · comments (22)

Fully booked - LessWrong Community weekend
jt · 2024-07-16T17:15:51.753Z · comments (2)

[link] [Linkpost] A Case for AI Consciousness
cdkg · 2024-07-06T14:52:21.704Z · comments (2)

Scaling Laws and Likely Limits to AI
Davidmanheim · 2024-08-18T17:19:46.597Z · comments (0)

Looking for Goal Representations in an RL Agent - Update Post
CatGoddess · 2024-08-28T16:42:19.367Z · comments (0)

[question] What should we do about COVID in 2024?
ChristianKl · 2024-08-04T10:57:24.140Z · answers+comments (2)

Tokenized SAEs: Infusing per-token biases.
tdooms · 2024-08-04T09:17:46.755Z · comments (20)

[link] what becoming more secure did for me
Chipmonk · 2024-08-22T17:44:48.525Z · comments (5)

Ten counter-arguments that AI is (not) an existential risk (for now)
Ariel Kwiatkowski (ariel-kwiatkowski) · 2024-08-13T22:35:15.341Z · comments (5)

A Second Wetsuit Summer
jefftk (jkaufman) · 2024-07-13T02:00:05.412Z · comments (2)


Recent comments

auspicious on ASIs will not leave just a little sunlight for Earth

Well said. Though IMO this analogy breaks down since Bill Gates (or Bernard Arnault) giving a random person $77.18 of their wealth is a very different scenario from an ASI taking a solar system from its local inhabitants.

A more direct analogy would be the chances of Bernard Arnault stealing $77.18 from a random person. That doesn't seem very likely!

Of course, many of the other arguments ring true. I'm just dubious of this particular comparison.

saidachmiz on ASIs will not leave just a little sunlight for Earth

Meta: OP and some replies occasionally misspell the example billionaire’s surname as “Arnalt”; it’s actually “Arnault”, with a ‘u’.

sharmake-farah on Alexander Gietelink Oldenziel's Shortform

Re different algorithms, I actually agree with both you and Daniel Murfet that, conditional on non-reversible computers, there are at most 1-3 algorithms to achieve intelligence that can scale arbitrarily large, and I'm closer to 1 than 3 here.

But once reversible computers/superconducting wires are allowed, all bets are off on how many algorithms are allowed, because you can have far, far more computation with far, far less waste heat leaving, and a lot of the design of computers is due to heat requirements.

thomas-kwa on ASIs will not leave just a little sunlight for Earth

I agree but I'm not very optimistic about anything changing. Eliezer is often this caustic when correcting what he perceives as basic errors, and criticism in LW comments is why he stopped writing Sequences posts.

habryka4 on ASIs will not leave just a little sunlight for Earth

Bernald Arnalt has given eight-figure amounts to charity. Someone who reasoned, "Arnalt is so rich, surely he'll spare a little for the less fortunate" would in fact end up making a correct prediction about Bernald Arnalt's behavior!

Just for the sake of concreteness, since having numbers here seems useful: it seems like Bernard Arnault has given around $100M to charity, which is around 0.1% of his net worth. (Spread equally across everyone on Earth, this contribution would come to around one cent per person. I am leaving that here just for illustrative purposes; it's not like he could give any actually substantial amount to everyone if he really wanted to.)

buck on ASIs will not leave just a little sunlight for Earth

I wish the title of this made it clear that the post is arguing that ASIs won't spare humanity because of trade, and isn't saying anything about whether ASIs will want to spare humanity for some other reason. This is confusing because lots of people around here (e.g. me and many other commenters on this post) think that ASIs are likely to not kill all humans for some other reason.

(I think the arguments in this post are an okay defense of "ASI wouldn't spare humanity because of trade" and "ASIs are pretty likely to be scope-sensitively-maximizing enough that it's a big problem for us".)

tailcalled on tailcalled's Shortform

<controversial statement>

This statement had two parts. Part 1:

What if objectionists had a correct thermodynamics-style heuristic that implied superintelligence/RSI is impossible, but which could not answer the question of where exactly it failed? Then the failure of objectionists doesn't mean they were wrong.
We have to be willing to investigate the new evidence as it arrives, perform root cause analysis on why A but not B happened, and use this to update our models.
And the evidence I've gotten since then suggests something like "it is impossible to do something without assistance from a higher power"/"greater things can cause lesser things but not vice versa", as a sort of generalization of the laws of thermodynamics.
If appropriate thought had been applied by a knowledgeable person back in 2004, maybe they could have taken this model and realized that nanotech violates this ordering constraint while AlphaProteo does not. Either way, we have the relevant info now.

And part 2:

The particular way the objectionists failed was in that they didn't give a concrete prediction that matched the way stuff played out.

Part 2 is what Eliezer said was false, but it's not really central to my point (hence why I didn't write much about it in the original thread), and so it is self-sabotaging of Eliezer to zoom into this rather than the actually informative point.

quetzal_rainbow on tailcalled's Shortform

To be clear, I mean "your communication in this particular thread".

Pattern:

<controversial statement>

<this statement is false>

<controversial statement>

<this statement is false>

<mix of "this is trivially true because" and "here is my blogpost with esoteric terminology">

The following responses from EY are more in the genre of "I ain't reading this", because he is using you as an example for other readers more than talking directly to you, with a block following.

kaj_sotala on [Intuitive self-models] 1. Preliminaries

On the topic of bistable perception, this is one of my favorite examples:

(Animated version)

davekasten on davekasten's Shortform

I suspect this won't get published until November at the earliest, but I am already delightfully pleased with this bit:

Canada geese fly overhead, honking. Your inner northeast Ohioan notices that you are confused; it’s the wrong season for them to migrate this far south, and they’re flying westwards, anyways.

A quick Google discovers that some Canada geese have now established themselves non-migratorily in the Bay Area:
"The Migratory Bird Treaty Act of 1918 banned hunting or the taking of eggs without a permit. These protections, combined with an increase in desirable real estate—parks, golf course and the like—spurred a dramatic turnaround for the species. Canada geese began breeding in the Bay Area—the southern end of their range – in the late 1950s."
You nod, approvingly; this clearly is another part of the East Bay’s well-known, long-term philanthropic commitment to mitigating Acher-Risks.