LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

← previous page (newer posts) · next page (older posts) →

Alignment ideas
qbolec · 2025-01-18T12:43:49.384Z · comments (1)

The Clueless Sniper and the Principle of Indifference
Jim Buhler (jim-buhler) · 2025-01-27T11:52:57.978Z · comments (26)

[link] LLMs Do Not Think Step-by-step In Implicit Reasoning
Bogdan Ionut Cirstea (bogdan-ionut-cirstea) · 2024-11-28T09:16:57.463Z · comments (0)

Fundamental Uncertainty: Chapter 9 - How do we live with uncertainty?
Gordon Seidoh Worley (gworley) · 2024-11-07T18:15:45.049Z · comments (2)

Launching Applications for the Global AI Safety Fellowship 2025!
Aditya_SK (team-ai-safety) · 2024-11-30T14:02:16.537Z · comments (5)

Seasonal Patterns in BIDA's Attendance
jefftk (jkaufman) · 2025-02-02T02:40:03.768Z · comments (0)

[question] Journalism student looking for sources
pinkerton · 2025-02-04T18:58:49.740Z · answers+comments (3)

7. Iterate the Game: Racing Where?
Allison Duettmann (allison-duettmann) · 2025-01-02T19:06:22.165Z · comments (0)

[question] How counterfactual are logical counterfactuals?
Donald Hobson (donald-hobson) · 2024-12-15T21:16:40.515Z · answers+comments (10)

New Foresight Longevity Bio & Molecular Nano Grants Program
Allison Duettmann (allison-duettmann) · 2025-02-04T00:28:30.147Z · comments (0)

[link] Picking favourites is hard
dkl9 · 2024-12-04T20:46:47.470Z · comments (3)

[link] How to Do a PhD (in AI Safety)
Lewis Hammond (lewis-hammond-1) · 2025-01-05T16:57:35.409Z · comments (0)

[link] Uncontrollable: A Surprisingly Good Introduction to AI Risk
PeterMcCluskey · 2025-01-24T04:30:37.499Z · comments (0)

Contra Dances Getting Shorter and Earlier
jefftk (jkaufman) · 2025-01-23T23:30:03.595Z · comments (0)

What does success look like?
Raymond D · 2025-01-23T17:48:35.618Z · comments (0)

Rethink Wellbeing’s Year 2 Update: Foster Sustainable High Performance for Ambitious Altruists
Inga G. (inga-g) · 2024-12-08T14:32:39.902Z · comments (1)

[link] Forecast With GiveWell
ChristianWilliams · 2024-12-11T17:52:32.293Z · comments (0)

Rethinking Laplace's Rule of Succession
Cleo Nardo (strawberry calm) · 2024-11-22T18:46:25.156Z · comments (5)

My Mental Model of AI Optimist Opinions
tailcalled · 2025-01-29T18:44:36.485Z · comments (2)

The Three Warnings of the Zentradi
Trevor Hill-Hand (Jadael) · 2024-11-21T20:28:45.567Z · comments (1)

[question] Using hex to get murder advice from GPT-4o
Laurence Freeman (laurence-freeman) · 2024-11-13T18:30:23.475Z · answers+comments (5)

Favorite colors of some LLMs.
weightt an (weightt-an) · 2024-12-31T21:22:58.494Z · comments (3)

[link] Experts' AI timelines are longer than you have been told?
Vasco Grilo (vascoamaralgrilo) · 2025-01-16T18:03:18.958Z · comments (4)

[link] Proposing the Conditional AI Safety Treaty (linkpost TIME)
otto.barten (otto-barten) · 2024-11-15T13:59:01.050Z · comments (8)

Fundamental Uncertainty: Epilogue
Gordon Seidoh Worley (gworley) · 2024-11-16T00:57:48.823Z · comments (0)

[question] Is "hidden complexity of wishes problem" solved?
Roman Malov · 2025-01-05T22:59:30.911Z · answers+comments (4)

Apply to be a TA for TARA
yanni kyriacos (yanni) · 2024-12-20T02:25:03.514Z · comments (0)

[link] Bridgewater x Metaculus Forecasting Contest Goes Global — Feb 3, $25k, Opportunities
ChristianWilliams · 2025-01-07T21:40:30.899Z · comments (0)

Misfortune and Many Worlds
Jonah Wilberg (jrwilb@googlemail.com) · 2024-12-08T20:25:12.109Z · comments (4)

[link] Predation as Payment for Criticism
Benquo · 2025-01-30T01:06:27.591Z · comments (6)

Why We Wouldn't Build Aligned AI Even If We Could
Snowyiu · 2024-11-16T20:19:59.324Z · comments (7)

[question] What's the best metric for measuring quality of life?
ChristianKl · 2024-12-27T14:29:30.813Z · answers+comments (5)

[link] o1 tried to avoid being shut down
Raelifin · 2024-12-05T19:52:03.620Z · comments (5)

[link] When do experts think human-level AI will be created?
Vishakha (vishakha-agrawal) · 2024-12-30T06:20:33.158Z · comments (0)

[link] Bird's eye view: An interactive representation to see large collection of text "from above".
Alexandre Variengien (alexandre-variengien) · 2024-12-21T00:15:02.239Z · comments (4)

Low Temperature Solomonoff Induction
dil-leik-og (samuel-buteau) · 2024-12-06T18:55:08.948Z · comments (4)

[link] Predictions of Near-Term Societal Changes Due to Artificial Intelligence
Annapurna (jorge-velez) · 2024-12-29T14:53:57.176Z · comments (0)

Mini PAPR Review
jefftk (jkaufman) · 2024-12-12T19:10:01.692Z · comments (0)

[link] What are the differences between AGI, transformative AI, and superintelligence?
Vishakha (vishakha-agrawal) · 2025-01-23T10:03:31.886Z · comments (3)

Outlaw Code
scarcegreengrass · 2025-01-30T23:41:57.239Z · comments (1)

[link] LLMs for language learning
Benquo · 2025-01-15T14:08:54.620Z · comments (2)

Proactive 'If-Then' Safety Cases
Nathan Helm-Burger (nathan-helm-burger) · 2024-11-18T21:16:37.237Z · comments (0)

Expected Utility, Geometric Utility, and Other Equivalent Representations
StrivingForLegibility · 2024-11-20T23:28:21.826Z · comments (0)

[question] Has Someone Checked The Cold-Water-In-Left-Ear Thing?
Maloew (maloew-valenar) · 2024-12-28T20:15:35.951Z · answers+comments (0)

Americans are fat and sick—and it’s their fault…right?
Declan Molony (declan-molony) · 2024-11-19T06:41:36.648Z · comments (6)

The Human Alignment Problem for AIs
rife (edgar-muniz) · 2025-01-22T04:06:10.872Z · comments (5)

[link] Roots of Progress is hiring an event manager
jasoncrawford · 2024-12-03T20:46:42.929Z · comments (0)

AI for Resolving Forecasting Questions: An Early Exploration
ozziegooen · 2025-01-16T21:41:45.968Z · comments (2)

[link] Training Data Attribution: Examining Its Adoption & Use Cases
Deric Cheng (deric-cheng) · 2025-01-22T15:41:19.744Z · comments (0)

[link] Chemical Turing Machines
Yudhister Kumar (randomwalks) · 2024-12-03T05:26:25.950Z · comments (2)

← previous page (newer posts) · next page (older posts) →

Archive

Recent comments

rhollerith_dot_com on Mikhail Samin's Shortform

Out of curiosity, would you agree with this being the most plausible path, even if you disagree with the rest of my argument?

The most plausible-to-me path is one I cannot imagine right now :) but if forced to choose, I'd choose AI-powered technologies for human persuasion or maybe some future technology that is really good at determining people's true loyalties. I tend to think it is enough to take over the territory where all the leading-edge semiconductor fabs are located: taking over the whole world seems unnecessary to stop the AI project.

viliam on Journalism student looking for sources

https://intelligence.org/team/ - perhaps one of these will contact you with the right person

By the way, do not be surprised if people will hesitate to talk to you. The word "journalist" doesn't exactly bring happy memories in this community. A lot of writing about tech is just clickbait looking for a scandal (and making up something if they don't find one). A famous blogger in our community was harassed by journalists and lost his career over it (1, 2).

Here is someone who isn't shy talking about AI safety on camera: Robert Miles.

jbash on artifex0's Shortform

If you're really concerned, then just move to california! Its much easier than moving abroad.

I lived in California long enough ago to remember when getting queer-bashed was a reasonable concern for a fair number of people, even in, say, Oakland. It didn't happen daily, but it happened relatively often. If you were in the "out" LGBT community, I think you probably knew somebody who'd been bashed. Politics influence that kind of thing even if it's not legal.

... and in the legal arena, there's a whole lot of pressure building up on that state and local resistance. So far it's mostly money-based pressure, but within a few years, I could easily see a SCOTUS decision that said a state had to, say, extradite somebody accused of "abetting an abortion" in another state.

War in the continental US? No, I agree that's unlikely enough not to worry about.

Civil unrest, followed by violent crackdowns on civil unrest, followed by more violent civil unrest, followed by factional riots, on the other hand...

anthonyc on National Security Is Not International Security: A Critique of AGI Realism

I'd say I agree with just about all of that, and I'm glad to see it laid out so clearly!

I just also wouldn't be hugely surprised if it turns out something like designing and building remote-controllable self-replicating globally-deployable nanotech (as one example) is in some sense fundamentally "easy" for even an early ASI/modestly superhuman AGI. Say that's the case, and we build a few for the ASI, and then we distribute them across the world, in a matter of weeks. They do what controlled self-replicating nanobots do. Then after a few months the ASI already has an off switch or sleep mode button buried in everyone's brain. My guess is that then none of those hard steps of a war with China come into play.

To be clear, I don't think this story is likely. But in a broad sense, I am generally of the opinion that most people greatly overestimate how much new data we need to answer new questions or create (some kinds of) new things, and underestimate what can be done with clever use of existing data, even among humans, let alone as we approach the limits of cleverness.

jbash on artifex0's Shortform

I think that what you describe as being 2 to 15 percent probable sounds more extreme than what the original post described as being 5 percent probable. You can have "significant erosion" of some groups' rights without leaving the country being the only reasonable option, especially if you're not in those groups. It depends on what you're trying to achieve by leaving, I guess.

Although if I were a trans person in the US right now, especially on medication, I'd be making, if not necessarily immediately executing, some detailed escape plans that could be executed on short notice.

raemon on Thread for Sense-Making on Recent Murders and How to Sanely Respond

Presumably the "someone dies" means like, within a few years, and not because of x-risk or a major pandemic.

martin-randall on Self-Other Overlap: A Neglected Approach to AI Alignment

Did you figure out where it's stupid?

ozziegooen on ozziegooen's Shortform

I mostly want to point out that many disempowerment/dystopia failure scenarios don't require a step-change from AI, just an acceleration of current trends.

Do you think that the world is getting worse each year?

My rough take is that humans, especially rich humans, are generally more and more successful.

I'm sure there are ways for current trends to lead to catastrophe - line some trends dramatically increasing and others decreasing, but that seems like it would require a lengthy and precise argument.

purple-fire on What working on AI safety taught me about B2B SaaS sales

Sorry, I can elaborate better on the situation. The big tech companies know that they can pay way more than smaller competitors, so they do. But then that group of megacorp tech (Google, Amazon, Meta, etc.) collude with each other to prevent runaway race dynamics. This is how they're able to optimize their costs with the constraint of salaries being high enough to stifle competition. Here, I was just offering evidence for my claim that big tech is a monopsonistic cartel in the SWE labor market, it isn't really evidence one way or another for the claims I make in the original post.

viliam on Thread for Sense-Making on Recent Murders and How to Sanely Respond

Oh. I somehow missed/forgot that.

I guess it makes more sense this way. Like, the more transgenders there are in the community, the smaller the fraction of Zizians among them. With the numbers I originally assumed, Ziz's conversion ratio would be shockingly high. Now it makes more sense.

Thank you, this changes my perspective on the situation.