LessWrong 2.0 Reader

View: New · Old · Top

← previous page (newer posts) · next page (older posts) →

New Bill AB 501 to Prevent OpenAI's Non-profit Conversion
Peter Windberger (vinertising-support) · 2025-03-25T00:41:07.617Z · comments (1)

[link] Does Robust Agency Require a Self?
leebriskCyrano · 2025-03-25T00:25:58.644Z · comments (0)

Takes on Takeoff
atharva · 2025-03-25T00:20:07.915Z · comments (0)

An overview of control measures
ryan_greenblatt · 2025-03-24T23:16:49.400Z · comments (0)

Populectomy.ai
YonatanK (jonathan-kallay) · 2025-03-24T22:06:24.680Z · comments (2)

Policy for LLM Writing on LessWrong
jimrandomh · 2025-03-24T21:41:30.965Z · comments (58)

Analyzing long agent transcripts (Docent)
jsteinhardt · 2025-03-24T20:49:54.472Z · comments (2)

Convergence 2024 Impact Review
David_Kristoffersson · 2025-03-24T20:28:58.422Z · comments (0)

The Best Lecture Series on Every Subject
Rauno Arike (rauno-arike) · 2025-03-24T20:03:14.772Z · comments (1)

[link] Recent AI model progress feels mostly like bullshit
lc · 2025-03-24T19:28:43.450Z · comments (75)

Learning about AI regulation should be easier
mfg (Magnus Gjerde) · 2025-03-24T19:22:33.824Z · comments (0)

Speaker For AIs Soul
Max Abecassis (max@customplay.com) · 2025-03-24T19:20:31.509Z · comments (0)

Advanced AI Systems Will Not Follow Historical Technological Patterns and Will Not Suffer the Misattribution of Productivity Gains
Max Abecassis (max@customplay.com) · 2025-03-24T19:20:31.486Z · comments (0)

AI "Deep Research" Tools Reviewed
sarahconstantin · 2025-03-24T18:40:03.864Z · comments (5)

Notes on countermeasures for exploration hacking (aka sandbagging)
ryan_greenblatt · 2025-03-24T18:39:36.665Z · comments (4)

Subversion Strategy Eval: Can language models statelessly strategize to subvert control protocols?
Alex Mallen (alex-mallen) · 2025-03-24T17:55:59.358Z · comments (0)

Straightforward Steps to Marginally Improve Odds of Whole Brain Emulation
Dom Polsinelli (dom-polsinelli) · 2025-03-24T17:14:38.794Z · comments (20)

From Loops to Klein Bottles: Uncovering Hidden Topology in High Dimensional Data
Gunnar Carlsson (gunnar-carlsson) · 2025-03-24T17:09:32.945Z · comments (0)

[link] Will Jesus Christ return in an election year?
Eric Neyman (UnexpectedValues) · 2025-03-24T16:50:53.019Z · comments (44)

[link] Sentinel's Global Risks Weekly Roundup #12/2025: Famine in Gaza, H7N9 outbreak, US geopolitical leadership weakening.
NunoSempere (Radamantis) · 2025-03-24T16:46:51.490Z · comments (0)

AI, Greed, and the Death of Oversight: When Institutions Ignore Their Own Limits
funnyfranco · 2025-03-24T15:03:16.802Z · comments (0)

[link] Delicious Boy Slop - Boring Diet, Effortless Weightloss
sapphire (deluks917) · 2025-03-24T15:01:58.355Z · comments (8)

Hong Kong ACX Spring Meetup 2025
fbreton · 2025-03-24T14:27:11.854Z · comments (0)

More on Various AI Action Plans
Zvi · 2025-03-24T13:10:05.637Z · comments (0)

Emergent scaling effects on the functional hierarchies within LLMs
Foop · 2025-03-24T13:03:30.930Z · comments (0)

Recommender Alignment for Lock-In Risk
alamerton · 2025-03-24T12:56:46.389Z · comments (0)

Edge Cases in AI Alignment
Florian_Dietz · 2025-03-24T09:27:58.164Z · comments (3)

Towards an understanding of the Chinese AI scene
Mitchell_Porter · 2025-03-24T09:10:19.498Z · comments (0)

Selective modularity: a research agenda
cloud · 2025-03-24T04:12:44.822Z · comments (2)

Pictures for 2024
jefftk (jkaufman) · 2025-03-24T02:40:07.051Z · comments (0)

Notes on handling non-concentrated failures with AI control: high level methods and different regimes
ryan_greenblatt · 2025-03-24T01:00:38.222Z · comments (3)

We need (a lot) more rogue agent honeypots
Ozyrus · 2025-03-23T22:24:52.785Z · comments (11)

What's the word for the amount of expertise that I, an experienced therapy patient and generally educated person, have on psychology topics?
danielechlin · 2025-03-23T17:38:28.881Z · comments (0)

Probability Theory Fundamentals 102: Source of the Sample Space
Ape in the coat · 2025-03-23T17:23:57.790Z · comments (17)

How to mitigate sandbagging
Teun van der Weij (teun-van-der-weij) · 2025-03-23T17:19:07.452Z · comments (0)

Tabula Bio: towards a future free of disease (& looking for collaborators)
mpoon (michael-poon) · 2025-03-23T16:30:15.523Z · comments (14)

Solving willpower seems easier than solving aging
Yair Halberstadt (yair-halberstadt) · 2025-03-23T15:25:40.861Z · comments (28)

[question] Should I fundraise for open source search engine?
samuelshadrach (xpostah) · 2025-03-23T13:04:16.149Z · answers+comments (0)

[link] Privateers Reborn: Cyber Letters of Marque
arealsociety (shane-zabel) · 2025-03-23T03:39:25.990Z · comments (2)

Beware nerfing AI with opinionated human-centric sensors
Haotian (haotian-huang) · 2025-03-23T01:09:16.770Z · comments (0)

Reframing AI Safety as a Neverending Institutional Challenge
scasper · 2025-03-23T00:13:48.614Z · comments (12)

The Dangerous Illusion of AI Deterrence: Why MAIM Isn’t Rational
mc1soft · 2025-03-22T22:55:02.355Z · comments (0)

Dayton, Ohio, ACX Meetup
Lunawarrior · 2025-03-22T19:45:55.510Z · comments (0)

[Replication] Crosscoder-based Stage-Wise Model Diffing
annas (annasoli) · 2025-03-22T18:35:19.003Z · comments (0)

The Principle of Satisfying Foreknowledge
Randall Reams (randall-reams) · 2025-03-22T18:20:27.998Z · comments (0)

[question] Urgency in the ITN framework
Shaïman · 2025-03-22T18:16:07.900Z · answers+comments (2)

Transhumanism and AI: Toward Prosperity or Extinction?
Shaïman · 2025-03-22T18:16:07.868Z · comments (2)

Tied Crosscoders: Explaining Chat Behavior from Base Model
Santiago Aranguri (aranguri) · 2025-03-22T18:07:21.751Z · comments (0)

Dusty Hands and Geo-arbitrage
Tomás B. (Bjartur Tómas) · 2025-03-22T16:05:30.364Z · comments (3)

100+ concrete projects and open problems in evals
Marius Hobbhahn (marius-hobbhahn) · 2025-03-22T15:21:40.970Z · comments (1)

← previous page (newer posts) · next page (older posts) →

Archive

Recent comments

mako-yass on Why do many people who care about AI Safety not clearly endorse PauseAI?

For the US to undertake such a shift, it would help if you could convince them they'd do better in a secret race than an open one. There are indications that this may be possible, and there are indications that it may be impossible.

I'm listening to an Ecosystemics Futures podcast episode, which, to characterize... it's a podcast where the host has to keep asking guests whether the things they're saying are classified or not just in case she has to scrub it. At one point, Lue Elizondo does assert, in the context of talking to a couple of other people who know a lot about government secrets and in the context of talking about situations where excessive secrecy may be doing a lot of harm, quoting Chris Mellon, "We won the cold war against the soviet union not because we were better at keeping secrets, we won the cold war because we knew how to move information and secrets more efficiently across the government than the russians." I can believe the same thing could potentially be said about China too, censorship cultures don't seem to be good for ensuring availability of information, so that might be a useful claim if you ever want to convince the US to undertake this.

Right now, though, Vance has asserted straight out many times that working in the open is where the US's advantage is. That's probably not true at all, working in the open is how you give your advantage away or at least make it ephemeral, but that's the sentiment you're going to be up against over the next four years.

sodium on Apply to MATS 8.0!

Hate to be that person, but is that April 18th deadline AoE/PDT/a secret third thing?

adam-shai on Alexander Gietelink Oldenziel's Shortform

Can a Finite-State Fox Catch a Markov Mouse? for more details

adamzerner on Against podcasts

Hm. On the one hand, I agree that there are distinct things at play here and share the instinct that it'd be appropriate to have different words for these different things. But on the other hand, I'm not sure if the different words should fall under the umbrella of solitude, like "romantic solitude" and "seeing human faces solitude".

I dunno, maybe it should. After all, it seems that in different conceptualizations of solitude, it's about being isolated from something (others' minds, others' physical presence).

Ultimately, I'm trusting Newport here. I think highly of him and know that he's read a lot of relevant literature. At the same time, I still wouldn't argue too confidently that his preferred definition is the most useful one.

adamzerner on Against podcasts

That makes sense. I didn't mean to imply that such an extreme degree of isolation is a net positive. I don't think it is.

adamzerner on Against podcasts

That makes sense. Although I think the larger point I was making still stands: that in reading the book you're primarily consuming someone else's thoughts, just like you would be if the author sat there on the bench lecturing you (it'd be different if it were more of a two-way conversation; I should have clarified that in the post).

I suppose "primarily" isn't true for all readers, for all books. Perhaps some readers go slowly enough where they actually spend more of their time contemplating than they do reading, but I get the sense that that is pretty rare.

adamzerner on Against podcasts

Cool! I have a feeling you'd like a lot of Cal Newport's work like Digital Minimalism and Deep Work.

adamzerner on Against podcasts

When I'm walking around or riding the train, I want to be able to hear what's going on around me.

That makes sense about walking around, but why do you want to hear what's going on around you when you're riding the train?

adamzerner on Against podcasts

Yeah, that all makes sense. I think solitude probably exists along a spectrum, where in listening to music maybe you have 8/10 solitude instead of 10/10 but in watching a TV show you only get 2/10. The relevant question is probably "to what extent are the outputs of other minds influencing your thoughts".

Actually, now that I think about it, I wonder why we're focusing on the outputs of other minds. What about other things that influence your thoughts? Like, I don't know, bumble bees flying around you? I'm afraid of bumble bees so I know I'd have trouble focusing on my own thoughts in that scenario.

That said, I'm sure that outputs of other minds are probably a large majority of what is intrusive and prevents you from focusing on your own thoughts. But it still seems to me like the thing we actually care about is being able to focus on your own thoughts, not just reducing your exposure to the outputs of other minds.

adamzerner on Against podcasts

Hm. I was actually assuming in this post that the podcasts in question were actually "Effective Information" as opposed to "Trivia" or "Mental Masturbation". The issue is that even if they are "Effective Information", you also need to have solitude in your "diet", and the benefit of additional "Effective Information" probably isn't worth the cost of less solitude.

But I'm also realizing now that much of the time podcasts aren't actually "Effective Information" and are instead something like "Trivia" or "Mental Masturbation". And I see that as a separate but also relevant problem. And I think that carbs is probably a good analogy for that too. Or maybe something like refined sugar. It's a quick hedonic hit and probably ok to have in limited doses, but you really don't want to have too much of it in your diet.