LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

← previous page (newer posts) · next page (older posts) →

[Intuitive self-models] 8. Rooting Out Free Will Intuitions
Steven Byrnes (steve2152) · 2024-11-04T18:16:26.736Z · comments (16)

AI research assistants competition 2024Q3: Tie between Elicit and You.com
Elizabeth (pktechgirl) · 2024-10-12T15:10:05.417Z · comments (4)

[link] AI, centralization, and the One Ring
owencb · 2024-09-13T14:00:16.126Z · comments (11)

Retrospective: PIBBSS Fellowship 2024
DusanDNesic · 2024-12-20T15:55:24.194Z · comments (1)

A Qualitative Case for LTFF: Filling Critical Ecosystem Gaps
Linch · 2024-12-03T21:57:23.597Z · comments (2)

AI Craftsmanship
abramdemski · 2024-11-11T22:17:01.112Z · comments (7)

Book Review: On the Edge: The Fundamentals
Zvi · 2024-09-23T13:40:11.058Z · comments (3)

Brief analysis of OP Technical AI Safety Funding
22tom (thomas-barnes) · 2024-10-25T19:37:41.674Z · comments (5)

Another argument against maximizer-centric alignment paradigms
Fiora from Rosebloom · 2024-09-22T07:28:27.856Z · comments (39)

[link] Pay-on-results personal growth: first success
Chipmonk · 2024-09-14T03:39:12.975Z · comments (6)

SAEs are highly dataset dependent: a case study on the refusal direction
Connor Kissane (ckkissane) · 2024-11-07T05:22:18.807Z · comments (4)

[link] RL, but don't do anything I wouldn't do
Gunnar_Zarncke · 2024-12-07T22:54:50.714Z · comments (5)

[question] Is cybercrime really costing trillions per year?
Fabien Roger (Fabien) · 2024-09-27T08:44:07.621Z · answers+comments (28)

[Intuitive self-models] 6. Awakening / Enlightenment / PNSE
Steven Byrnes (steve2152) · 2024-10-22T13:23:08.836Z · comments (8)

Book Review: On the Edge: The Future
Zvi · 2024-09-27T14:00:05.279Z · comments (1)

[link] Dario Amodei — Machines of Loving Grace
Matrice Jacobine · 2024-10-11T21:43:31.448Z · comments (26)

[link] Slightly More Than You Wanted To Know: Pregnancy Length Effects
JustisMills · 2024-10-21T01:26:02.030Z · comments (4)

[link] Anthropic leadership conversation
Zach Stein-Perlman · 2024-12-20T22:00:45.229Z · comments (16)

[link] Electrostatic Airships?
DaemonicSigil · 2024-10-27T04:32:34.852Z · comments (13)

[link] on bacteria, on teeth
bhauth · 2024-09-30T15:56:56.830Z · comments (9)

Training AI agents to solve hard problems could lead to Scheming
Marius Hobbhahn (marius-hobbhahn) · 2024-11-19T00:10:55.522Z · comments (12)

Cognitive Work and AI Safety: A Thermodynamic Perspective
Daniel Murfet (dmurfet) · 2024-12-08T21:42:17.023Z · comments (9)

Checking in on Scott's composition image bet with imagen 3
Dave Orr (dave-orr) · 2024-12-22T19:04:17.495Z · comments (0)

[link] Zen and The Art of Semiconductor Manufacturing
Recurrented (rachel-farley) · 2024-12-09T17:19:35.236Z · comments (2)

A case for donating to AI risk reduction (including if you work in AI)
tlevin (trevor) · 2024-12-02T19:05:06.658Z · comments (2)

Why imperfect adversarial robustness doesn't doom AI control
Buck · 2024-11-18T16:05:06.763Z · comments (26)

AI #95: o1 Joins the API
Zvi · 2024-12-19T15:10:05.196Z · comments (1)

MATS Alumni Impact Analysis
utilistrutil · 2024-09-30T02:35:57.273Z · comments (7)

[link] electric turbofans
bhauth · 2024-11-02T22:50:59.807Z · comments (2)

Base LLMs refuse too
Connor Kissane (ckkissane) · 2024-09-29T16:04:21.343Z · comments (20)

Toward Safety Cases For AI Scheming
Mikita Balesni (mykyta-baliesnyi) · 2024-10-31T17:20:06.019Z · comments (1)

Pollsters Should Publish Question Translations
jefftk (jkaufman) · 2024-09-08T22:10:04.932Z · comments (3)

AI Alignment via Slow Substrates: Early Empirical Results With StarCraft II
Lester Leong (lester-leong) · 2024-10-14T04:05:05.096Z · comments (9)

Why our politicians aren't Median
Yair Halberstadt (yair-halberstadt) · 2024-11-03T14:03:33.779Z · comments (15)

Against empathy-by-default
Steven Byrnes (steve2152) · 2024-10-16T16:38:49.926Z · comments (24)

[link] Linkpost: Memorandum on Advancing the United States’ Leadership in Artificial Intelligence
Nisan · 2024-10-25T04:37:00.828Z · comments (2)

o1 Turns Pro
Zvi · 2024-12-10T17:00:08.036Z · comments (3)

Intricacies of Feature Geometry in Large Language Models
7vik (satvik-golechha) · 2024-12-07T18:10:51.375Z · comments (0)

AI #81: Alpha Proteo
Zvi · 2024-09-12T13:00:07.958Z · comments (3)

How you can help pass important AI legislation with 10 minutes of effort
ThomasW · 2024-09-14T22:10:50.386Z · comments (2)

AI #86: Just Think of the Potential
Zvi · 2024-10-17T15:10:06.552Z · comments (8)

The Geometry of Feelings and Nonsense in Large Language Models
7vik (satvik-golechha) · 2024-09-27T17:49:27.420Z · comments (10)

Mira Murati leaves OpenAI/ OpenAI to remove non-profit control
Sodium · 2024-09-25T21:15:17.315Z · comments (4)

[Intuitive self-models] 5. Dissociative Identity (Multiple Personality) Disorder
Steven Byrnes (steve2152) · 2024-10-15T13:31:46.157Z · comments (7)

[link] How much I'm paying for AI productivity software (and the future of AI use)
jacquesthibs (jacques-thibodeau) · 2024-10-11T17:11:27.025Z · comments (16)

[link] The Alignment Trap: AI Safety as Path to Power
crispweed · 2024-10-29T15:21:26.545Z · comments (17)

Seeking Collaborators
abramdemski · 2024-11-01T17:13:36.162Z · comments (15)

AI #87: Staying in Character
Zvi · 2024-10-29T07:10:08.212Z · comments (3)

[question] Could orcas be (trained to be) smarter than humans? 
Towards_Keeperhood (Simon Skade) · 2024-11-04T23:29:26.677Z · answers+comments (20)

An Illustrated Summary of "Robust Agents Learn Causal World Model"
Dalcy (Darcy) · 2024-12-14T15:02:44.828Z · comments (2)

← previous page (newer posts) · next page (older posts) →

Archive

Recent comments

nathan-helm-burger on evhub's Shortform

I agree! I contributed to and endorse this Corrigibility plan by Max Harms (MIRI researcher): Corrigibility as Singular Target [? · GW]

(See also posts by Seth Herd)

I think CAST offers much better safety under higher capabilities and more agentic workflows.

nathan-helm-burger on evhub's Shortform

In regards to:

Give more access to orgs like Redwood, Apollo, and METR (I don't know how much access you currently give, but I suspect the globally-optimal thing would be to give more)

I agree, and I also think that this would be better implemented by government AI Safety Institutions.

Specifically, I think that AISIs should build (and make mandatory the use of) special SCIF-style [AF · GW] reading rooms where external evaluators would be given early access to new models. This would mean that the evaluators would need permission from the government, rather than permission from AI companies. I think it's a mistake to rely on the AI companies voluntarily giving early access to external evaluators.

I think that Anthropic could make this a lot more likely to happen if they pushed for it, and that then it wouldn't be so hard to pull other major AI companies into the plan.

rotatingpaguro on No, the Polymarket price does not mean we can immediately conclude what the probability of a bird flu pandemic is. We also need to know the interest rate!

When you say "true probability", what do you mean?

The current hypotheses I have about what you mean are (in part non-exclusive):

You think some notion of objective, non-observer dependent probability makes sense, and that's the true probability.
You do not think "true probability" exists, you are referencing to it to say the market price is not anything like that.
You define "true probability" a probability that observers contextually agree on (like a coin flip observed by humans who don't know the thrower).

benito on RESCHEDULED Lighthaven Sequences Reading Group #16 (Saturday 12/28)

Reminder that this is happening tonight! Last one of the year :)

Also tonight's might be the last one for a while, Lighthaven is getting rented out for MATS in the New Year, so we might be needing to find a new space. I'll be adding folks who come tonight to a Signal group for more chat/updates (you can also text me to be added).

lsusr on What is your personal totalizing and self-consistent worldview/philosophy?

Yes, Bryan Caplan is not noticeably differentiated from other libertarian economists.

I'd be curious to hear if you see something deeper or more totalising in these people?

My answer might contain a frustratingly small amount of detail, because answering your question properly would require a top-level post for each person just to get the main ideas across.

Paul Graham is special because he has a proven track record of accurately calibrated confidence. He has an entire system for making progress at unknown unknowns. Much of that system is about knowing what you don't know, which results in him carefully restricting claims about his narrow domain of specialization. However, because that domain of specialization is "startups", its lightcone has already had (what I consider to be) a totalising impact.

Asimov's turned The Decline and Fall of the Roman Empire into his first popular novel. He eventually extended the whole thing into a future competition between psychohistory and a gaia planet. He didn't just create one Dath Ilan. He created two of them (albeit at much lower resolution).

As to the other authors you mention:

I haven't read enough Greg Egan or Vernor Vinge comment on them.
Heinlein absolutely has "his own totalising and self-consistent worldview/philosophy". I love his writing, but I just don't agree with him enough for him to make the list. I prefer Saturn's Children (and especially Neptune's Brood) by Charles Stross. Saturn's Children is basically Heinlein + Asimov fanfiction that takes their work in a different direction. Neptune's Brood is its sequel about interstellar cryptocoin markets.
Clarke was mostly boring to me, except for 3001: The Final Odyssey.
Neal Stephenson is definitely smart, but I never got the feeling he was trying to mind control me. Maybe that's just because he's so good at it.

vladimir_nesov on A breakdown of AI capability levels focused on AI R&D labor acceleration

The range of capabilities between what can be gained at a reasonable test-time cost and at an absurd cost (but in reasonable time) can remain small, with most improvements to the system exceeding this range, likely to move what could only be obtained at an absurd cost before into the reasonable range. This is true right now (for general intelligence), and it could well remain true until the intelligence explosion.

lsusr on What is your personal totalizing and self-consistent worldview/philosophy?

to hate something is the origin of my work

I like that quote.

lsusr on What is your personal totalizing and self-consistent worldview/philosophy?

Yes! 100%. I too have noticed that stating these outright doesn't work at all. It's also bad for developing one too.

When I'm trying to sell ideas I do so more indirectly than this. The reason I wrote this post is because I felt I did have one, and wanted to verify to myself that this was true.

radford-neal-1 on By default, capital will matter more than ever after AGI

All the infra for fiat currency exists; I don't see why the AIs would need to reinvent that

Because using an existing medium of exchange (that's not based on the value of a real commodity) involves transferring real wealth to the current currency holders. Instead, they might, for example, start up a new bitcoin blockchain, and use their new bitcoin, rather than transfer wealth to present bitcoin holders.

Maybe they'd use gold, although the current value of gold is mostly due to its conventional monetary value (rather than its practical usefulness, though that is non-zero).

sharmake-farah on By default, capital will matter more than ever after AGI

I am focused here on short-term politics in the US, which ordinarily would matter less, if it wasn't likely that world-changing AI would be built in the US, but given that it might, it becomes way more important than normal.