LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

next page (older posts) →

AI-enabled coups: a small group could use AI to seize power
Tom Davidson (tom-davidson-1) · 2025-04-16T16:51:29.561Z · comments (16)

Ctrl-Z: Controlling AI Agents via Resampling
Aryan Bhatt (abhatt349) · 2025-04-16T16:21:23.781Z · comments (0)

Training AGI in Secret would be Unsafe and Unethical
Daniel Kokotajlo (daniel-kokotajlo) · 2025-04-18T12:27:35.795Z · comments (2)

Three Months In, Evaluating Three Rationalist Cases for Trump
Arjun Panickssery (arjun-panickssery) · 2025-04-18T08:27:27.257Z · comments (13)

ALLFED emergency appeal: Help us raise $800,000 to avoid cutting half of programs
denkenberger · 2025-04-16T21:47:40.687Z · comments (8)

[link] The Russell Conjugation Illuminator
TimmyM (timmym) · 2025-04-17T19:33:06.924Z · comments (13)

Handling schemers if shutdown is not an option
Buck · 2025-04-18T14:39:18.609Z · comments (0)

What Makes an AI Startup "Net Positive" for Safety?
jacquesthibs (jacques-thibodeau) · 2025-04-18T20:33:22.682Z · comments (8)

Scaffolding Skills
Screwtape · 2025-04-18T17:39:25.634Z · comments (0)

o3 Will Use Its Tools For You
Zvi · 2025-04-18T21:20:02.566Z · comments (2)

[link] Understanding and overcoming AGI apathy
Dhruv Sumathi (dhruv-sumathi) · 2025-04-17T01:04:53.853Z · comments (1)

AI #112: Release the Everything
Zvi · 2025-04-17T15:10:02.029Z · comments (6)

GPT-4.1 Is a Mini Upgrade
Zvi · 2025-04-16T19:00:03.181Z · comments (6)

Prodromes and Biomarkers in Chronic Disease
sarahconstantin · 2025-04-16T21:30:02.978Z · comments (2)

Understanding Trust: Overview Presentations
abramdemski · 2025-04-16T18:08:31.064Z · comments (0)

[link] Inside OpenAI's Controversial Plan to Abandon its Nonprofit Roots
garrison · 2025-04-18T18:46:57.310Z · comments (0)

GPT-4.5 is Cognitive Empathy, Sonnet 3.5 is Affective Empathy
Jack (jack-3) · 2025-04-16T19:12:38.789Z · comments (2)

[link] Top OpenAI Catastrophic Risk Official Steps Down Abruptly
garrison · 2025-04-16T16:04:28.115Z · comments (0)

[link] METR’s preliminary evaluation of o3 and o4-mini
Christopher King (christopher-king) · 2025-04-16T20:23:00.285Z · comments (2)

[question] Comprehensive up-to-date resources on the Chinese Communist Party's AI strategy, etc?
Mateusz Bagiński (mateusz-baginski) · 2025-04-18T04:58:32.037Z · answers+comments (2)

Understanding Trust - Overview Presentations
abramdemski · 2025-04-16T18:05:39.792Z · comments (0)

[link] Telescoping
za3k (lispalien) · 2025-04-16T17:05:52.392Z · comments (1)

[link] Announcing Progress Conference 2025
jasoncrawford · 2025-04-17T17:12:44.191Z · comments (0)

Kamelo: A Rule-Based Constructed Language for Universal, Logical Communication
Saif Khan (saif-khan) · 2025-04-16T18:44:00.139Z · comments (7)

British and American Connotations
jefftk (jkaufman) · 2025-04-18T13:00:09.440Z · comments (2)

[link] Can LLM-based models do model-based planning?
jylin04 · 2025-04-16T12:38:00.793Z · comments (1)

Host Keys and SSHing to EC2
jefftk (jkaufman) · 2025-04-17T15:10:29.139Z · comments (6)

[Rockville] Rationalist Shabbat
maia · 2025-04-18T15:38:30.650Z · comments (0)

[link] Conditional Forecasting as Model Parameterization
Molly (hickman-santini) · 2025-04-18T02:35:42.110Z · comments (0)

[link] Human-level is not the limit
Vishakha (vishakha-agrawal) · 2025-04-16T08:33:15.498Z · comments (2)

0 Motivation Mapping through Information Theory
P. João (gabriel-brito) · 2025-04-18T00:53:34.360Z · comments (0)

Mass Exposure Paradox
max-sixty · 2025-04-16T20:18:00.492Z · comments (0)

How Logic "Really" Works: An Engineering Perspective
Daniil Strizhov (mila-dolontaeva) · 2025-04-16T05:34:09.443Z · comments (0)

Gamify life from BayesianMind
P. João (gabriel-brito) · 2025-04-16T16:17:49.284Z · comments (2)

Karma Tests in Logical Counterfactual Simulations motivates strong agents to protect weak agents
Knight Lee (Max Lee) · 2025-04-18T11:11:23.239Z · comments (0)

[link] AI is advancing fast
Vishakha (vishakha-agrawal) · 2025-04-16T08:17:06.055Z · comments (0)

One Night in Delphi
Eggs (donald-sampson) · 2025-04-18T02:17:04.957Z · comments (2)

Finance and AI Timelines
DAL · 2025-04-16T16:55:06.957Z · comments (0)

On AI personhood
p.b. · 2025-04-17T12:31:52.288Z · comments (6)

[link] AI may attain human level soon
Vishakha (vishakha-agrawal) · 2025-04-16T08:28:55.592Z · comments (0)

8 PRIME SKILLS - A simplified construction from MaxEnt Informational Efficiency in 4 questions
P. João (gabriel-brito) · 2025-04-17T11:04:07.424Z · comments (4)

[link] The road from human-level to superintelligent AI may be short
Vishakha (vishakha-agrawal) · 2025-04-16T08:35:54.376Z · comments (0)

The Case for White Box Control
J Rosser (j-rosser-uk) · 2025-04-18T16:10:57.823Z · comments (0)

Consequentialists should have a comprehensive set of deontological beliefs they adhere to
Jay95 · 2025-04-18T20:50:27.064Z · comments (2)

[link] How worker co-ops can help restore social trust
B Jacobs (Bob Jacobs) · 2025-04-17T14:14:47.165Z · comments (5)

[link] Doing Prioritization Better
arvomm (arvo-munoz) · 2025-04-16T18:46:41.797Z · comments (1)

8 PRIME SKILLS – A construction from MaxEnt Informational Efficiency in 4 questions
P. João (gabriel-brito) · 2025-04-16T16:53:51.351Z · comments (0)

Towards Understanding the Representation of Belief State Geometry in Transformers
Karthik Viswanathan (vkarthik095) · 2025-04-18T12:39:01.251Z · comments (0)

Opportunity to to learn more about AI Innovation & Security Policy
PolicyTakes · 2025-04-16T01:35:27.203Z · comments (0)

Evaluating Collaborative AI Performance Subject to Sabotage
Matthew Khoriaty (matthew-khoriaty) · 2025-04-18T19:33:41.547Z · comments (0)

next page (older posts) →

Archive

Recent comments

romeostevensit on Three Months In, Evaluating Three Rationalist Cases for Trump

I think the major impacts that matter are on war, pandemic risk, and x-risk. I rarely see anyone try to figure those out, perhaps the sign is too uncertain due to complexity.

jkaufman on Risers for Foot Percussion

I did see your comment on FB! I'm still thinking about what I want to try next. I'm worried that silicone with your method would tear, though.

hpcfung on Rationalist Should Win. Not Dying with Dignity and Funding WBE.

I'm also interested, have you made any progress since your comment?

lc on Three Months In, Evaluating Three Rationalist Cases for Trump

The doubling down is delusional but I think you're simplifying the failure of projection a bit. The inability of markets and forecasters to predict Trump's second term is quite interesting. A lot of different models of politics failed.

gjm on o3 Will Use Its Tools For You

Pedantic note: there are many instances of "syncopathy" that I am fairly sure should be "sycophancy".

(It's an understandable mistake -- "syncopathy" is composed of familiar components, which could plausibly be put together to mean something like "the disease of agreeing too much" which is, at least in the context of AI, not far off what sycophancy in fact means. Whereas if you can parse "sycophancy" at all you might work out that it means "fig-showing" which obviously has nothing to do with anything. So far as I can tell, no one actually knows how "fig-showing" came to be the term for servile flattery.)

michaeldickens on Planning for Extreme AI Risks

I think the right way to self-destruct isn't to shut down entirely. It's to spend all your remaining assets on safety (whether that be lobbying for regulations, or research, or whatever). This would greatly increase the total amount of money spent on safety efforts so it might help quite a lot.

I do believe shutting down does have a decent chance, although not a comfortingly large one, of scaring government and/or other AI companies into taking the risks seriously.

anthonyc on What Makes an AI Startup "Net Positive" for Safety?

I won't comment on your specific startup, but I wonder in general how an AI Safety startup becomes a successful business. What's the business model? Who is the target customer? Why do they buy? Unless the goal is to get acquired by one of the big labs, in which case, sure, but again, why or when do they buy, and at what price? Especially since they already don't seem to be putting much effort into solving the problem themselves despite having better tools and more money to do so than any new entrant startup.

anthonyc on Three Months In, Evaluating Three Rationalist Cases for Trump

I really, really hope at some point the Democrats will acknowledge the reason they lost is that they failed to persuade the median voter of their ideas, and/or adopt ideas that appeal to said voters. At least among those I interact with, there seems to be a denial of the idea that this is how you win elections, which is a prerequisite for governing.

saidachmiz on A Dissent on Honesty

The hard cases are much more interesting. What about lying to my landlord about renting a room on airbnb? What about saying your class will make people millionaires for the low low price of $1,000 (hey, it could happen)? What about hiding the rats from the health inspector?

None of these seem like hard cases to me. Lying is wrong (and pretty obviously so) in all three of these cases.

anthonyc on Why Does It Feel Like Something? An Evolutionary Path to Subjectivity

That seems very possible to me, and if and when we can show whether something like that is the case, I do think it would represent significant progress. If nothing else, it would help tell us what the thing we need to be examining actually is, in a way we don't currently have an easy way to specify.