LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

← previous page (newer posts) · next page (older posts) →

Personal AI Planning
jefftk (jkaufman) · 2024-11-10T14:00:06.837Z · comments (10)

(Salt) Water Gargling as an Antiviral
Elizabeth (pktechgirl) · 2024-11-22T18:00:02.765Z · comments (0)

Why Large Bureaucratic Organizations?
johnswentworth · 2024-08-27T18:30:07.422Z · comments (52)

Indecision and internalized authority figures
Kaj_Sotala · 2024-07-06T10:10:02.528Z · comments (1)

AI #42: The Wrong Answer
Zvi · 2023-12-14T14:50:05.086Z · comments (6)

Timaeus is hiring!
Jesse Hoogland (jhoogland) · 2024-07-12T23:42:28.651Z · comments (6)

Towards Guaranteed Safe AI: A Framework for Ensuring Robust and Reliable AI Systems
Joar Skalse (Logical_Lunatic) · 2024-05-17T19:13:31.380Z · comments (10)

[Intuitive self-models] 4. Trance
Steven Byrnes (steve2152) · 2024-10-08T13:30:41.446Z · comments (7)

[link] The economics of space tethers
harsimony · 2024-08-22T16:15:22.699Z · comments (22)

[link] Open Source Automated Interpretability for Sparse Autoencoder Features
kh4dien · 2024-07-30T21:11:36.866Z · comments (1)

o1-preview is pretty good at doing ML on an unknown dataset
Håvard Tveit Ihle (havard-tveit-ihle) · 2024-09-20T08:39:49.927Z · comments (1)

[question] Will quantum randomness affect the 2028 election?
Thomas Kwa (thomas-kwa) · 2024-01-24T22:54:30.800Z · answers+comments (52)

An AI Race With China Can Be Better Than Not Racing
niplav · 2024-07-02T17:57:36.976Z · comments (32)

OpenAI: Altman Returns
Zvi · 2023-11-30T14:10:05.469Z · comments (12)

Preventing model exfiltration with upload limits
ryan_greenblatt · 2024-02-06T16:29:33.999Z · comments (21)

How to be an amateur polyglot
arisAlexis (arisalexis) · 2024-05-08T15:08:11.404Z · comments (16)

Implementing activation steering
Annah (annah) · 2024-02-05T17:51:55.851Z · comments (7)

Friendship is transactional, unconditional friendship is insurance
Ruby · 2024-07-17T22:52:41.967Z · comments (24)

[link] On Shifgrethor
JustisMills · 2024-10-27T15:30:13.688Z · comments (18)

minutes from a human-alignment meeting
bhauth · 2024-05-24T05:01:53.904Z · comments (4)

[link] Most experts believe COVID-19 was probably not a lab leak
DanielFilan · 2024-02-02T19:28:00.319Z · comments (89)

Out-of-distribution Bioattacks
jefftk (jkaufman) · 2023-12-02T12:20:05.626Z · comments (15)

[link] Funding case: AI Safety Camp
Remmelt (remmelt-ellen) · 2023-12-12T09:08:18.911Z · comments (5)

OpenAI's Preparedness Framework: Praise & Recommendations
Akash (akash-wasil) · 2024-01-02T16:20:04.249Z · comments (1)

[link] Static Analysis As A Lifestyle
adamShimi · 2024-07-03T18:29:37.384Z · comments (11)

Do Not Mess With Scarlett Johansson
Zvi · 2024-05-22T15:10:03.215Z · comments (7)

[link] How LDT helps reduce the AI arms race
Tamsin Leake (carado-1) · 2023-12-10T16:21:44.409Z · comments (13)

AI #69: Nice
Zvi · 2024-06-20T12:40:02.566Z · comments (9)

[link] An Opinionated Evals Reading List
Marius Hobbhahn (marius-hobbhahn) · 2024-10-15T14:38:58.778Z · comments (0)

METR is hiring!
Beth Barnes (beth-barnes) · 2023-12-26T21:00:50.625Z · comments (1)

Occupational Licensing Roundup #1
Zvi · 2024-10-30T11:00:04.516Z · comments (11)

Interpreting and Steering Features in Images
Gytis Daujotas (gytis-daujotas) · 2024-06-20T18:33:59.512Z · comments (6)

Fear of centralized power vs. fear of misaligned AGI: Vitalik Buterin on 80,000 Hours
Seth Herd · 2024-08-05T15:38:09.682Z · comments (22)

SAEs (usually) Transfer Between Base and Chat Models
Connor Kissane (ckkissane) · 2024-07-18T10:29:46.138Z · comments (0)

How a chip is designed
YM (Yannick_Muehlhaeuser_duplicate0.05902100825326273) · 2024-06-28T08:04:27.392Z · comments (4)

2. Corrigibility Intuition
Max Harms (max-harms) · 2024-06-08T15:52:29.971Z · comments (10)

[link] The Perceptron Controversy
Yuxi_Liu · 2024-01-10T23:07:23.341Z · comments (18)

Schelling game evaluations for AI control
Olli Järviniemi (jarviniemi) · 2024-10-08T12:01:24.389Z · comments (5)

[question] What's with all the bans recently?
[deleted] · 2024-04-04T06:16:49.062Z · answers+comments (83)

The Third Fundamental Question
Screwtape · 2024-11-15T04:01:33.770Z · comments (7)

Advice to junior AI governance researchers
Akash (akash-wasil) · 2024-07-08T19:19:07.316Z · comments (1)

AI Craftsmanship
abramdemski · 2024-11-11T22:17:01.112Z · comments (7)

On the Debate Between Jezos and Leahy
Zvi · 2024-02-06T14:40:05.487Z · comments (6)

Announcing New Beginner-friendly Book on AI Safety and Risk
Darren McKee · 2023-11-25T15:57:08.078Z · comments (2)

[Interim research report] Activation plateaus & sensitive directions in GPT2
StefanHex (Stefan42) · 2024-07-05T17:05:25.631Z · comments (2)

[link] DeepMind: Frontier Safety Framework
Zach Stein-Perlman · 2024-05-17T17:30:02.504Z · comments (0)

Book Review: On the Edge: The Fundamentals
Zvi · 2024-09-23T13:40:11.058Z · comments (3)

[Intuitive self-models] 8. Rooting Out Free Will Intuitions
Steven Byrnes (steve2152) · 2024-11-04T18:16:26.736Z · comments (16)

Superposition is not "just" neuron polysemanticity
LawrenceC (LawChan) · 2024-04-26T23:22:06.066Z · comments (4)

Please do not use AI to write for you
Richard_Kennaway · 2024-08-21T09:53:34.425Z · comments (34)

← previous page (newer posts) · next page (older posts) →

Archive

Recent comments

yanling-guo on How Universal Basic Income Could Help Us Build a Brighter Future

I’m personally responsible for every point in my post, not ChatGPT. While I can conceive some don’t like ChatGPT, I don’t understand what’s the purpose of human written comments if you use exactly the same phrases as Kennaway: “something ChatGPT might right”, etc.

I have genuine belief in what I published. This post is a call to the business to actively co-shape UBI instead of passively rejecting it. Whoever pays, has accordingly more say, like if Microsoft co-finances UBI, it can ask UBI recipients to learn its online courses and make certificates, so when the economy recovers and Microsoft again wants to hire more people, it can more easily find qualified staff. I don’t know what other companies may want, but in general if you don’t participate in the financing, you also have no say.

t3t on [deleted]

Should be fixed now.

t3t on [deleted]

Good catch, looks like that's from this [? · GW] revision, which looks like it was copied over from Arbital - some LaTeX didn't make it through. I'll see if it's trivial to fix.

t3t on [deleted]

The page isn't dead, Arbital pages just don't load sometimes (or take 15+ seconds).

yanling-guo on How Universal Basic Income Could Help Us Build a Brighter Future

Yes, I used ChatGPT to polish the English, it did a great job 👍 while I am of course myself responsible for every point in this post.

To your comments:

This post points out that it’s better for the business to actively co-shape UBI instead of passively rejecting it. For humanitarian reasons, it’s good to ensure the existential minimum for everyone, even those too old or too sick to learn or work. If this minimum is already covered by other governmental programs or philanthropic organizations, there’s no need to include it in UBI. If business co-shape UBI, they can ask it to be conditioned on completing training programs, like the certificates offered by Microsoft. It’s an illusion that the market can automatically solve the problem. The market mechanisms only says that in an economic downturn, staff should be laid off. When the economy recovers, they can be re-hired. But the market mechanism doesn’t take care to maintain a disciplined reserve workforce in the meantime. When business starts to re-hire, they may find it difficult to find qualified staff, because part of the workforce drifted off in the meantime, some got mental problems, or alcohol/drug problems, some were radicalized, beside the agony suffered by the laid-off staff and destabilization faced by the society, it also becomes more expensive for the business to find qualified staff when they need them. Of course you can say that it’s the government’s job to take care for the unemployed, but of course government has to raise taxes for its social programs. If business actively co-shapes, it can make such programs more effective and efficient, have the reserve workforce trained in the way they can better find a job or start self-employment/start-up that can better meet the needs of the economy, and that at a lower cost.

avturchin on Are You More Real If You're Really Forgetful?

I'm inclined to bite this bullet too, though it feels somewhat strange. Weird implication: you can increase the amount of reality-fluid assigned to you by giving yourself amnesia.

I explored a similar line of reasoning here: Magic by forgetting [LW · GW]

I think that yes, the sameness of humans as agents is generated by the process of self-identification in which a human being is identifies herself through a short string of information "Name, age, sex, profession + few more kilobytes". Evidence for this is the success of improv theatre, where people quickly adopt completely new roles through one-line instructions.

If yes, then we should expect ourselves to be agents that exist in a universe that abstracts well, because "high-level agents" embedded in such universes are "supported" by a larger equivalence class of universes (since they draw on reality fluid from an entire pool of "low-level" agents).

I think that your conclusion is valid.

keltan on Which things were you surprised to learn are not metaphors?

If I’ll probably see them again, I don’t miss people. I thought people saying they miss you were just being overly polite.

interstice on lemonhope's Shortform

Yeah I definitely agree you should start learning as young as possible. I think I would usually advise someone starting out to learn general math/CS stuff and do AI safety on the side, since there's way more high-quality knowledge in those fields. Although "just dive in to AI" seems to have worked out well for some people like Chris Olah, and timelines are plausibly pretty short so ¯\_(ツ)_/¯

yams on yams's Shortform

Yes this world.

drossbucket on Doing Research Part-Time is Great

Interesting post! I’ve wondered the same thing before.

I’m doing a much more half-arsed version, as a casual quantum foundations enjoyer alongside a technical writing job, and also getting endlessly distracted by other things I find interesting, so my output is not impressive. But it’s a pretty fun hobby and I’m surprised more people don’t try this!