LessWrong 2.0 Reader


[link] introduction to cancer vaccines
bhauth · 2024-05-05T01:06:16.972Z · comments (19)
[link] Please support this blog (with money)
Elizabeth (pktechgirl) · 2024-08-17T15:30:05.641Z · comments (3)
Ablations for “Frontier Models are Capable of In-context Scheming”
AlexMeinke (Paulawurm) · 2024-12-17T23:58:19.222Z · comments (1)
Hierarchical Agency: A Missing Piece in AI Alignment
Jan_Kulveit · 2024-11-27T05:49:04.241Z · comments (20)
The case for more ambitious language model evals
Jozdien · 2024-01-30T00:01:13.876Z · comments (30)
Four visions of Transformative AI success
Steven Byrnes (steve2152) · 2024-01-17T20:45:46.976Z · comments (22)
The Parable Of The Fallen Pendulum - Part 1
johnswentworth · 2024-03-01T00:25:00.111Z · comments (32)
DeepSeek beats o1-preview on math, ties on coding; will release weights
Zach Stein-Perlman · 2024-11-20T23:50:26.597Z · comments (26)
[link] Practically A Book Review: Appendix to "Nonlinear's Evidence: Debunking False and Misleading Claims" (ThingOfThings)
tailcalled · 2024-01-03T17:07:13.990Z · comments (25)
The Pearly Gates
lsusr · 2024-05-30T04:01:14.198Z · comments (6)
You should go to ML conferences
Jan_Kulveit · 2024-07-24T11:47:52.214Z · comments (13)
Please stop using mediocre AI art in your posts
Raemon · 2024-08-25T00:13:52.890Z · comments (24)
Why I'm Moving from Mechanistic to Prosaic Interpretability
Daniel Tan (dtch1997) · 2024-12-30T06:35:43.417Z · comments (34)
What I Would Do If I Were Working On AI Governance
johnswentworth · 2023-12-08T06:43:42.565Z · comments (32)
Introduction to French AI Policy
Lucie Philippon (lucie-philippon) · 2024-07-04T03:39:45.273Z · comments (12)
Ten arguments that AI is an existential risk
KatjaGrace · 2024-08-13T17:00:03.397Z · comments (41)
Sorry for the downtime, looks like we got DDosd
habryka (habryka4) · 2024-12-02T04:14:30.209Z · comments (13)
Being nicer than Clippy
Joe Carlsmith (joekc) · 2024-01-16T19:44:23.893Z · comments (32)
' petertodd'’s last stand: The final days of open GPT-3 research
mwatkins · 2024-01-22T18:47:00.710Z · comments (16)
A Selection of Randomly Selected SAE Features
CallumMcDougall (TheMcDouglas) · 2024-04-01T09:09:49.235Z · comments (2)
[link] A primer on the current state of longevity research
Abhishaike Mahajan (abhishaike-mahajan) · 2024-08-22T17:14:57.990Z · comments (6)
The Leopold Model: Analysis and Reactions
Zvi · 2024-06-14T15:10:03.480Z · comments (19)
Clarifying METR's Auditing Role
Beth Barnes (beth-barnes) · 2024-05-30T18:41:56.029Z · comments (1)
OthelloGPT learned a bag of heuristics
jylin04 · 2024-07-02T09:12:56.377Z · comments (10)
"AI Alignment" is a Dangerously Overloaded Term
Roko · 2023-12-15T14:34:29.850Z · comments (100)
Fact Finding: Attempting to Reverse-Engineer Factual Recall on the Neuron Level (Post 1)
Neel Nanda (neel-nanda-1) · 2023-12-23T02:44:24.270Z · comments (10)
Attitudes about Applied Rationality
Camille Berger (Camille Berger) · 2024-02-03T14:42:22.770Z · comments (18)
The Big Nonprofits Post
Zvi · 2024-11-29T16:10:06.938Z · comments (10)
[link] Most smart and skilled people are outside of the EA/rationalist community: an analysis
titotal (lombertini) · 2024-07-12T12:13:56.215Z · comments (36)
Danger, AI Scientist, Danger
Zvi · 2024-08-15T22:40:06.715Z · comments (9)
Me, Myself, and AI: the Situational Awareness Dataset (SAD) for LLMs
L Rudolf L (LRudL) · 2024-07-08T22:24:38.441Z · comments (36)
[link] Announcing turntrout.com, my new digital home
TurnTrout · 2024-11-17T17:42:08.164Z · comments (24)
[link] Perplexity wins my AI race
Elizabeth (pktechgirl) · 2024-08-24T19:20:10.859Z · comments (12)
Discriminating Behaviorally Identical Classifiers: a model problem for applying interpretability to scalable oversight
Sam Marks (samuel-marks) · 2024-04-18T16:17:39.136Z · comments (10)
2023 in AI predictions
jessicata (jessica.liu.taylor) · 2024-01-01T05:23:42.514Z · comments (35)
Demystifying "Alignment" through a Comic
milanrosko · 2024-06-09T08:24:22.454Z · comments (19)
[link] Aristocracy and Hostage Capital
Arjun Panickssery (arjun-panickssery) · 2025-01-08T19:38:47.104Z · comments (7)
Catching AIs red-handed
ryan_greenblatt · 2024-01-05T17:43:10.948Z · comments (27)
Scaling and evaluating sparse autoencoders
leogao · 2024-06-06T22:50:39.440Z · comments (6)
The first future and the best future
KatjaGrace · 2024-04-25T06:40:04.510Z · comments (12)
Why I'm doing PauseAI
Joseph Miller (Josephm) · 2024-04-30T16:21:54.156Z · comments (16)
Skills I'd like my collaborators to have
Raemon · 2024-02-09T08:20:37.686Z · comments (9)
In favour of exploring nagging doubts about x-risk
owencb · 2024-06-25T23:52:01.322Z · comments (2)
New LessWrong review winner UI ("The LeastWrong" section and full-art post pages)
kave · 2024-02-28T02:42:05.801Z · comments (64)
[link] A Chess-GPT Linear Emergent World Representation
Adam Karvonen (karvonenadam) · 2024-02-08T04:25:15.222Z · comments (14)
The Local Interaction Basis: Identifying Computationally-Relevant and Sparsely Interacting Features in Neural Networks
Lucius Bushnaq (Lblack) · 2024-05-20T17:53:25.985Z · comments (4)
[question] What convincing warning shot could help prevent extinction from AI?
Charbel-Raphaël (charbel-raphael-segerie) · 2024-04-13T18:09:29.096Z · answers+comments (18)
On the future of language models
owencb · 2023-12-20T16:58:28.433Z · comments (17)
SAE reconstruction errors are (empirically) pathological
wesg (wes-gurnee) · 2024-03-29T16:37:29.608Z · comments (16)
[link] A case for AI alignment being difficult
jessicata (jessica.liu.taylor) · 2023-12-31T19:55:26.130Z · comments (58)