LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

← previous page (newer posts) · next page (older posts) →

[link] Agreeing With Stalin in Ways That Exhibit Generally Rationalist Principles
Zack_M_Davis · 2024-03-02T22:05:49.553Z · comments (25)

[link] AI & wisdom 1: wisdom, amortised optimisation, and AI
L Rudolf L (LRudL) · 2024-10-28T21:02:51.215Z · comments (0)

5. Open Corrigibility Questions
Max Harms (max-harms) · 2024-06-10T14:09:20.777Z · comments (0)

Reviewing the Structure of Current AI Regulations
Deric Cheng (deric-cheng) · 2024-05-07T12:34:17.820Z · comments (0)

Paper Summary: Princes and Merchants: European City Growth Before the Industrial Revolution
Jeffrey Heninger (jeffrey-heninger) · 2024-07-15T21:30:04.043Z · comments (1)

[question] What Other Lines of Work are Safe from AI Automation?
RogerDearnaley (roger-d-1) · 2024-07-11T10:01:12.616Z · answers+comments (35)

Examples of How I Use LLMs
jefftk (jkaufman) · 2024-10-14T17:10:04.597Z · comments (2)

Big-endian is better than little-endian
Menotim · 2024-04-29T02:30:48.053Z · comments (17)

DPO/PPO-RLHF on LLMs incentivizes sycophancy, exaggeration and deceptive hallucination, but not misaligned powerseeking
tailcalled · 2024-06-10T21:20:11.938Z · comments (13)

The new ruling philosophy regarding AI
Mitchell_Porter · 2024-11-11T13:28:24.476Z · comments (0)

[question] Where to find reliable reviews of AI products?
Elizabeth (pktechgirl) · 2024-09-17T23:48:25.899Z · answers+comments (6)

[link] My MATS Summer 2023 experience
James Chua (james-chua) · 2024-03-20T11:26:14.944Z · comments (0)

Aggregative Principles of Social Justice
Cleo Nardo (strawberry calm) · 2024-06-05T13:44:47.499Z · comments (10)

AI #61: Meta Trouble
Zvi · 2024-05-02T18:40:03.242Z · comments (0)

[link] Why Recursion Pharmaceuticals abandoned cell painting for brightfield imaging
Abhishaike Mahajan (abhishaike-mahajan) · 2024-11-05T14:51:41.310Z · comments (1)

Disagreement on AGI Suggests It’s Near
tangerine · 2025-01-07T20:42:43.456Z · comments (15)

Mini Go: Gateway Game
jefftk (jkaufman) · 2025-01-14T03:30:02.020Z · comments (1)

Trading Candy
jefftk (jkaufman) · 2024-11-01T01:10:08.024Z · comments (4)

Investigating Sensitive Directions in GPT-2: An Improved Baseline and Comparative Analysis of SAEs
Daniel Lee (daniel-lee) · 2024-09-06T02:28:41.954Z · comments (0)

Concrete Methods for Heuristic Estimation on Neural Networks
Oliver Daniels (oliver-daniels-koch) · 2024-11-14T05:07:55.240Z · comments (0)

[link] Our Digital and Biological Children
Eneasz · 2024-10-24T18:36:38.719Z · comments (0)

Distinguishing ways AI can be "concentrated"
Matthew Barnett (matthew-barnett) · 2024-10-21T22:21:13.666Z · comments (2)

DIY RLHF: A simple implementation for hands on experience
Mike Vaiana (mike-vaiana) · 2024-07-10T12:07:03.047Z · comments (0)

Childhood and Education Roundup #6: College Edition
Zvi · 2024-06-26T11:40:03.990Z · comments (8)

[link] Arithmetic Models: Better Than You Think
kqr · 2024-10-26T09:42:07.185Z · comments (4)

Option control
Joe Carlsmith (joekc) · 2024-11-04T17:54:03.073Z · comments (0)

Reading More Each Day: A Simple $35 Tool
aysajan · 2024-07-24T13:54:04.290Z · comments (2)

Towards Quantitative AI Risk Management
Henry Papadatos (henry) · 2024-10-16T19:26:48.817Z · comments (1)

Winning isn't enough
JesseClifton · 2024-11-05T11:37:39.486Z · comments (14)

Monthly Roundup #19: June 2024
Zvi · 2024-06-25T12:00:03.333Z · comments (9)

AI #64: Feel the Mundane Utility
Zvi · 2024-05-16T15:20:02.956Z · comments (11)

[link] ML Safety Research Advice - GabeM
Gabe M (gabe-mukobi) · 2024-07-23T01:45:42.288Z · comments (2)

Is AI Alignment Enough?
Aram Panasenco (panasenco) · 2025-01-10T18:57:48.409Z · comments (6)

[question] Which things were you surprised to learn are metaphors?
Gordon Seidoh Worley (gworley) · 2024-11-22T03:46:02.845Z · answers+comments (18)

Corrigibility's Desirability is Timing-Sensitive
RobertM (T3t) · 2024-12-26T22:24:17.435Z · comments (4)

Gratitudes: Rational Thanks Giving
Seth Herd · 2024-11-29T03:09:47.410Z · comments (2)

First Solo Bus Ride
jefftk (jkaufman) · 2024-12-03T12:20:02.344Z · comments (1)

Two flavors of computational functionalism
EuanMcLean (euanmclean) · 2024-11-25T10:47:04.584Z · comments (9)

{Book Summary} The Art of Gathering
Tristan Williams (tristan-williams) · 2024-04-16T10:48:41.528Z · comments (0)

Cryonics p(success) estimates are only weakly associated with interest in pursuing cryonics in the LW 2023 Survey
Andy_McKenzie · 2024-02-29T14:47:28.613Z · comments (6)

Can quantised autoencoders find and interpret circuits in language models?
charlieoneill (kingchucky211) · 2024-03-24T20:05:50.125Z · comments (4)

[link] Quick Thoughts on Scaling Monosemanticity
[deleted] · 2024-05-23T16:22:48.035Z · comments (1)

Aggregative principles approximate utilitarian principles
Cleo Nardo (strawberry calm) · 2024-06-12T16:27:22.179Z · comments (3)

AI #65: I Spy With My AI
Zvi · 2024-05-23T12:40:02.793Z · comments (7)

[link] AI Safety at the Frontier: Paper Highlights, August '24
gasteigerjo · 2024-09-03T19:17:24.850Z · comments (0)

Tackling Moloch: How YouCongress Offers a Novel Coordination Mechanism
Hector Perez Arenas (hector-perez-arenas) · 2024-05-15T23:13:48.501Z · comments (9)

Auditing LMs with counterfactual search: a tool for control and ELK
Jacob Pfau (jacob-pfau) · 2024-02-20T00:02:09.575Z · comments (6)

An Affordable CO2 Monitor
Pretentious Penguin (dylan-mahoney) · 2024-03-21T03:06:53.255Z · comments (1)

Collection (Part 6 of "The Sense Of Physical Necessity")
LoganStrohl (BrienneYudkowsky) · 2024-03-14T21:37:00.160Z · comments (0)

An explanation of evil in an organized world
KatjaGrace · 2024-05-02T05:20:06.240Z · comments (9)

← previous page (newer posts) · next page (older posts) →

Archive

Recent comments

josh-you on Implications of the inference scaling paradigm for AI safety

In Holden Karnofsky's "AI Could Defeat All Of Us Combined" a plausible existential risk threat model is described, in which a swarm of human-level AIs outmanoeuvre humans due to AI's faster cognitive speeds and improved coordination, rather than qualitative superintelligence capabilities. This scenario is predicated on the belief that "once the first human-level AI system is created, whoever created it could use the same computing power it took to create it in order to run several hundred million copies for about a year each." If the first AGIs are as expensive to run as o3-high (costing ~$3k/task), this threat model seems much less plausible.

I wonder how different the reasoning paradigm is, actually, from the picture presented here. After all, running a huge number of AI copies in parallel is... scaling up test-time compute.

The overhang argument is a rough analogy anyway. I think you are invoking the intuition of replacing the AI equivalent of a very large group of typical humans with the AI equivalent of a small number of ponderous geniuses, but those analogies are going to be highly imperfect in practice.

viliam on CstineSublime's Shortform

especially if it controls your social media feed

but... it already does :(

I mean, on facebook and xitter and reddit; I am still free to control my browsing of substack

and yes, applying the same level of control to my real life sounds like a bad idea

steve2152 on Heritability: Five Battles

Thanks! Yeah, I think I would have said something pretty similar to that.

Actually, I might have gone a bit further and said:

Maybe, people have the experience

(A) “First, I reprocessed the childhood scare experience. Second, I found some that my adult anxiety was generally relieved to some extent.”

…and they naturally conclude

(B) “…Therefore, the childhood scare experience must have been (partly) causing the adult anxiety all along.”

…but I wonder if we could also entertain an alternate theory:

(B’) “…Gee, I guess this reprocessing must have been a kind of ‘training / practice / exercise’ during which I could forge new better subconscious habits and associations related to ‘the feeling of anxiety’ in general. And these new subconscious habits and associations are now serving me well in a wide variety of adult contexts.”

After all, you can’t form new subconscious habits and associations related to “the feeling of anxiety” except by invoking “the feeling of anxiety” somehow in the process. It seems plausible to me that childhood memories would be very effective way to do that. After all, (1) I think emotions are generally very strong in childhood and teenage years, and (2) maybe there’s some sense in which long-ago memories are objectively “safer” since the situation is long over, and thus it’s easier to entertain the idea that the feeling is not serving any real purpose.

Also, AFAICT, people achieve great therapeutic success by methods that involve bringing up childhood memories, but other people [LW · GW] also achieve great therapeutic success by methods that don’t. :)

I’m not an expert like you are—indeed I have no personal experience whatsoever—so you can tell me if that doesn’t ring true. :)

viliam on keltan's Shortform

Upvoted for the song.

viliam on YangYing's Shortform

What do you mean by "hunger for life"?

What do you mean by "capitalism"?

If you basically mean that machines should help us overcome scarcity, and then everyone should be able to focus on games, friendship, learning, et cetera... sure, why not?

But first we need to make sure the machines won't kill us all when they get smarter than us and start controlling the wold. (Because if they do, it doesn't matter how our corporations and governments were set up.)

However, capitalism, as it stands, obstructs the realization of this vision.

So far, the attempts to replace capitalism often did even worse.

One problem - scarcity. Usually made worse by eliminating capitalism.

Second problem - humans. Psychopaths compete for power, in both capitalism and socialism. We need to solve this. Democracy alone is not a solution; psychopaths are quite successful at getting elected, or getting their people elected.

Is it time to rethink the way corporations and governing systems operate?

I believe people are thinking about this all the time, but do you have a specific proposal that wasn't widely considered yet?

abramdemski on Lecture Series on Tiling Agents

I'm hopeful about it, but preparing the lectures alone will be a lot of work (although the first one will be a repeat of some material presented at ILIAD).

viliam on Comment on "Death and the Gorgon"

Consider Egan's incentives. "A group of effective altruists collects a ton of money, buys anti-malaria nets, saves million African lives (but other millions still die of malaria)" is an improvement over status quo in real life, but it would be a boring and disappointing story.

Cool fictional villains are at least an improvement over the media narrative "EAs are crypto scammers".

I wonder if there are people who joined the rationalist or effective altruist communities because of recent Egan's stories. A negative advertisement is still advertisement... I can imagine someone reading the story, then trying to find more on the internet, then joining; the question is whether this actually happens.

seth-herd on We probably won't just play status games with each other after AGI

I'm wondering less if humans will want to date AGIs and more if AGIs will want to date humans.

Sure, if we solve the alignment problem we can build AGIs that want to date humans; but will we decide that's ethical?

The criteria for consciousness and moral worth are varied and debated. The answer to whether AGIs will be conscious and worthy is definitely sort of.

So: is creating a conscious being with a core motivation designed specifically so that it wants to date you a form of slavery? It definitely smacks of grooming or something....

One issue is whether AGIs will want to stay around the human cognitive level. There's an issue with power dynamics in a relationship between a nerd and a demigod.

Sure the humans can cognitively enhance too; what fraction of us will want to become demigods ourselves?

It's going to be wild if we can get there. And fun. Speaking of which, we won't be playing games mostly for status -- we'll mostly be playing for fun.

We won't all have the coolest friends, but we'll all have cool friends because we'll all be cool friends. Humans will no longer be repressed, neurotic messes because we'll have actual understanding of psychology and actual good, safe, supportive childhoods for essentially everyone.

It's gonna be wild if we can get there.

sarahconstantin on sarahconstantin's Shortform

links 1/15/25: https://roamresearch.com/#/app/srcpublic/page/01-15-2025

https://www.proteinatlas.org/ seems like a good resource. Swedish.
https://en.m.wikipedia.org/wiki/Human_cloning human cloning was first discussed by JBS Haldane in a 1969 speech!
https://en.wikipedia.org/wiki/Protalix_BioTherapeutics they seem pretty successful. enzyme replacement for Gaucher disease. Israeli.
- https://en.wikipedia.org/wiki/Phillip_Frost interesting guy. "served as a lieutenant commander, U.S. Public Health Service at the National Cancer Institute, from 1963 to 1965." Major pharma investor.
What happened to Amyris?
- they used to be a biofuel company but couldn't get production up and costs down:
- they pivoted to low-volume, high-price beauty & personal care ingredients, which actually generated a bunch of revenue, but not enough to cover costs. and then also bought a ton of celebrity beauty brands, which didn't. 2022 stock plunge, 2023 bankruptcy.
- they're not terrible at industrial fermentation (compared to other synbio unicorns) and have some lessons learned
  - https://pmc.ncbi.nlm.nih.gov/articles/PMC7695652/
- they got in trouble with the SEC for recognizing more revenue than they actually made (according to standard accounting)
  - https://www.sec.gov/enforcement-litigation/administrative-proceedings/34-93341-s
- https://www.science.org/content/article/synthetic-biology-once-hailed-moneymaker-meets-tough-times bad times for biomanufacturing/synbio overall
  - there are kind of...zero large profitable firms founded after 2000 that specialize in industrial fermentation/biomanufacturing, EXCEPT a couple of biotechs that make enzyme drugs.
    - there's plenty of biomanufactured products but pretty much all from very large old boring firms at sorta commodity prices?

cole-wyeth on What is the most impressive game LLMs can play well?

Interesting, the prices seemed reasonable overall though I traded the later dates down a little bit because if LLMs haven't won be 2030 the paradigm is probably limited (IMO they hadn't priced in that update).

I suppose that it's a slightly "unfair" comparison because chess engines are very narrow and humans can't beat them either. How do LLMs compare to top human chess players?