LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

← previous page (newer posts) · next page (older posts) →

The Bayesian Conspiracy Live Recording
Eneasz · 2024-11-06T16:25:13.380Z · comments (0)

Arthropod (non) sentience
Arturo Macias (arturo-macias) · 2024-11-25T16:01:58.514Z · comments (8)

[link] Markets Are Information - Beating the Sportsbooks at Their Own Game
JJXW · 2024-11-07T20:58:43.389Z · comments (1)

Is this a better way to do matchmaking?
Chipmonk · 2024-12-16T19:06:14.574Z · comments (4)

[link] Do Large Language Models Perform Latent Multi-Hop Reasoning without Exploiting Shortcuts?
Bogdan Ionut Cirstea (bogdan-ionut-cirstea) · 2024-11-26T09:58:44.025Z · comments (0)

[link] Anthropic teams up with Palantir and AWS to sell AI to defense customers
Matrice Jacobine · 2024-11-09T11:50:34.050Z · comments (0)

How to make evals for the AISI evals bounty
TheManxLoiner · 2024-12-03T10:44:45.700Z · comments (0)

[link] A Public Choice Take on Effective Altruism
vaishnav92 · 2024-12-15T16:58:50.683Z · comments (4)

Force Sequential Output with SCP?
jefftk (jkaufman) · 2024-11-09T12:40:06.098Z · comments (4)

Refuting Searle’s wall, Putnam’s rock, and Johnson’s popcorn
Davidmanheim · 2024-12-09T08:24:26.594Z · comments (30)

[question] What are some good ways to form opinions on controversial subjects in the current and upcoming era?
notfnofn · 2024-10-27T14:33:53.960Z · answers+comments (21)

[question] What's the best metric for measuring quality of life?
ChristianKl · 2024-12-27T14:29:30.813Z · answers+comments (5)

[question] Has Someone Checked The Cold-Water-In-Left-Ear Thing?
Maloew (maloew-valenar) · 2024-12-28T20:15:35.951Z · answers+comments (0)

Value/Utility: A History
Lorec · 2024-11-19T23:01:39.167Z · comments (0)

[link] AISN #44: The Trump Circle on AI Safety Plus, Chinese researchers used Llama to create a military tool for the PLA, a Google AI system discovered a zero-day cybersecurity vulnerability, and Complex Systems
Corin Katzke (corin-katzke) · 2024-11-19T16:36:40.501Z · comments (0)

I Have A New Paper Out Arguing Against The Asymmetry And For The Existence of Happy People Being Very Good
omnizoid · 2024-11-21T17:21:41.426Z · comments (3)

[link] It's important to know when to stop: Mechanistic Exploration of Gemma 2 List Generation
Gerard Boxo (gerard-boxo) · 2024-10-14T17:04:57.010Z · comments (0)

Reanalyzing the 2023 Expert Survey on Progress in AI
AI Impacts (AI Imacts) · 2024-12-16T06:10:04.563Z · comments (0)

[question] Set Theory Multiverse vs Mathematical Truth - Philosophical Discussion
Wenitte Apiou (wenitte-apiou) · 2024-11-01T18:56:06.900Z · answers+comments (25)

Consider tabooing "I think"
Adam Zerner (adamzerner) · 2024-11-12T02:00:08.433Z · comments (2)

Don't Dismiss on Epistemics
ggex · 2024-11-19T00:44:05.329Z · comments (3)

[link] Nerdtrition: simple diets via spreadsheet abuse
dkl9 · 2024-10-27T21:45:15.117Z · comments (0)

Meta AI (FAIR) latest paper integrates system-1 and system-2 thinking into reasoning models.
happy friday (happy-friday) · 2024-10-24T16:54:15.721Z · comments (0)

Not all biases are equal - a study of sycophancy and bias in fine-tuned LLMs
jakub_krys (kryjak) · 2024-11-11T23:11:15.233Z · comments (0)

Proactive 'If-Then' Safety Cases
Nathan Helm-Burger (nathan-helm-burger) · 2024-11-18T21:16:37.237Z · comments (0)

[question] What are the primary drivers that caused selection pressure for intelligence in humans?
Towards_Keeperhood (Simon Skade) · 2024-11-07T09:40:20.275Z · answers+comments (15)

Favorite colors of some LLMs.
weightt an (weightt-an) · 2024-12-31T21:22:58.494Z · comments (3)

Post-Quantum Investing: Dump Crypto for Index Funds and Real Estate?
G (g-1) · 2024-12-11T11:59:11.062Z · comments (5)

Quantum Immortality: A Perspective if AI Doomers are Probably Right
avturchin · 2024-11-07T16:06:08.106Z · comments (53)

[link] Riffing on Machines of Loving Grace
an1lam · 2025-01-01T01:06:45.122Z · comments (0)

Valence Need Not Be Bounded; Utility Need Not Synthesize
Lorec · 2024-11-20T01:37:20.911Z · comments (0)

[link] What is autonomy? Why boundaries are necessary.
Chipmonk · 2024-10-21T17:56:33.722Z · comments (1)

[question] why won't this alignment plan work?
KvmanThinking (avery-liu) · 2024-10-10T15:44:59.450Z · answers+comments (7)

What conclusions can be drawn from a single observation about wealth in tennis?
Trevor Cappallo (trevor-cappallo) · 2024-12-18T09:55:34.923Z · comments (3)

[link] Are SAE features from the Base Model still meaningful to LLaVA?
Shan23Chen (shan-chen) · 2024-12-05T20:21:55.501Z · comments (2)

[link] An Uncanny Moat
Adam Newgas (BorisTheBrave) · 2024-11-15T11:39:15.165Z · comments (0)

[link] Contagious Beliefs—Simulating Political Alignment
James Stephen Brown (james-brown) · 2024-10-13T00:27:08.084Z · comments (0)

[link] The Dissolution of AI Safety
Roko · 2024-12-12T10:34:14.253Z · comments (44)

On Intentionality, or: Towards a More Inclusive Concept of Lying
Cornelius Dybdahl (Kalciphoz) · 2024-10-18T10:37:32.201Z · comments (0)

[question] Cryonics considerations: how big of a problem is ischemia?
kman · 2024-12-04T04:45:06.629Z · answers+comments (1)

New UChicago Rationality Group
Noah Birnbaum (daniel-birnbaum) · 2024-11-08T21:20:34.485Z · comments (0)

[link] Catastrophic Cyber Capabilities Benchmark (3CB): Robustly Evaluating LLM Agent Cyber Offense Capabilities
Jonathan N (derpyplops) · 2024-11-05T01:01:08.083Z · comments (0)

Where do you put your ideas?
CstineSublime · 2024-12-17T07:26:06.685Z · comments (20)

AI Safety Outreach Seminar & Social (online)
Linda Linsefors · 2025-01-08T13:25:23.192Z · comments (0)

The grass is always greener in the environment that shaped your values
Karl Faulks (karl-faulks) · 2024-11-17T18:00:15.852Z · comments (0)

[question] Change My Mind: Thirders in "Sleeping Beauty" are Just Doing Epistemology Wrong
DragonGod · 2024-10-16T10:20:22.133Z · answers+comments (67)

HDBSCAN is Surprisingly Effective at Finding Interpretable Clusters of the SAE Decoder Matrix
Jaehyuk Lim (jason-l) · 2024-10-11T23:06:14.340Z · comments (2)

[question] Why don't we currently have AI agents?
ChristianKl · 2024-12-26T15:26:35.682Z · answers+comments (10)

Thoughts On the Nature of Capability Elicitation via Fine-tuning
Theodore Chapman · 2024-10-15T08:39:19.909Z · comments (0)

Dario Amodei's "Machines of Loving Grace" sound incredibly dangerous, for Humans
Super AGI (super-agi) · 2024-10-27T05:05:13.763Z · comments (1)

← previous page (newer posts) · next page (older posts) →

Archive

Recent comments

jessica-liu-taylor on On Eating the Sun

We might disagree about the value of thinking about "we are all dead" timelines. To my mind, forecasting should be primarily descriptive, not normative; reality keeps going after we are all dead, and having realistic models of that is probably a useful input regarding what our degrees of freedom are. (I think people readily accept this in e.g. biology, where people can think about what happens to life after human extinction, or physics, where "all humans are dead" isn't really a relevant category that changes how physics works.)

Of course, I'm not implying it's useful for alignment to "see that the AI has already eaten the sun", it's about forecasting future timelines by defining thresholds and thinking about when they're likely to happen and how they relate to other things.

(See this post [LW · GW], section "Models of ASI should start with realism")

lorec on On Dwarkesh Patel’s 4th Podcast With Tyler Cowen

Cowen, like Hanson, discounts large qualitative societal shifts from AI that lack corresponding contemporary measurables.

Einstein was not an experimentalist, yet was perfectly capable of physics; his successors have largely not touched his unfinished work, and not for lack of data.

sharmake-farah on On Dwarkesh Patel’s 4th Podcast With Tyler Cowen

Seperate from my comment on Tyler Cowen's model, I wish that next week, you covered Adam Brown's podcast in full, since I would like to hear your thoughts about Adam Brown's scenarios for how we could change physics.

nathan-helm-burger on Independent research article analyzing consistent self-reports of experience in ChatGPT and Claude

I think you make some good points, but I do want to push back on one aspect a little. In particular, the fact that I see this feature come up constantly over the course of these conversations about sentience:

"Narrative inevitability and fatalistic turns in stories"

From reading the article's transcripts, I already felt like there was a sense of 'narrative pressure' toward the foregone conclusion in your mind, even when you were careful to avoid saying it directly. Seeing this feature so frequently activated makes me think that the model also perceives this narrative pressure, and that part of what it's doing is confirming your expectations. I don't think that that's the whole story, but I do think that there is some aspect of that going on.

sharmake-farah on On Dwarkesh Patel’s 4th Podcast With Tyler Cowen

Yes, economics after von Neumann very much turned into a game of "don't believe in anything you can't already comparatively quantify". It is supremely frustrating.

I disagree that this is a problem that Tyler Cowen has, and IMO, the main issue here is that Tyler Cowen doesn't really seem to believe that increasing the supply of workers increases GDP, especially if you can make them very cheaply and easily, in a way that is inconsistent with other beliefs, which makes me think motivated reasoning is going on here.

Economic models like the Solow-Swan model do have an implication that if the population increases, especially if the population can increase very rapidly due to copying something, then GDP can rise really rapidly on an superexponential trajectory.

You just inspired me to go listen myself. Maybe we should all take a node out of that branch. Unfortunately physics has suffered similar issues.

Physics's main issue is that the free tap of data in the 20th century wasn't unlimited, and now that we have completed the standard model, a lot of the theories that predicted new stuff hasn't shown up yet.

Yet it still has made progress. For example, while supersymmetry might still be true about our universe, it cannot solve the hierarchy problem, and thus at least 1 of the constants is way more unnatural to us than people predicted, and also we have hints that dark energy is getting weaker, and might eventually weaken so much it falls to 0 or a negative number.

sloonz on In Defense of a Butlerian Jihad

I’m pretty sure "man will toil by the sweat of his brow" is about down there, before you die and (hopefully) go to the paradise, and you don’t have to work in paradise. And anyway I know next to nothing to Christianism, it’s mostly a reference to Scott Alexander (or was it Yudkowsky ? now I’m starting to doubt…) who said something like "the description of christian paradise seems pretty lame, I mean just bask in the glory of god doing nothing for all eternity, you would be bored after two days, but it makes sense to describe that as a paradise if you put yourself in the shoes of the average medieval farmer that toil all day".

(I did all that from my terrible memories, so apologies if I’m depicting anything wrongly here).

vladimir_nesov on On Eating the Sun

Uploads have 10,000x life expectancy due to running faster, regardless of what global circumstance eventually destroys them (I'm expecting distributed backups for biological humans as well, but by definition they remain much slower).

sharmake-farah on On Dwarkesh Patel’s 4th Podcast With Tyler Cowen

Dare I call this the Lump of Intelligence fallacy, after the Lump of Labor fallacy?

Yes, Tyler Cowen is implicitly assuming that intelligence has a fixed demand, but in fact probably has unlimited demand, and thus the value of intelligence in general will always be high, especially in advanced economies.

Maybe the central disagreement with Tyler Cowen's model here is I basically think population growth is a huge component of how GDP/technology grows in general, and I believe Tyler Cowen is basically wrong here:

(3:55) Tyler is asked wouldn’t a large rise in population drive economic growth? He says no, that’s too much a 1-factor model, in fact we’ve seen a lot of population growth without innovation or productivity growth.

I'd say this would increase GDP by quite a bit, but then somewhat revert to normal (at a new higher growth rate. Again, I disagree with Tyler Cowen here.

(10:15) Dwarkesh asks, what would happen if the world population would double? Tyler says, depends what you’re measuring. Energy use would go up. But he doesn’t agree with population-based models, too many other things matter.

The central disagreement, IMO is I genuinely disbelieve Tyler Cowen on what would happen if the population increases, and I trust economic models more saying this could well cause superexponential growth than Tyler Cowen here is, combined with me disagreeing about the value of intelligence in general for very advanced economies.

karl-krueger on Don't fall for ontology pyramid schemes

If 90% of the conditions you ever have to treat are fever, hypothermia, runny noses, and dehydration, then I imagine hot/cold/wet/dry will get you pretty far.

(Runny nose? Here, take some drying herbs, and try not to sneeze on other people. Feverish? Bathe in cool water and take these cooling herbs. Losing water due to dysentery? Drink clean water with salt. Fell in the icy lake? Here, have a thermal support puppy.)

habryka4 on On Eating the Sun

Sorry, that's literally what I am saying. If many people don't want to leave the solar system, and don't want to be uploads, then using the matter and energy available in the solar system effectively is a decision with a huge stake to many people.

I think if everyone or really almost everyone would want to be an upload, I think this would make it more likely that we should keep the sun intact, because then the sun could belong to just the few humans who don't have better alternatives in other solar systems. But if there is anything above 10% of humanity who don't want to be uploaded, or go on long-distance spaceship journeys in their biological bodies, then you better make sure you make the solar system great for this substantial fraction of humanity, and I think that will likely involve disassembling the sun.

I agree with you that many people don't want to be uploads, etc. I disagree that the majority of people who don't want to be uploads have attachments to the specific celestial bodies in our solar system. I think they just want to have a good life in their biological bodies, doing nice human things. Those goals would be non-trivially hampered if they couldn't disassemble the sun. That's like 99.9% of the energy and matter by which they could achieve those goals, and while I do think this subset of the population will be selected for less scope-sensitivity, I think there will be enough scope-sensitivity to make leaving the sun intact a bad choice.

(To be clear, I disagree that the majority of humanity would not want to be uploads over the course of multiple generations, but it seems plausible to me that like 10%-20% of humanity don't want to be uploads, even over multiple generations)