LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

← previous page (newer posts) · next page (older posts) →

[question] How can we prevent AGI value drift?
Dakara (chess-ice) · 2024-11-20T18:19:24.375Z · answers+comments (6)

Importing Bluesky Comments
jefftk (jkaufman) · 2024-11-28T03:50:06.635Z · comments (0)

[link] Disentangling Representations through Multi-task Learning
Bogdan Ionut Cirstea (bogdan-ionut-cirstea) · 2024-11-24T13:10:26.307Z · comments (1)

[link] I, Token
Ivan Vendrov (ivan-vendrov) · 2024-11-25T02:20:35.629Z · comments (2)

[question] What epsilon do you subtract from "certainty" in your own probability estimates?
Dagon · 2024-11-26T19:13:46.795Z · answers+comments (6)

Curriculum of Ascension
andrew sauer (andrew-sauer) · 2024-11-07T23:54:18.983Z · comments (0)

[link] The lying p value
kqr · 2024-11-12T06:12:59.934Z · comments (6)

Registrations Open for 2024 NYC Secular Solstice & Megameetup
Joe Rogero · 2024-11-12T17:50:10.827Z · comments (0)

Crosspost: Developing the middle ground on polarized topics
juliawise · 2024-11-25T14:39:53.041Z · comments (15)

[question] Why is Gemini telling the user to die?
Burny · 2024-11-18T01:44:12.583Z · answers+comments (1)

Goal: Understand Intelligence
Johannes C. Mayer (johannes-c-mayer) · 2024-11-03T21:20:02.900Z · comments (19)

[link] [Linkpost] Building Altruistic and Moral AI Agent with Brain-inspired Affective Empathy Mechanisms
Gunnar_Zarncke · 2024-11-04T10:15:35.550Z · comments (0)

Paraddictions: unreasonably compelling behaviors and their uses
Michael Cohn (michael-cohn) · 2024-11-22T20:53:59.479Z · comments (0)

ML4Good (AI Safety Bootcamp) - Experience report
JanEbbing · 2024-11-05T01:18:43.554Z · comments (0)

The current state of RSPs
Zach Stein-Perlman · 2024-11-04T16:00:42.630Z · comments (0)

AXRP Episode 38.1 - Alan Chan on Agent Infrastructure
DanielFilan · 2024-11-16T23:30:09.098Z · comments (0)

GPT-4o Can In Some Cases Solve Moderately Complicated Captchas
dirk (abandon) · 2024-11-09T04:04:37.782Z · comments (2)

Sideloading: creating a model of a person via LLM with very large prompt
avturchin · 2024-11-22T16:41:28.293Z · comments (4)

The Three Warnings of the Zentradi
Trevor Hill-Hand (Jadael) · 2024-11-21T20:28:45.567Z · comments (1)

Reward Bases: A simple mechanism for adaptive acquisition of multiple reward type
Bogdan Ionut Cirstea (bogdan-ionut-cirstea) · 2024-11-23T12:45:01.067Z · comments (0)

Commenting Patterns by Platform
jefftk (jkaufman) · 2024-12-01T11:50:06.932Z · comments (0)

Fundamental Uncertainty: Chapter 9 - How do we live with uncertainty?
Gordon Seidoh Worley (gworley) · 2024-11-07T18:15:45.049Z · comments (2)

[link] Anthropic - The case for targeted regulation
anaguma · 2024-11-05T07:07:48.174Z · comments (0)

[link] LLMs Do Not Think Step-by-step In Implicit Reasoning
Bogdan Ionut Cirstea (bogdan-ionut-cirstea) · 2024-11-28T09:16:57.463Z · comments (0)

Exporting Facebook Comments, Again
jefftk (jkaufman) · 2024-11-30T12:40:07.339Z · comments (5)

Why We Wouldn't Build Aligned AI Even If We Could
Snowyiu · 2024-11-16T20:19:59.324Z · comments (7)

Fundamental Uncertainty: Epilogue
Gordon Seidoh Worley (gworley) · 2024-11-16T00:57:48.823Z · comments (0)

[link] Proposing the Conditional AI Safety Treaty (linkpost TIME)
otto.barten (otto-barten) · 2024-11-15T13:59:01.050Z · comments (8)

[question] What are some positive developments in AI safety in 2024?
Satron · 2024-11-15T10:32:39.541Z · answers+comments (5)

[question] Using hex to get murder advice from GPT-4o
Laurence Freeman (laurence-freeman) · 2024-11-13T18:30:23.475Z · answers+comments (5)

Expected Utility, Geometric Utility, and Other Equivalent Representations
StrivingForLegibility · 2024-11-20T23:28:21.826Z · comments (0)

Festival Stats 2024
jefftk (jkaufman) · 2024-11-12T02:00:04.831Z · comments (0)

Fractals to Quasiparticles
James Camacho (james-camacho) · 2024-11-26T20:19:29.675Z · comments (0)

[link] Book Review: Replacing Guilt - On Having Something to Fight For
Cole Killian (cole-killian) · 2024-11-03T19:47:35.093Z · comments (0)

Testing "True" Language Understanding in LLMs: A Simple Proposal
MtryaSam · 2024-11-02T19:12:34.710Z · comments (2)

I Have A New Paper Out Arguing Against The Asymmetry And For The Existence of Happy People Being Very Good
omnizoid · 2024-11-21T17:21:41.426Z · comments (3)

Force Sequential Output with SCP?
jefftk (jkaufman) · 2024-11-09T12:40:06.098Z · comments (4)

Don't Dismiss on Epistemics
ggex · 2024-11-19T00:44:05.329Z · comments (3)

Reflections on ML4Good
james__p · 2024-11-25T02:40:32.586Z · comments (0)

Arthropod (non) sentience
Arturo Macias (arturo-macias) · 2024-11-25T16:01:58.514Z · comments (8)

Contra Musician Gender II
jefftk (jkaufman) · 2024-11-13T03:30:09.510Z · comments (0)

The Bayesian Conspiracy Live Recording
Eneasz · 2024-11-06T16:25:13.380Z · comments (0)

[link] AISN #44: The Trump Circle on AI Safety Plus, Chinese researchers used Llama to create a military tool for the PLA, a Google AI system discovered a zero-day cybersecurity vulnerability, and Complex Systems
Corin Katzke (corin-katzke) · 2024-11-19T16:36:40.501Z · comments (0)

[link] Markets Are Information - Beating the Sportsbooks at Their Own Game
JJXW · 2024-11-07T20:58:43.389Z · comments (1)

[link] Do Large Language Models Perform Latent Multi-Hop Reasoning without Exploiting Shortcuts?
Bogdan Ionut Cirstea (bogdan-ionut-cirstea) · 2024-11-26T09:58:44.025Z · comments (0)

Do Deep Neural Networks Have Brain-like Representations?: A Summary of Disagreements
Joseph Emerson (joseph-emerson) · 2024-11-18T00:07:15.155Z · comments (0)

Is the mind a program?
EuanMcLean (euanmclean) · 2024-11-28T09:42:02.892Z · comments (41)

Rethinking Laplace's Rule of Succession
Cleo Nardo (strawberry calm) · 2024-11-22T18:46:25.156Z · comments (5)

Value/Utility: A History
Lorec · 2024-11-19T23:01:39.167Z · comments (0)

The grass is always greener in the environment that shaped your values
Karl Faulks (karl-faulks) · 2024-11-17T18:00:15.852Z · comments (0)

← previous page (newer posts) · next page (older posts) →

Archive

Recent comments

dagon on Is malice a real emotion?

I'm not sure it classifies as an emotion (nor does stupidity, for that matter), but it probably does exist as a motivation for some human acts, with the relevant emotion usually being anger.

I don't think your distinction (harm for its own sake, distinct from harm with a motivation) is real, unless you think there are un-caused actions in other realms, or you discount some motivations (like anger or hatred) as "not valid" for some reason.

joec on Magnitudes: Let's Comprehend the Incomprehensible!

One example of a web of interrelated facts that I have concerns molecular simulations, with bold/italic denoting things that I have in my anki deck, or would make good cards.

One interesting thing about moleculaes bouncing around is that a nanosecond, which sounds really short, is actually a decently long time. Consider that molecules at room temperature are typically moving at about the speed of sound (340 m/s) and a typical chemical bond length is about 0.1 to 0.2 nanometers. This means that a typical molecule (if nothing bumps into it) will go 1700-3400 bond-lengths in a nanosecond! Of course, molecules in liquid, which are jammed pretty close together, won't move that far without interruptions- they'll bump into each other, switch direction and bump into others many times over the course of a nanosecond. This means that the typical timestep (the when integrating the differential equations of motion) for a molecular dynamics simulation has to be much shorter. In practice, for a molecular dynamics simulation that simulates all the atoms of a system, $d t$ is about a femtosecond. With these timesteps, it becomes possible to simulate about a microsecond of simulation time per day of all atoms of a medium-sized protein moving around on a modern GPU like an A40. This is a big reason for why we can't just simulate a protein folding to crack the protein folding problem. Protein folding takes about a second or on the order of a million GPU-days if you were to simulate it.

screwtape on 2024 Unofficial LessWrong Census/Survey

Hrm. My definition of "anti-agathic" is something that prolongs life, so it isn't obviously not counting a brain transplant to a younger body.

I'm somewhat opposed to tweaking the wording on long-standing parts of the census, since that makes it harder to compare to earlier years. If we want to go this route, I'd rather write a new question and ask both some year so we can compare them.

neel-nanda-1 on You should consider applying to PhDs (soon!)

Do you know what topics within AI Safety you're interested in? Or are you unsure and so looking for something that lets you keep your options open?

npcollapse on Conjecture: A Roadmap for Cognitive Software and A Humanist Future of AI

Hi habryka, I don't really know how best to respond to such a comment. First, I would like to say thank you for your well-wishes, assuming you did not mean them sarcastically. Maybe I have lost the plot, and if so, I do appreciate help in recovering it. Secondly, I feel confused as to why you would say such things in general.

Just last month, me and my coauthors released a 100+ page explanation/treatise on AI extinction risk that gives a detailed account of where AGI risk comes from and how it works, which was received warmly by LW and the general public alike, and which continues to be updated and actively publicised.

In parallel, our sister org ControlAI, a non-profit policy advocacy org focused solely on extinction risk prevention I work with frequently, has had A Narrow Path, a similarly extensive writeup on principles of regulation to address xrisk from ASI, which me and ControlAI have pushed and discussed extensively with policy makers of multiple countries, and there are other regulation-promoting projects ongoing.

I have been on CNN, BBC, Fox News and other major news sources warning in no ambiguous terms about the risks. There is literally dozens of hours of podcast material, including from just last month, where I explain in excruciating depth the existential risk posed by AGI systems and where it comes from, and how it differs from other forms of AI risk. If you think all my previous material has "lost the plot", then well, I guess in your eyes I never had it, not much I can do.

This post is a technical agenda that is not framed in the usual LW ideological ontology, and has not been optimized to appeal to that audience, but rather to identify an angle that is tractable and generalizes the problem without losing its core, and leads to solutions that address the hard core, which is Complexity. In the limit, if we had beautifully simple, legible designs for ASIs that we fully understand and can predict, technical xrisk (but not governance) would be effectively solved. If you disagree with this, I would have greatly enjoyed your engagement with what object level points you think are wrong, and it may have helped me write a better roadmap.

But it seems to me that you have not even tried to engage with the content of this post at all, and have instead merely asserted it is a "random rant against AI-generated art" and "name-calling." I see no effort other than surface level pattern matching, or any curiosity to how it might fit with my previous writings and thinking that have been shared and discussed.

Do you truly think that's the best effort at engaging in good faith you can make?

If so, I don't know what I can say that would help. I hope we can both find the plot again, since neither of us seem to see it in the other person.

joec on Magnitudes: Let's Comprehend the Incomprehensible!

One thing that's useful for me is to draw analogies. For instance, the earth is about as big compared to the kilogram as benzene ( kg) is small.

joec on Magnitudes: Let's Comprehend the Incomprehensible!

That's true. The specific energy of antimatter is also actually double the "maximum" if you don't count the mass of the matter (1 gram of antimatter + 1 gram of air produces about 2 grams worth of energy). Funny enough, this is analogous to combustion fuel. The reason combustion fuel (on the order of 50 MJ/kg for most hydrocarbons) seems to be able to store much more energy than, say a high explosive (on the order of 5 MJ/kg) is because high explosives contain their own oxidizers, while combustion fuel uses the air as an oxidizer.

abramdemski on Ayn Rand’s model of “living money”; and an upside of burnout

Ah, yeah, sorry. I do think about this distinction more than I think about the actual model-based vs model-free distinction as defined in ML. Are there alternative terms you'd use if you wanted to point out this distinction? Maybe policy-gradient vs ... not policy-gradient?

matt-levinson on Beyond Gaussian: Language Model Representations and Distributions

I've uploaded the code to github.

lucid_levi_ackerman on Live Machinery: An Interface Design Philosophy for Wholesome AI Futures

Nice to hear people are making room for uncomfortable honesty and weirdness. Wish I could have attended.