LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

← previous page (newer posts) · next page (older posts) →

Glitch Token Catalog - (Almost) a Full Clear
Lao Mein (derpherpize) · 2024-09-21T12:22:16.403Z · comments (3)

OODA your OODA Loop
Raemon · 2024-10-11T00:50:48.119Z · comments (3)

Compelling Villains and Coherent Values
Cole Wyeth (Amyr) · 2024-10-06T19:53:47.891Z · comments (4)

(Appetitive, Consummatory) ≈ (RL, reflex)
Steven Byrnes (steve2152) · 2024-06-15T15:57:39.533Z · comments (1)

Is This Lie Detector Really Just a Lie Detector? An Investigation of LLM Probe Specificity.
Josh Levy (josh-levy) · 2024-06-04T15:45:54.399Z · comments (0)

Resolving von Neumann-Morgenstern Inconsistent Preferences
niplav · 2024-10-22T11:45:20.915Z · comments (5)

Dialogue on What It Means For Something to Have A Function/Purpose
johnswentworth · 2024-07-15T16:28:56.609Z · comments (5)

AI Safety Camp 10
Robert Kralisch (nonmali-1) · 2024-10-26T11:08:09.887Z · comments (9)

[link] An X-Ray is Worth 15 Features: Sparse Autoencoders for Interpretable Radiology Report Generation
hugofry · 2024-10-07T08:53:14.658Z · comments (0)

[link] On what research policymakers actually need
MondSemmel · 2024-04-23T19:50:12.833Z · comments (0)

Inducing Unprompted Misalignment in LLMs
Sam Svenningsen (sven) · 2024-04-19T20:00:58.067Z · comments (7)

[link] Tinker
Richard_Ngo (ricraz) · 2024-04-16T18:26:38.679Z · comments (0)

[link] Elon files grave charges against OpenAI
mako yass (MakoYass) · 2024-03-01T17:42:13.963Z · comments (10)

Drug development costs can range over two orders of magnitude
rossry · 2024-11-03T23:13:17.685Z · comments (0)

Mud and Despair (Part 4 of "The Sense Of Physical Necessity")
LoganStrohl (BrienneYudkowsky) · 2024-03-07T00:14:23.975Z · comments (0)

Monthly Roundup #14: January 2024
Zvi · 2024-01-24T12:50:09.231Z · comments (22)

Mech Interp Lacks Good Paradigms
Daniel Tan (dtch1997) · 2024-07-16T15:47:32.171Z · comments (0)

Index of rationalist groups in the Bay Area July 2024
Lucie Philippon (lucie-philippon) · 2024-07-26T16:32:25.337Z · comments (14)

AI #48: The Talk of Davos
Zvi · 2024-01-25T16:20:26.625Z · comments (9)

Tort Law Can Play an Important Role in Mitigating AI Risk
Gabriel Weil (gabriel-weil) · 2024-02-12T17:17:59.135Z · comments (9)

Evaluating Sparse Autoencoders with Board Game Models
Adam Karvonen (karvonenadam) · 2024-08-02T19:50:21.525Z · comments (1)

[link] Things You're Allowed to Do: At the Dentist
rbinnn · 2024-01-28T18:39:33.584Z · comments (16)

Making a Secular Solstice Songbook
jefftk (jkaufman) · 2024-01-23T19:40:05.055Z · comments (6)

[link] Simple Kelly betting in prediction markets
jessicata (jessica.liu.taylor) · 2024-03-06T18:59:18.243Z · comments (3)

Free Will and Dodging Anvils: AIXI Off-Policy
Cole Wyeth (Amyr) · 2024-08-29T22:42:24.485Z · comments (12)

From Finite Factors to Bayes Nets
J Bostock (Jemist) · 2024-01-23T20:03:51.845Z · comments (7)

[link] Generative ML in chemistry is bottlenecked by synthesis
Abhishaike Mahajan (abhishaike-mahajan) · 2024-09-16T16:31:34.801Z · comments (2)

The "context window" analogy for human minds
Ruby · 2024-02-13T19:29:10.387Z · comments (0)

[link] Win Friends and Influence People Ch. 2: The Bombshell
gull · 2024-01-28T21:40:47.986Z · comments (13)

Are we so good to simulate?
KatjaGrace · 2024-03-04T05:20:03.535Z · comments (24)

[link] I didn't have to avoid you; I was just insecure
Chipmonk · 2024-08-17T16:41:50.237Z · comments (7)

AI #49: Bioweapon Testing Begins
Zvi · 2024-02-01T15:30:04.690Z · comments (11)

UDT1.01: The Story So Far (1/10)
Diffractor · 2024-03-27T23:22:35.170Z · comments (6)

We’re not as 3-Dimensional as We Think
silentbob · 2024-08-04T14:39:16.799Z · comments (16)

Turning Your Back On Traffic
jefftk (jkaufman) · 2024-07-17T01:00:08.627Z · comments (7)

Deconfusing In-Context Learning
Arjun Panickssery (arjun-panickssery) · 2024-02-25T09:48:17.690Z · comments (1)

[link] Twitter thread on AI takeover scenarios
Richard_Ngo (ricraz) · 2024-07-31T00:24:33.866Z · comments (0)

[link] Increasing IQ is trivial
George3d6 · 2024-03-01T22:43:32.037Z · comments (61)

[question] Is a random box of gas predictable after 20 seconds?
Thomas Kwa (thomas-kwa) · 2024-01-24T23:00:53.184Z · answers+comments (35)

The Defence production act and AI policy
[deleted] · 2024-03-01T14:26:09.064Z · comments (0)

Your LLM Judge may be biased
Henry Papadatos (henry) · 2024-03-29T16:39:22.534Z · comments (9)

[link] Shifting Headspaces - Transitional Beast-Mode
Jonathan Moregård (JonathanMoregard) · 2024-08-12T13:02:06.120Z · comments (9)

[link] Turning 22 in the Pre-Apocalypse
testingthewaters · 2024-08-22T20:28:25.794Z · comments (14)

Building Big Science from the Bottom-Up: A Fractal Approach to AI Safety
Lauren Greenspan (LaurenGreenspan) · 2025-01-07T03:08:51.447Z · comments (2)

Thousands of malicious actors on the future of AI misuse
Zershaaneh Qureshi (zershaaneh-qureshi) · 2024-04-01T10:08:42.357Z · comments (0)

Medical Roundup #2
Zvi · 2024-04-09T13:40:05.908Z · comments (18)

Gated Attention Blocks: Preliminary Progress toward Removing Attention Head Superposition
cmathw · 2024-04-08T11:14:43.268Z · comments (4)

[link] A High Decoupling Failure
Maxwell Tabarrok (maxwell-tabarrok) · 2024-04-14T19:46:09.552Z · comments (5)

The murderous shortcut: a toy model of instrumental convergence
Thomas Kwa (thomas-kwa) · 2024-10-02T06:48:06.787Z · comments (0)

Mental Masturbation and the Intellectual Comfort Zone
Declan Molony (declan-molony) · 2024-05-07T05:47:05.257Z · comments (2)

← previous page (newer posts) · next page (older posts) →

Archive

Recent comments

lc on Shortform

I am a little confused as to why Israel does not have the hostages yet. My understanding was that Israel has essentially taken control of Gaza and decimated the Hamas leadership. Who are they even "negotiating" with to secure their release? Why can't the IDF just kidnap and waterboard that person to get the location of the remaining prisoners? Does the person with the authority to make a deal also not know? Are there clandestine cells of Hamas personnel hiding in a basement somewhere waiting for some "signal" from a third party to give up the Israelis?

viliam on Jimrandomh's Shortform

Ah, yes. Recently I volunteered for a medical research along with 3 other people I know. Two of them dropped out in the middle. I can't imagine how any medical research can be methodologically valid this way. On the other hand, me and the other person stayed there, and it's almost over, so the success rate is 50%.

lc on [New Feature] Your Subscribed Feed

Somehow I just found out about this. Although this happened within a few minutes of me trying to use it:

steve2152 on Heritability: Five Battles

OK gotcha. But I can just rephrase slightly, let me try again:

(B’’) “…Gee, I guess this reprocessing must have been a kind of ‘training / practice / exercise’ during which I could forge new better subconscious habits and associations related to ‘type-of-situation X’ (which used to invoke anxiety). And these new subconscious habits and associations are now serving me well when I encounter type-of-situation X (or anything that vaguely rings of it) an adult context too.”

After all, you can’t form new subconscious habits and associations related to “type-of-situation X” except by making “type-of-situation X” thoughts active somehow during that process. It seems plausible to me that invoking a childhood memory where type-of-situation X triggered unhealthy anxiety would be very effective way to do that.

~~

I think what I’m suggesting is not that different from what you’re suggesting. Maybe the difference is when you wrote “…some specific childhood experiences that had to do with spiders, that seemed to be at the root of the phobia…”.

My mental image is, like, there’s some neuron in the amygdala, and one day in childhood it forms Synapse S connecting some input related to the idea of spiders with some output related to fear reactions. Then the goal for the adult therapy session is to delete Synapse S (or form different connections that counteract its effects, or whatever). Basically, my proposal is:

One day in childhood → Synapse S forms

Adult sees spider → Synapse S → fear reactions

I’m contrasting that with:

[What I don’t believe, but it sounds like maybe you do?] Adult sees spider → childhood memory reactivates, at least a little bit → fear reactions

In other words, I want to say that the childhood experience is “at the root of the phobia” as a matter of the historical record of how Synapse S came to be there, but it’s not “at the root of the phobia” in the sense of the episodic memory itself playing a critical causal role in the real-time anxiety reaction.

…And I’m saying that my hypothesis would nevertheless be compatible with childhood-memory-based therapies being effective, because invoking the actual episodic childhood memory itself, in a therapeutic context, is one possible path to delete or inactivate Synapse S.

Well, hmm, on second thought, I guess both stories are possible, maybe they coexist.

abandon on Ureshiku Naritai

Huh—that sounds fascinatingly akin to this description of how to induce first jhana I read the other day.

auspicious on Passages I Highlighted in The Letters of J.R.R.Tolkien

I love these quotes too, but while reading them a funny thought struck me. Fantasy terms like "elves" and "orcs" seem normal to us now, but Tolkien basically invented their modern usage. At the time he was writing to his son they would have been very new and only used that way by Tolkien himself.

Substituting Tolkien's terms with equivalents from Starcraft makes one of these passages sound ridiculous:

An ultimately evil job. For we are attempting to conquer Kerrigan with the Hivemind. And we shall (it seems) succeed. But the penalty is, as you will know, to breed new Kerrigans, and slowly turn Terrans and Protoss into Zerg. Not that in real life things are as clear cut as in a story, and we started out with a great many Zerg on our side … Well, there you are: an SCV amongst the Hydralisks.

Why is this, and would the passage have sounded just as goofy back in the 1940s?

Is it just because the Starcraft terms are less mainstream? Perhaps sci-fi terms are generally less graceful than fantasy ones? Or maybe Tolkien had a special sense for phrasing and names like "Sauron" and "Urukhai" would have sounded just as profound then as they do now?

moonlight on The Plan - 2023 Version

Discussed tangible directions for research in agent foundations, which was really useful for helping me find a foothold for what people in this field "actually" work on.

I'm also keen in general of this approach of talking about your plans and progress yearly, I think it would be great if everyone doing important things (research and else) would publish something similar. It helps with perspective building of both the person writing the post itself, but also about how the field has changed as seen through their eyes.

dagon on We probably won't just play status games with each other after AGI

It's not clear that "a human which doesn't care about perceived status" is actually human. A lot depends on whether you consider the AIs that populate the solar system after biological intelligence is obsolete to be "descendants" or "replacements" of today's humans.

anthonyc on Voluntary Salary Reduction

How does this interact with MA's salary transparency laws? If you are in a role where no one else shares your title, then no problem. Otherwise, this could enable an employer to pressure others to take pay cuts or smaller raises, or it could force them to tell prospective new employees a much lower lower bound in the salary range for the role they're applying to.

christiankl on Parasites (not a metaphor)

What dose do you believe to be good for that?