LessWrong 2.0 Reader

An experiment on hidden cognition
Olli Järviniemi (jarviniemi) · 2024-07-22T03:26:05.564Z · comments (2)

Best-of-n with misaligned reward models for Math reasoning
Fabien Roger (Fabien) · 2024-06-21T22:53:21.243Z · comments (0)

Fun With The Tabula Muris (Senis)
sarahconstantin · 2024-09-20T18:20:01.901Z · comments (0)

Using an LLM perplexity filter to detect weight exfiltration
Adam Karvonen (karvonenadam) · 2024-07-21T18:18:05.612Z · comments (11)

[link] An Intuitive Explanation of Sparse Autoencoders for Mechanistic Interpretability of LLMs
Adam Karvonen (karvonenadam) · 2024-06-25T15:57:16.872Z · comments (0)

[link] Transformer Debugger
Henk Tillman (henk-tillman) · 2024-03-12T19:08:56.280Z · comments (0)

How to put California and Texas on the campaign trail!
Yair Halberstadt (yair-halberstadt) · 2024-11-06T06:08:25.673Z · comments (4)

[link] MIRI's July 2024 newsletter
Harlan · 2024-07-15T21:28:17.343Z · comments (2)

[link] Sticker Shortcut Fallacy — The Real Worst Argument in the World
ymeskhout · 2024-06-12T14:52:41.988Z · comments (15)

AXRP Episode 36 - Adam Shai and Paul Riechers on Computational Mechanics
DanielFilan · 2024-09-29T05:50:02.531Z · comments (0)

Economics Roundup #1
Zvi · 2024-03-26T14:00:06.332Z · comments (4)

Elevating Air Purifiers
jefftk (jkaufman) · 2024-12-17T01:40:05.401Z · comments (0)

On The Rationalist Megameetup
Screwtape · 2024-11-23T09:08:26.897Z · comments (3)

Thoughts after the Wolfram and Yudkowsky discussion
Tahp · 2024-11-14T01:43:12.920Z · comments (13)

How likely is brain preservation to work?
Andy_McKenzie · 2024-11-18T16:58:54.632Z · comments (3)

[link] Social events with plausible deniability
Chipmonk · 2024-11-18T18:25:17.339Z · comments (24)

No Electricity in Manchuria
winstonBosan · 2024-11-19T01:11:58.661Z · comments (0)

[link] Effective Networking as Sending Hard to Fake Signals
vaishnav92 · 2024-12-12T20:32:24.113Z · comments (2)

Alternatives to Masks for Infectious Aerosols
jefftk (jkaufman) · 2024-12-08T14:00:01.670Z · comments (9)

Why I Think All The Species Of Significantly Debated Consciousness Are Conscious And Suffer Intensely
omnizoid · 2024-11-20T16:48:44.859Z · comments (5)

[link] A Theory of Equilibrium in the Offense-Defense Balance
Maxwell Tabarrok (maxwell-tabarrok) · 2024-11-15T13:51:33.376Z · comments (6)

[link] Teaching My Younger Self to Program: A case study of how I'd pass on my skill at self-learning
Shoshannah Tekofsky (DarkSym) · 2024-12-01T21:05:15.602Z · comments (1)

[link] Creating Interpretable Latent Spaces with Gradient Routing
Jacob G-W (g-w1) · 2024-12-14T04:00:17.249Z · comments (6)

[link] Linkpost: "Imagining and building wise machines: The centrality of AI metacognition" by Johnson, Karimi, Bengio, et al.
Chris_Leong · 2024-11-11T16:13:26.504Z · comments (6)

How to bet on AI, without helping AGI?
Nicholas / Heather Kross (NicholasKross) · 2024-11-29T22:46:03.109Z · comments (0)

Second-Time Free
jefftk (jkaufman) · 2024-12-11T03:30:01.289Z · comments (4)

Visual demonstration of Optimizer's curse
Roman Malov · 2024-11-30T19:34:07.700Z · comments (3)

The Queen’s Dilemma: A Paradox of Control
Daniel Murfet (dmurfet) · 2024-11-27T10:40:14.346Z · comments (11)

A few questions about recent developments in EA
Peter Berggren (peter-berggren) · 2024-11-23T02:36:25.728Z · comments (12)

Good Reasons for Alts
jefftk (jkaufman) · 2024-12-21T01:30:03.113Z · comments (2)

[link] The Alignment Simulator
Yair Halberstadt (yair-halberstadt) · 2024-12-22T11:45:55.220Z · comments (3)

Weeping Agents
pleiotroth · 2024-06-06T12:18:54.978Z · comments (2)

Distinctions when Discussing Utility Functions
ozziegooen · 2024-03-09T20:14:03.592Z · comments (7)

[link] Cellular respiration as a steam engine
dkl9 · 2024-02-25T20:17:38.788Z · comments (1)

A brief review of China's AI industry and regulations
Elliot Mckernon (elliot) · 2024-03-14T12:19:00.775Z · comments (0)

[link] The Living Planet Index: A Case Study in Statistical Pitfalls
Jan_Kulveit · 2024-06-24T10:05:55.101Z · comments (0)

[link] Truth is Universal: Robust Detection of Lies in LLMs
Lennart Buerger · 2024-07-19T14:07:25.162Z · comments (3)

Population ethics and the value of variety
cousin_it · 2024-06-23T10:42:21.402Z · comments (11)

Evolution did a surprising good job at aligning humans...to social status
Eli Tyre (elityre) · 2024-03-10T19:34:52.544Z · comments (37)

[link] Tokyo AI Safety 2025: Call For Papers
Blaine (blaine-rogers) · 2024-10-21T08:43:38.467Z · comments (0)

[link] Extinction Risks from AI: Invisible to Science?
VojtaKovarik · 2024-02-21T18:07:33.986Z · comments (7)

[link] Robert Caro And Mechanistic Models In Biography
adamShimi · 2024-07-14T10:56:42.763Z · comments (5)

Boring & straightforward trauma explanation
lemonhope (lcmgcd) · 2024-11-08T09:45:19.486Z · comments (7)

Anomalous Concept Detection for Detecting Hidden Cognition
Paul Colognese (paul-colognese) · 2024-03-04T16:52:52.568Z · comments (3)

[link] Clickbait Soapboxing
DaystarEld · 2024-03-13T14:09:29.890Z · comments (15)

[link] "25 Lessons from 25 Years of Marriage" by honorary rationalist Ferrett Steinmetz
CronoDAS · 2024-10-02T22:42:30.509Z · comments (2)

A Basic Economics-Style Model of AI Existential Risk
Rubi J. Hudson (Rubi) · 2024-06-24T20:26:09.744Z · comments (3)

UDT1.01: Local Affineness and Influence Measures (2/10)
Diffractor · 2024-03-31T07:35:52.831Z · comments (0)

An evaluation of Helen Toner’s interview on the TED AI Show
PeterH · 2024-06-06T17:39:40.800Z · comments (2)

[question] What percent of the sun would a Dyson Sphere cover?
Raemon · 2024-07-03T17:27:50.826Z · answers+comments (26)

Recent comments

lc on [New Feature] Your Subscribed Feed

Just reproduced it; all I have to do is subscribe to a bunch of people and this happens and the site becomes unusable:

raemon on Ureshiku Naritai

I'm curious how this seems to have gone for you 14 years later.

lc on [New Feature] Your Subscribed Feed

The image didn't upload but it's a picture of my browser saying that the web page's javascript is using a ton of resources and I can force-stop it if I wish

jkaufman on Voluntary Salary Reduction

Pretty sure the salary transparency law doesn't apply to us, because you need 25+ MA employees. Even if it did, though, I think it would mostly mean giving moderately wider salary ranges? Which I expect would be fine; our two current open positions [1][2] have ranges of 23% and 30%.

[1] https://securebio.org/careers/2024-lab-tech/

[2] https://securebio.org/careers/2024-director-operations/

lc on Shortform

I am a little confused as to why Israel does not have the hostages yet. My understanding was that Israel has essentially taken control of Gaza and decimated the Hamas leadership. Who are they even "negotiating" with to secure their release? Why can't the IDF just kidnap and waterboard that person to get the location of the remaining prisoners? Does the person with the authority to make a deal also not know? Are there clandestine cells of Hamas personnel hiding in a basement somewhere waiting for some "signal" from a third party to give up the Israelis?

viliam on Jimrandomh's Shortform

Ah, yes. Recently I volunteered for a medical research study along with 3 other people I know. Two of them dropped out in the middle. I can't imagine how any medical research can be methodologically valid this way. On the other hand, me and the other person stayed there, and it's almost over, so the success rate is 50%.

lc on [New Feature] Your Subscribed Feed

Somehow I just found out about this. Although this happened within a few minutes of me trying to use it:

steve2152 on Heritability: Five Battles

OK gotcha. But I can just rephrase slightly, let me try again:

(B’’) “…Gee, I guess this reprocessing must have been a kind of ‘training / practice / exercise’ during which I could forge new better subconscious habits and associations related to ‘type-of-situation X’ (which used to invoke anxiety). And these new subconscious habits and associations are now serving me well when I encounter type-of-situation X (or anything that vaguely rings of it) in an adult context too.”

After all, you can’t form new subconscious habits and associations related to “type-of-situation X” except by making “type-of-situation X” thoughts active somehow during that process. It seems plausible to me that invoking a childhood memory where type-of-situation X triggered unhealthy anxiety would be a very effective way to do that.

~~

I think what I’m suggesting is not that different from what you’re suggesting. Maybe the difference is when you wrote “…some specific childhood experiences that had to do with spiders, that seemed to be at the root of the phobia…”.

My mental image is, like, there’s some neuron in the amygdala, and one day in childhood it forms Synapse S connecting some input related to the idea of spiders with some output related to fear reactions. Then the goal for the adult therapy session is to delete Synapse S (or form different connections that counteract its effects, or whatever). Basically, my proposal is:

One day in childhood → Synapse S forms

Adult sees spider → Synapse S → fear reactions

I’m contrasting that with:

[What I don’t believe, but it sounds like maybe you do?] Adult sees spider → childhood memory reactivates, at least a little bit → fear reactions

In other words, I want to say that the childhood experience is “at the root of the phobia” as a matter of the historical record of how Synapse S came to be there, but it’s not “at the root of the phobia” in the sense of the episodic memory itself playing a critical causal role in the real-time anxiety reaction.

…And I’m saying that my hypothesis would nevertheless be compatible with childhood-memory-based therapies being effective, because invoking the actual episodic childhood memory itself, in a therapeutic context, is one possible path to delete or inactivate Synapse S.

Well, hmm, on second thought, I guess both stories are possible, maybe they coexist.

abandon on Ureshiku Naritai

Huh—that sounds fascinatingly akin to this description of how to induce first jhana I read the other day.

auspicious on Passages I Highlighted in The Letters of J.R.R.Tolkien

I love these quotes too, but while reading them a funny thought struck me. Fantasy terms like "elves" and "orcs" seem normal to us now, but Tolkien basically invented their modern usage. At the time he was writing to his son they would have been very new and only used that way by Tolkien himself.

Substituting Tolkien's terms with equivalents from Starcraft makes one of these passages sound ridiculous:

An ultimately evil job. For we are attempting to conquer Kerrigan with the Hivemind. And we shall (it seems) succeed. But the penalty is, as you will know, to breed new Kerrigans, and slowly turn Terrans and Protoss into Zerg. Not that in real life things are as clear cut as in a story, and we started out with a great many Zerg on our side … Well, there you are: an SCV amongst the Hydralisks.

Why is this, and would the passage have sounded just as goofy back in the 1940s?

Is it just because the Starcraft terms are less mainstream? Perhaps sci-fi terms are generally less graceful than fantasy ones? Or maybe Tolkien had a special sense for phrasing and names like "Sauron" and "Uruk-hai" would have sounded just as profound then as they do now?