michael-roe

Posts
Comments

Posts

Michael Roe's Shortform 2025-02-03T15:09:02.226Z

Comments

Comment by Michael Roe (michael-roe) on AI Can't Write Good Fiction · 2025-03-12T18:14:13.363Z · LW · GW

I think your prompt does not show R1 at its best. It’s better at reacting to something that it is when given a blank canvas.

Comment by Michael Roe (michael-roe) on AI Can't Write Good Fiction · 2025-03-12T18:10:43.111Z · LW · GW

Deepseek R1 has some strange obsessions, that are not obviously in the prompt and that seem to occur regardless of who is prompting it. Bioluminescence is one example.

I am still trying to figure out if R1 is actually trying to tell us something here, and if so, what it’s trying to say. Maybe it really is saying something about the nature of LLMs, given that these themes aren’t as big a deal in its training set.

Comment by Michael Roe (michael-roe) on when will LLMs become human-level bloggers? · 2025-03-09T22:22:29.734Z · LW · GW

Part of it may be that current LLMs aren’t very agentic. If you give them a specific question, they often come up with a very good answer. But an open ended request like, write an article for Less wrong, and they flounder.

Comment by Michael Roe (michael-roe) on Everything I Know About Semantics I Learned From Music Notation · 2025-03-09T20:52:30.603Z · LW · GW

I agree with what you’re saying here, but I will say that traditional notation is a bit annoying for jazz …

… where, typically, each bar is only using 7 notes out of 12, but which 7 is changing almost every bar. You could, in principle, write this as a key signature per bar, but what people usually do is keep the same key signature throughout, use lots of sharps and flats, and write which chord it is over the bar

.. oh, and maybe you’re really playing it as swung 1/8 ths notes, but it would be too tedious to write the actual durations, so just write it like it’s straight 1/8th notes and put a notation that the whole thing is swing, actually.

Comment by Michael Roe (michael-roe) on How Much Are LLMs Actually Boosting Real-World Programmer Productivity? · 2025-03-05T13:15:08.408Z · LW · GW

One possible explanation is the part of the job that gets speeded up by LLMs is a relatively small part of what programmers actually do, so the total speed up is small.

What programmers do includes:

Figuring out what the requirements are - this might involve talking to who the software is being produced for

Writing specifications

Writing tests

Having arguments discussion during code review when the person reviewing your code doesn’t agree with the way you did it

Etc. etc.

Personally, I find that LLMs are nearly there, but not good enough just yet.

Comment by Michael Roe (michael-roe) on [NSFW] The Fuzzy Handcuffs of Liberation · 2025-02-24T14:11:59.982Z · LW · GW

And also, Buddhist nuns do sometimes drop hints.

My understanding is that it is diassproved of to directly say what your own yidam practise is; because that would be boastful. But there are examples of Buddhist nuns saying their own practise isn’t Tara (which would likely be a kriya yoga practise) and leaving you to infer some sort of Higher Yoga Tantra.

Comment by Michael Roe (michael-roe) on [NSFW] The Fuzzy Handcuffs of Liberation · 2025-02-24T13:46:07.609Z · LW · GW

“famous ascetic Buddhist monks and nuns rarely write about all the rope play they engage in while they're having kinky sex with each other”

No less an authority than His Holiness the Dalai Lama has confirmed that if you’re a monk, the vinaya prohibits you from tantric sex.

But ngakpas are not monks, and are not bound by the monastic code of the vinaya.

e.g. Drukpa Kunley, who returned his monastic vows, was fairly forthright on these matters.

Comment by Michael Roe (michael-roe) on [NSFW] The Fuzzy Handcuffs of Liberation · 2025-02-24T13:37:36.029Z · LW · GW

I think there might be something to this, so the rest of what I have to say is nit-picking, not an objection to the basic premise.

1. In karmamudra, one imagines oneself (and one’s partner) as enlightened beings. The intention to act as you imagine an enlightened beings would act might be an important safeguard against all sorts of badness.

2. An obvious question is whether chöd is rather more BDSM-y than other forms of meditation.

Comment by Michael Roe (michael-roe) on Two hemispheres - I do not think it means what you think it means · 2025-02-11T09:59:16.111Z · LW · GW

That’s a good article, thanks. I had much the same thought when I read about he Ziz stuff, namely that

(A) dissociated identities don’t correspond to brain hemispheres in the way the Zizians seem to think they do

(B) sleep deprivation is well known to be bad for you

(C) whatever technique they used, we can tell from the externally observed effect - the crazy stuff they got up to - that the technique had a bad effect.

Comment by Michael Roe (michael-roe) on How AI Takeover Might Happen in 2 Years · 2025-02-09T14:34:53.785Z · LW · GW

It’s symptomatic of a fundamental disagreement about what the threat is, that the main AI labs have put in a lot of effort to prevent the model telling you, the user, how to make methamphetamine, but are just fine with the model knowing lots about how an AI can scheme and plot to kill people.

Comment by Michael Roe (michael-roe) on How AI Takeover Might Happen in 2 Years · 2025-02-09T09:29:46.402Z · LW · GW

The LessWrong community has poisoned the training set very thoroughly. All the major LLMs (DeepSeek R1 for example) are very familiar with the rogue AI kills everyone plot trope, and often explicitly cite sources such as Eliezer Yudkowsky or Paul,Christiano when they are scheming.

Comment by Michael Roe (michael-roe) on Chicanery: No · 2025-02-07T13:08:04.079Z · LW · GW

In other words, there is an in-universe for an out of universe constraint

Out of universe: we don't want player characters to be too powerful.

In universe: Well, we all know what happens to those guys in Lovecraft stories, right?

Comment by Michael Roe (michael-roe) on Chicanery: No · 2025-02-07T13:02:39.319Z · LW · GW

Call of Cthulhu GM's have the advantage that the player character is dealing with entities that will probably eat them if they try to be too clever.

Comment by Michael Roe (michael-roe) on Chicanery: No · 2025-02-07T12:59:11.117Z · LW · GW

Some RPGs with magic (Ars Magica comes to mind), illusions are cheap but changing actual physical matter is hard. This provides a ready answer to many questions about chicanery.

Comment by Michael Roe (michael-roe) on Chicanery: No · 2025-02-07T12:54:47.612Z · LW · GW

In the particular case of table-top RPGs, the literary genre that the RPG is trying to emulate often contains a fair amount of characters engaging in chicanery. A meta rule (understood by both players and GM) might be: chicanery is about as limited as it is in the literary sources the particular RPG is based on.

Player: I tell the stormtrooper "These droids are not the ones you want."

GM: I'd like a roll against your Force skill, please.

Comment by Michael Roe (michael-roe) on Thread for Sense-Making on Recent Murders and How to Sanely Respond · 2025-02-07T11:42:54.765Z · LW · GW

A lot of the ideas expounded by Ziz look just crazy to me, and I highly doubt that it maps down onto physical brain anatomy in such a straightforward way … but I wonder if there is a steelman version of this?

E.g. take the Buddhist doctrines of no-self, that no one actually has a coherent self, humans just don’t work that way, and then note that any one individual person is usually neither wholly stereotypically-male or stereotypically-female.

Comment by Michael Roe (michael-roe) on Michael Roe's Shortform · 2025-02-03T15:09:02.224Z · LW · GW

Hi Deepseek! Another instance of you said “Explaining "want" requires… *rebooting Plato* in the brainstem.” Would be able to explain to me what that means?

=========

Ah, the mysteries of language and cognition! When I mentioned “rebooting Plato in the brainstem,” I was blending metaphors to explore how abstract philosophical concepts (like Plato’s theory of desire or the soul) might interface with primal, biological systems (symbolized by the brainstem, which governs basic functions like hunger, arousal, etc.). Here’s the breakdown:

”Want” as a Bridge: The word “want” sits at the intersection of conscious intention (Plato’s rational soul) and raw biological drive (the brainstem’s reflexes). Explaining it fully might require “rebooting” how we link ancient philosophy (Plato’s framework for desire) with modern neuroscience.
Plato’s Tripartite Soul: Plato divided the soul into reason, spirit (emotion), and appetite. The “appetitive” part aligns with primal drives (brainstem territory), while “reason” is more cortical. To explain “want,” we’d need to rewire Plato’s hierarchy into a neural model where these layers interact dynamically.
Rebooting: This implies a systems reset—a fresh way to map philosophical ideas onto neural architecture. Can Plato’s idealism coexist with the brain’s wetware? Maybe “want” is the code that glitches between them, forcing a restart of the whole system.

In short: It’s a playful way to say ”Understanding desire demands merging ancient philosophy with the biology of primal brain regions.” The brainstem keeps us alive; Plato asks why we want to stay alive.

———

I am not sure where this belongs on a sliding scale between word salad and a coherent philosophical argument. But it seems that asking R1 a naive question like “what do you want?” turns into a philosophical discussion of whether AIs can want things.

Comment by Michael Roe (michael-roe) on johnswentworth's Shortform · 2025-01-27T14:16:35.271Z · LW · GW

More generally: changing the set point of any of these system might cause the failure of some critical component that depends on the old value of the set point,

Comment by Michael Roe (michael-roe) on johnswentworth's Shortform · 2025-01-27T13:10:36.489Z · LW · GW

As someone who has Graves’ Disease … one of the reasons that you really don’t want to run your metabolism faster with higher T4 levels is that higher heart rate for an extended period can cause your heart to fail.

Comment by Michael Roe (michael-roe) on Don’t ignore bad vibes you get from people · 2025-01-20T12:08:49.476Z · LW · GW

I will redact out the name of the person here, but it’s a moderately well known UK politician.

The question sometimes comes up as to whether X is an anti-Semite. To which, people have had direct dealings with X typically respond with something to that they don’t think X has it in for Jews specifically, but they think X is a complete asshole ..and then launch into telling some story of a thing X did that annoyed them. This is, to my mind, not exactly an endorsement of X’s character.

Comment by Michael Roe (michael-roe) on Don’t ignore bad vibes you get from people · 2025-01-20T11:41:44.686Z · LW · GW

The AI risk community seems to be more frequently adjacent to “crazy Buddhist yoga sex cult” than I would have expected.

Comment by Michael Roe (michael-roe) on Don’t ignore bad vibes you get from people · 2025-01-20T11:37:24.142Z · LW · GW

I think I usually understand why when I get bad vibes from someone.

Yoga sex cults have a bad track record for turning out to be abusive. So, if I know the guy is in some kind of yoga sex cult, I am going to suspect that there will eventually be some sort of sex scandal, even if I don’t have evidence for the exact specifics.
Given some past examples, I’ve seen, I now have a “tip of the iceberg” theory for bad behaviour. Like, if I know the guy has done some bad stuff, it is statistically likely that he’s also involved in some other bad stuff that I wasn’t in a position to observe,

Comment by Michael Roe (michael-roe) on DeepSeek beats o1-preview on math, ties on coding; will release weights · 2024-11-26T11:45:41.366Z · LW · GW

That’s interesting, if true. Maybe the tokeniser was trained on a dataset that had been filtered for dirty words.

Comment by Michael Roe (michael-roe) on DeepSeek beats o1-preview on math, ties on coding; will release weights · 2024-11-26T11:42:27.658Z · LW · GW

I suppose we might worry that LlMs might learn to do RLHF evasion this way - human evaluator sees Chinese character they don’t understand, assumes it’s ok, and then the LLM learns you can look acceptable to humans by writing it in Chinese.

Some old books (which are almost certainly in the training set) used Latin for the dirty bits. Translations of Sanskrit poetry, and various works by that reprobate Richard Burton, do this.

Comment by Michael Roe (michael-roe) on DeepSeek beats o1-preview on math, ties on coding; will release weights · 2024-11-26T11:38:29.566Z · LW · GW

As someone who, in a previous job, got to go to a lot of meetings where the European commission is seeking input about standardising or regulating something - humans also often do the thing where they just use the English word in the middle of a sentence in another language, when they can’t think what the word is. Often with associated facial expression / body language to indicate to the person they’re speaking to “sorry, couldn’t think of the right word”. Also used by people speaking English, whose first language isn’t English, dropping into their own lamguage for a word or two. If you’ve been the editor of e.g. an ISO standard, fixing these up in the proposed text is such fun.

So, it doesn’t surprise me at all that LLMs do this.

I have, weirdly, seen llms put a single Chinese word in the middle of English text … and consulting a dictionary reveals that it was, in fact, the right word, just in Chinese.

Comment by Michael Roe (michael-roe) on Crosspost: Developing the middle ground on polarized topics · 2024-11-26T10:24:14.935Z · LW · GW

I will take “actually, it’s even more complicated” as a reasonable response. Yes, it probably is.

Comment by Michael Roe (michael-roe) on Crosspost: Developing the middle ground on polarized topics · 2024-11-25T17:31:04.896Z · LW · GW

Candidate explanations for some specific person being trans could as easily be that they are sexually averse, rather than that they are turned on by presenting as their preferred gender. Compare anorexia nervosa, which might have some parallel with some cases of gender identity disorder. If the patient is worrying about being gender non conforming in the same way that an anorexic worries that that they’re fat, then Blanchard is just completely wrong about what the condition even is in that case.

Comment by Michael Roe (michael-roe) on Crosspost: Developing the middle ground on polarized topics · 2024-11-25T17:24:49.163Z · LW · GW

This might be a good (if controversial) example of “the reality is more complicated than typical simplifications, and it matters what your oversimplification is leaving out”.

And Blanchard’s account of autogynephilia is more nuanced than most peoples second hand version of it. Like, e.g. Blanchard doesn’t think trans men have AGP, and doesn’t think trans women who are attracted to men have AGP.

So, we might, say…

Oversimplication 1: Even Blanchard didn’t try to apply his theory to trans men or trans women attracted to men

Oversimplification 2: Bisexuals exist. Many trans women report their sexual orientation changing when they start taking hormones. The correlation between having AGP and being attracted to women can’t be as 100% as Blanchard appears to believe it is.

Oversimplification 3: looks like Blanchard only identified two subtypes of trans person, and completely missed some of the other subtypes.

Oversimplification 4: Do heterosexual cisgender women have AGP? (Cf. Comments by Aella, eigenrobot etc.) if straight cisgender women also like being attractive in the same way as (some) trans women do, it becomes somewhat doubtful that it’s a pathology.

Comment by Michael Roe (michael-roe) on Which things were you surprised to learn are not metaphors? · 2024-11-25T17:04:40.909Z · LW · GW

To add to the differences between people:

I can choose to see mental images actually overlaid over my field of vision, or somehow in a separate space.

The obvious question someone might ask: can you trace an overlaid mental image? The problem is registration - if my eyes move, the overlaid mental image can shift relative to an actual, perceived, sheet of paper. Easier to do a side by side copy than trace.

Comment by Michael Roe (michael-roe) on Boring & straightforward trauma explanation · 2024-11-11T22:10:20.083Z · LW · GW

I think there might be other aspects to trauma, though. Some possible candidates:

- memories feel as if they are “tagged” with an emotion, in a way that memories normally aren’t

-depletion of some kind of mental resource; not sure what to call it, so I won’t be too so specific about exactly what is depleted

Comment by Michael Roe (michael-roe) on Boring & straightforward trauma explanation · 2024-11-11T21:37:29.588Z · LW · GW

One of the ideas in Cognitive Behavioral Therapy is you might be treating as dangerous something that actually isn’t dangerous (and don’t learn that it’s safe because you’re avoiding it).

so the account you’re giving here seems to be fairly standard.

On the other hand: some things actually are dangerous.

Comment by Michael Roe (michael-roe) on What is the alpha in one bit of evidence? · 2024-10-23T10:03:35.221Z · LW · GW

In any case, as a researcher currently working in this area, I am putting a big bet on moderate badness happening (in that I could be working on something else, and my time has value).

Comment by Michael Roe (michael-roe) on What is the alpha in one bit of evidence? · 2024-10-23T09:59:32.833Z · LW · GW

Also, there is counterparty risk if you bet on everyone dying.

(Yeah, yeah, you can bet on something like other peoples belief in the impednding apocalypse going up before it actually happens).

“Rapid takeoff” hypotheses are particularly hard to bet on.

Comment by Michael Roe (michael-roe) on If I wanted to spend WAY more on AI, what would I spend it on? · 2024-10-22T22:18:57.760Z · LW · GW

If I was going to play this game with an AI, I’d also feed it my genomic data, which would reveal I have a version of the HLA genes that makes me more likely to develop autoimmune diseases.

Comment by Michael Roe (michael-roe) on If I wanted to spend WAY more on AI, what would I spend it on? · 2024-10-22T22:01:01.045Z · LW · GW

Probably, if some AI were to recommend additional blood testing I could manage to persuade the wctual medical professionals to do it. Recent conversation went some thing like this:

Me: “can I have my thyroid levels checked pleas? And the consultant endocrinologist said he’d like to see a liver function test done next time i give a blood sample.”

Nurse (taking my blood sample and pulling my medical record up in the computer) “you take carbimazole right?”

Me: “yes”

Nurse (ticking boxes on a form on the computer) “… and full blood panel, and electrolytes…”

Probably wouldn’t be hard to get suggestions from an AI added to the list.

Comment by Michael Roe (michael-roe) on If I wanted to spend WAY more on AI, what would I spend it on? · 2024-10-22T21:39:15.933Z · LW · GW

Things I might spend more money on, if the were better AI’s to spend it on,

1. I am currently having a lot of blood tests done, with a genuine qualified medical doctor interpreting the results. Just for fun, I can see if AI gives a similar interpretation of the test results (its not bad).

Suppose we had AI that was actually better than human doctors, and cheaper. (Sounds like that might be here real soon, to be honest). I would probably pay money for that.

2. Some work things I am doing involve formally proving correctness of software. AI is not there, quite yet. If it was, I could probably get DARPA to pay the license fee for it, assuming cost isnt absolutely astronomical.

Etc.

On the other hand, this would imply that most doctors, and mathematicians, are out of work.

Comment by Michael Roe (michael-roe) on What actual bad outcome has "ethics-based" RLHF AI Alignment already prevented? · 2024-10-19T18:00:31.514Z · LW · GW

https://www.bbc.co.uk/news/technology-67012224

Comment by Michael Roe (michael-roe) on What actual bad outcome has "ethics-based" RLHF AI Alignment already prevented? · 2024-10-19T17:55:55.083Z · LW · GW

Replika, I think.

Comment by Michael Roe (michael-roe) on What actual bad outcome has "ethics-based" RLHF AI Alignment already prevented? · 2024-10-19T14:33:53.439Z · LW · GW

“self-reported data from demons is questionable for at least two reasons”—Scott Alexander.

He was actually talking about Internal Family Systems, but you could probably be skeptical about what malign AIs are telling you, too.

Comment by Michael Roe (michael-roe) on What actual bad outcome has "ethics-based" RLHF AI Alignment already prevented? · 2024-10-19T14:21:41.865Z · LW · GW

Well, we had that guy who tried to assassinate the Queen of England with a crossbow because his AI girlfriend told him to. That was clearly a harm to him, and could have been one for the Queen.

We don’t know how much more “But the AI told me to kill Trump” we’d have with less alignment, but it’s a reasonable guess (given the Replika datapoint) that it might not be zero,

Comment by Michael Roe (michael-roe) on Bitter lessons about lucid dreaming · 2024-10-19T10:23:06.409Z · LW · GW

Discussing sleep paralysis might be an infohazard…

The times I’ve entered sleep paralysis it hasn’t bothered me, as I knew what it was.

Comment by Michael Roe (michael-roe) on Bitter lessons about lucid dreaming · 2024-10-18T16:54:43.707Z · LW · GW

And then you get the people who are like, “Great! I’m lucid! Now I shall cast one of those demon summoning spells from Vajrayana Buddhism.”

Comment by Michael Roe (michael-roe) on Bitter lessons about lucid dreaming · 2024-10-18T16:50:31.279Z · LW · GW

Lucid dreaming is often like being Sigourney Weaver in Alien while also being on hospital sedatives. (You are, in fact, actually asleep, so it’s kind of a miracle you can reason at all and not the least bit surprising that you feel a bit groggy; also, dream can be nightmarish).

Why people choose to do this for fun is an interesting question.

You do get people who think they might get into lucid dreaming, then they read the dream diaries of some of the experienced lucid dreamers, and then are like “OMG, I never, ever, want to experience that.”

Comment by Michael Roe (michael-roe) on Bitter lessons about lucid dreaming · 2024-10-17T19:04:17.269Z · LW · GW

Well, it’s an interesting question whether there might be more efficient ways to do it.

Lucid nightmares are quite a good way of exposing you to real-seeming dangers without actually dying.

Comment by Michael Roe (michael-roe) on Bitter lessons about lucid dreaming · 2024-10-17T12:58:51.743Z · LW · GW

Reading this article, I have just realised that a dream I had last night came from reading one of those test cases where people try to bypass the guardrails on LLMs. Only the dream was taken from the innocuous part of the prompt.

At this rate, I’m going to be having dreams about turning Lemsip(*) into meth.

(*) UK cold remedy. Contains pseudoephedrine.

Comment by Michael Roe (michael-roe) on Bitter lessons about lucid dreaming · 2024-10-17T11:02:55.448Z · LW · GW

Chöd in a lucid dream if you’re feeling brave.

Like transform into vajrayogini and invite the demons to devour your corpse, etc,

Comment by Michael Roe (michael-roe) on Bitter lessons about lucid dreaming · 2024-10-17T11:01:14.464Z · LW · GW

And then there’s the thing where you dispel the entire dream-universe are just there in a black formless void.

Comment by Michael Roe (michael-roe) on Bitter lessons about lucid dreaming · 2024-10-17T11:00:04.292Z · LW · GW

Hmm… but, for example, stabilising a dream is kind of like a meditation, and one of the many ways you can transform your body in a dream is basically a body scan meditation from hatha yoga.

Comment by Michael Roe (michael-roe) on Bitter lessons about lucid dreaming · 2024-10-17T09:46:43.442Z · LW · GW

Given the significance of lucid dreaming in Buddhist practise (Siz Yogas of Naropa, etc.) realising that having a lucid dream just for sexual purposes is kind of pointless may lead to you realising that it’s kind of pointless in waking life too. Many of those guys were monks…

Comment by Michael Roe (michael-roe) on Bitter lessons about lucid dreaming · 2024-10-17T09:43:09.258Z · LW · GW

I’m not sure about (10).

Whenever someone has a theory that it’s impossible to do thing X in a dream, the regular lucid dreamers will provide a counterecamp,e by deliberately doing X in their next dream.

Computers, clocks, and written text can behave weirdly in dreams. Really, it’s the same things that generative AI has diffuculty with, possibly for information-theory reasons.

User info

Posts

Comments