Posts

shortest goddamn bayes guide ever 2024-05-10T07:06:23.734Z
Is being a trans woman (or just low-T) +20 IQ? 2024-04-24T20:04:36.829Z
Speedrun ruiner research idea 2024-04-13T23:42:29.479Z
Upcoming unambiguously good tech possibilities? (Like eg indoor plumbing) 2024-04-11T23:14:06.355Z
Any evidence or reason to expect a multiverse / Everett branches? 2024-04-09T05:26:30.990Z
Best *organization* red-pill books and posts? 2024-03-20T07:01:16.536Z
Fixed point or oscillate or noise 2024-03-14T18:37:11.212Z
Let's build definitely-not-conscious AI 2024-03-06T07:50:01.880Z
What's this 3rd secret directive of evolution called? (survive & spread & ___) 2024-02-07T14:11:58.143Z
Wrong answer bias 2024-02-01T20:05:38.573Z
The economy is mostly newbs (strat predictions) 2024-02-01T19:15:49.420Z
What exactly did that great AI future involve again? 2024-01-28T10:10:21.270Z
Decent plan prize winner & highlights 2024-01-19T23:30:34.242Z
Decent plan prize announcement (1 paragraph, $1k) 2024-01-12T06:27:44.495Z
Go flash blinking lights at printed text right now 2023-11-05T07:29:44.630Z
Responsible scaling policy TLDR 2023-09-28T18:51:20.330Z
Series of absurd upgrades in nature's great search 2023-09-03T09:35:20.760Z
We can do better than DoWhatIMean (inextricably kind AI) 2023-08-19T05:41:47.046Z
Could fabs own AI? 2023-08-19T00:16:37.848Z
Could we breed/engineer intelligent parrots? 2023-08-02T07:32:17.686Z
When did you orient? 2023-06-19T07:22:27.968Z
Work dumber not smarter 2023-06-01T12:40:31.264Z
When should I close the fridge? 2023-05-17T16:56:35.629Z
Distinguishing misuse is difficult and uncomfortable 2023-05-01T16:23:17.040Z
More money with less risk: sell services instead of model access 2023-03-04T20:51:36.480Z
Planning capacity and daemons 2022-09-26T00:15:42.409Z
Inner alignment: what are we pointing at? 2022-09-18T11:09:58.661Z
AI-assisted list of ten concrete alignment things to do right now 2022-09-07T08:38:29.757Z
Do yourself a FAVAR: security mindset 2022-06-18T02:08:47.415Z
Against unstoppable crypto prediction markets 2021-02-25T06:02:23.102Z
lukehmiles's Shortform 2020-01-27T00:52:37.833Z
Creating Environments to Design and Test Embedded Agents 2019-08-23T03:17:33.265Z

Comments

Comment by lukehmiles (lcmgcd) on lukehmiles's Shortform · 2024-05-14T05:41:17.542Z · LW · GW

The acceptable tone of voice here feels like 3mm wide to me. I'm always having bad manners

Comment by lukehmiles (lcmgcd) on D0TheMath's Shortform · 2024-05-14T05:38:02.353Z · LW · GW

I swear to never joke again sir

Comment by lukehmiles (lcmgcd) on We might be missing some key feature of AI takeoff; it'll probably seem like "we could've seen this coming" · 2024-05-12T09:22:26.477Z · LW · GW

I assumed somebody had. Maybe everyone did haha

Comment by lukehmiles (lcmgcd) on D0TheMath's Shortform · 2024-05-12T09:21:28.540Z · LW · GW

#onlyReadBadWriters #hansonFTW

Comment by lukehmiles (lcmgcd) on lukehmiles's Shortform · 2024-05-12T09:14:06.473Z · LW · GW

From the frontpage:

https://www.lesswrong.com/posts/zAqqeXcau9y2yiJdi/can-we-build-a-better-public-doublecrux

https://www.lesswrong.com/posts/bkr9BozFuh7ytiwbK/my-hour-of-memoryless-lucidity

https://www.lesswrong.com/posts/Lgq2DcuahKmLktDvC/applying-refusal-vector-ablation-to-a-llama-3-70b-agent

https://www.lesswrong.com/posts/ANGmJnZL2fskHX6tj/dyslucksia

https://www.lesswrong.com/posts/BRZf42vpFcHtSTraD/linkpost-towards-a-theoretical-understanding-of-the-reversal

Like all of them basically.

most of the value is in even figuring out how to diagram the posts

Think of it like a TLDR. There are many ways to TLDR but any method that's not terrible is fantastic

Comment by lukehmiles (lcmgcd) on lukehmiles's Shortform · 2024-05-12T09:04:28.605Z · LW · GW

The job would of course be done by a diagramming god, not a wordpleb like me

If i got double dog dared...

Comment by lukehmiles (lcmgcd) on ChristianKl's Shortform · 2024-05-12T08:59:08.332Z · LW · GW

"Lo-salt" salt is salt with potassium. That's been my table salt for 5 years.

Comment by lukehmiles (lcmgcd) on David Gross's Shortform · 2024-05-12T08:38:03.228Z · LW · GW

Put your phone in the oven and stand in the grass and eat some grass and see how it tastes

Comment by lukehmiles (lcmgcd) on lukehmiles's Shortform · 2024-05-12T08:34:23.827Z · LW · GW

LW mods, please pay somebody to turn every post with 20+ karma into a diagram. Diagrams are just so vastly superior to words.

Comment by lukehmiles (lcmgcd) on We might be missing some key feature of AI takeoff; it'll probably seem like "we could've seen this coming" · 2024-05-12T08:28:24.341Z · LW · GW

That title!! I was even fan of you and yam specifically and had even gone through a number of your old works looking for nuggets! Figure 22.3 makes up for it all though haha. Diagrams are so far superior to words...

Comment by lukehmiles (lcmgcd) on Some background for reasoning about dual-use alignment research · 2024-05-12T08:20:29.671Z · LW · GW

Bump

Comment by lukehmiles (lcmgcd) on We might be missing some key feature of AI takeoff; it'll probably seem like "we could've seen this coming" · 2024-05-12T08:10:18.406Z · LW · GW

Okay now I know why I got this one wrong. It's your fault. You hid it in chapter 22 of a book! Not even a clickbait title for the chapter! I even bought that book when it came out and read a good portion of it but never saw the chapter :(

Comment by lukehmiles (lcmgcd) on We might be missing some key feature of AI takeoff; it'll probably seem like "we could've seen this coming" · 2024-05-12T07:58:50.884Z · LW · GW

Btw, why didn't we have vending machines for everything 50 years ago?

Comment by lukehmiles (lcmgcd) on We might be missing some key feature of AI takeoff; it'll probably seem like "we could've seen this coming" · 2024-05-12T07:56:42.270Z · LW · GW

I got all the questions you mentioned wrong and definitely feel like I should've gotten them all right.

I think it just takes a lot time of time and effort to find the obvious future and it isn't super fun. You don't get to spend most of the time building up your tower of predictions. A lot of digging up foundations, pouring new foundations, digging them up...

It probably can be fun with the right culture within a small group of friends. Damn maybe that's what those people who were correct had...

Comment by lukehmiles (lcmgcd) on Questions are usually too cheap · 2024-05-12T07:28:19.901Z · LW · GW

Justify this extensively right now or you're a phony

Comment by lukehmiles (lcmgcd) on shortest goddamn bayes guide ever · 2024-05-12T01:28:05.351Z · LW · GW

I was thinking the bear would scare other stuff off yeah. But now I think I'm doing this wrong and the code is broken. Can you fix my code?

Comment by lukehmiles (lcmgcd) on shortest goddamn bayes guide ever · 2024-05-10T16:42:08.544Z · LW · GW

A possom or whatever will scratch mine like half the time

Comment by lukehmiles (lcmgcd) on AI #62: Too Soon to Tell · 2024-05-04T05:05:08.619Z · LW · GW

Original post that introduced the technique is best explanation of steering stuff. https://www.lesswrong.com/posts/5spBue2z2tw4JuDCx/steering-gpt-2-xl-by-adding-an-activation-vector

Comment by lukehmiles (lcmgcd) on lukehmiles's Shortform · 2024-04-28T23:11:52.849Z · LW · GW

What monster downvoted this

Comment by lukehmiles (lcmgcd) on lukehmiles's Shortform · 2024-04-28T20:40:59.411Z · LW · GW

Hmm I think the damaging effect would occur over many years but mainly during puberty. It looks like there's only two studies they mention lasting over a year. One found a damaging effect and the other found no effect.

Comment by lukehmiles (lcmgcd) on Refusal in LLMs is mediated by a single direction · 2024-04-28T20:23:56.747Z · LW · GW

The "love minus hate" thing really holds up

Comment by lukehmiles (lcmgcd) on Don't sleep on Coordination Takeoffs · 2024-04-27T04:49:29.151Z · LW · GW

This is inspiring

Comment by lukehmiles (lcmgcd) on WSJ: Inside Amazon’s Secret Operation to Gather Intel on Rivals · 2024-04-27T04:19:32.669Z · LW · GW

Thanks for posting. I would not have seen this otherwise.

Comment by lukehmiles (lcmgcd) on Johannes C. Mayer's Shortform · 2024-04-27T00:29:12.669Z · LW · GW

I like the rough thoughts way though. I'm not here to like read a textbook.

Comment by lukehmiles (lcmgcd) on We are headed into an extreme compute overhang · 2024-04-27T00:01:08.510Z · LW · GW

This seems correct and important to me.

Comment by lukehmiles (lcmgcd) on Is being a trans woman (or just low-T) +20 IQ? · 2024-04-26T17:00:32.999Z · LW · GW

Then where are the smart trans men hiding?

Comment by lukehmiles (lcmgcd) on Is being a trans woman (or just low-T) +20 IQ? · 2024-04-26T16:57:15.977Z · LW · GW

There are plenty of stupid and/or distracting behaviors testosterone can push you for without any kind of "chemical brain damage", not only sex. Testosterone is likely to make you seek social status and status-seeking is notoriously incompatible with intellectual pursuits.

This is the strongest alternative explanation by far. I wonder what to look for to check this...

Comment by lukehmiles (lcmgcd) on Is being a trans woman (or just low-T) +20 IQ? · 2024-04-26T16:27:46.898Z · LW · GW

Yes my point is the low T did it before the transition

Did any of them have big muscles before the transition?

Comment by lukehmiles (lcmgcd) on Is being a trans woman (or just low-T) +20 IQ? · 2024-04-25T04:36:01.378Z · LW · GW

seems in tension with your smarter friends transitioning after high school.

They seemed low-T during high school though!

Yeah could be a third factor though. Maybe you are right.

Comment by lukehmiles (lcmgcd) on Is being a trans woman (or just low-T) +20 IQ? · 2024-04-25T02:00:37.719Z · LW · GW

Someone on a subreddit said "free testosterone" is what matters and they usually just measure uh "regular testosterone" in blood or something. I have no idea if that's true. Know what those studies measured?

Wildly guessing here, but my intuition is that estrogen would have a greater impact on neuroticism than testosterone. Although I can't even say which direction.

Comment by lukehmiles (lcmgcd) on Is being a trans woman (or just low-T) +20 IQ? · 2024-04-25T01:58:12.945Z · LW · GW

Like what exactly? That seems unlikely to me. I suppose we will have results from the ongoing gender transitions soon.

Comment by lukehmiles (lcmgcd) on Is being a trans woman (or just low-T) +20 IQ? · 2024-04-25T01:55:17.241Z · LW · GW

I only linked the U-shaped study to mention that someone had said something vaguely similar. Notice my words "people have posited a U-shaped curve...". Study indeed seems like garbage. Perhaps i should've said that explicitly.

But it still doesn't really prove the causality - lots of things presumably influence intelligence, and I wouldn't be surprised if some of them influence T as well.

Yes so the experiment is that a million people are starting up in taking hormones/blockers now. I don't think proper results are in but what I have myself observed seems like strong evidence that blocking T preserves or raises intelligence on the margin.

Comment by lukehmiles (lcmgcd) on Examples of Highly Counterfactual Discoveries? · 2024-04-24T07:51:57.925Z · LW · GW

Pasteur had (also highly "counterfactual") help I think! Ignaz Semmelweis worked in this maternity ward where the women & babies kept dying.  The hospital had opened up some investigations over the years as to the cause of death but kept closing them with garbage explanations. He went somewhere else for a while and when he got back he noticed that the death numbers were down in his absence. Then he noticed his hands smelled like death after one of his routine autopsies and he was about to go plunge them in some poor mother! He had washed them but just with regular soap. If he put some bleach in the washwater then his hands didn't stink. He connected the dots. He had killed hundreds of mothers & babies but wrote a book about it anyway and thereby popularized disinfection (and strongly suggested the root cause of disease).

Probably the main reason that germ theory took so long to work out is that the people with the right evidence were too guilty and ashamed to share it. 

Comment by lukehmiles (lcmgcd) on lukehmiles's Shortform · 2024-04-24T07:45:25.652Z · LW · GW

I wonder how much testosterone during puberty lowers IQ. Most of my high school math/CS friends seemed low-T and 3/4 of them transitioned since high school. They still seem smart as shit. The higher-T among us seem significantly brain damaged since high school (myself included). I wonder what the mechanism would be here...

Like 40% of my math/cs Twitter is trans women and another 30% is scrawny nerds and only like 9% big bald men.

Comment by lukehmiles (lcmgcd) on lukehmiles's Shortform · 2024-04-24T07:06:47.849Z · LW · GW

Seems it is easier / more streamlined / more googlable now for a teenage male to get testosterone blockers than testosterone. Latter is very frowned upon — I guess because it is cheating in sports. Try googling eg "get testosterone prescription high school reddit -trans -ftm". The results are exclusively people shaming the cheaters. Whereas of course googling "get testosterone blockers high school reddit" gives tons of love & support & practical advice.

Females however retain easy access to hormones via birth control.

Comment by lukehmiles (lcmgcd) on Upcoming unambiguously good tech possibilities? (Like eg indoor plumbing) · 2024-04-20T15:35:22.906Z · LW · GW

How do cancer vaccines work?

Comment by lukehmiles (lcmgcd) on LLMs for Alignment Research: a safety priority? · 2024-04-17T18:20:14.675Z · LW · GW

Oh I have 0% success with any long conversations with an LLM about anything. I usually stick to one question and rephrase and reroll a number of times. I am no pro but I do get good utility out of LLMs for nebulous technical questions

Comment by lukehmiles (lcmgcd) on lukehmiles's Shortform · 2024-04-17T17:59:35.865Z · LW · GW

I wonder how many recent trans people tried/considered doubling down on their sex (eg males taking more testosterone) instead first. Maybe (for some people) either end of gender spectrum is comfortable and being in the middle feels bad¿ Anybody know? Don't want to ask my friends because this Q will certainly anger them

Comment by lukehmiles (lcmgcd) on Speedrun ruiner research idea · 2024-04-17T05:51:13.348Z · LW · GW

All games. Find where position is stored etc automatically i mean. It will certainly have failure cases. Easy to make a game that breaks it. The question is if an adversarial agent can easily break it in a regular (ie not adversarially chosen) game.

Comment by lukehmiles (lcmgcd) on Speedrun ruiner research idea · 2024-04-14T21:39:50.484Z · LW · GW

You may be right. Perhaps the way to view this idea is "yet another fuzzy-boundary RL helper technique" that works in a very different way and so will have different strengths and weaknesses than stuff like RLHF. So if one is doing the "serially apply all cheap tricks that somewhat reduce risk" approach then this can be yet another thing in your chain.

Comment by lukehmiles (lcmgcd) on Any evidence or reason to expect a multiverse / Everett branches? · 2024-04-14T19:32:37.948Z · LW · GW

This is what I get for glossing over the math. RIP

Comment by lukehmiles (lcmgcd) on Alexander Gietelink Oldenziel's Shortform · 2024-04-14T19:27:20.570Z · LW · GW

Have you ever tried hiring someone or getting a job? Mostly lemons all around (apologies for the offense, jobseekers, i'm sure you're not the lemon)

Comment by lukehmiles (lcmgcd) on Speedrun ruiner research idea · 2024-04-14T19:24:55.979Z · LW · GW

What about mediocre optimizers? Are they not worth fooling with?

Comment by lukehmiles (lcmgcd) on Speedrun ruiner research idea · 2024-04-14T18:58:51.783Z · LW · GW

What makes you say so? Seems like 25% chance possible to me. You can find where position is stored and watch for sudden changes. Same thing with score & inventory...

Comment by lukehmiles (lcmgcd) on Speedrun ruiner research idea · 2024-04-14T18:57:48.277Z · LW · GW

I do not propose one applies this method to a prepotence

Comment by lukehmiles (lcmgcd) on Alexander Gietelink Oldenziel's Shortform · 2024-04-14T18:54:17.762Z · LW · GW

I suspect that past therapists existed in your community and knew what you're actually like so were better able to give you actual true information instead of having to digest only your bullshit and search for truth nuggets in it.

Furthermore, I suspect they didn't lose their bread when they solve your problem! We have a major incentive issue in the current arrangement!

Comment by lukehmiles (lcmgcd) on lukehmiles's Shortform · 2024-04-14T18:51:13.122Z · LW · GW

Andor is a word now. You're welcome everybody. Celebrate with champagne andor ice cream.

Comment by lukehmiles (lcmgcd) on lukehmiles's Shortform · 2024-04-13T22:16:39.707Z · LW · GW

Is it rude to make a new tag without also tagging a handful of posts for it? A few tags I kinda want:

  • explanation: thing explained.
  • idea: an idea for a thing someone could do (weaker version of "Research Agenda" tag)
  • stating the obvious: pointing out something obviously true but maybe frequently overlooked
  • experimental result
  • theoretical result
  • novel maybe: attempts to do something new (in the sense of novelty requirements for conference publications)
Comment by lukehmiles (lcmgcd) on Specification gaming: the flip side of AI ingenuity · 2024-04-12T09:42:58.135Z · LW · GW

I would watch a ten hour video of this. (It may also be more persuasive to skeptics.)

Comment by lukehmiles (lcmgcd) on Any evidence or reason to expect a multiverse / Everett branches? · 2024-04-12T09:37:07.718Z · LW · GW

I think I was wrong and you & Adele Lopez are right and pilot wave would be more lines. I am concerned about god's RAM though... Maybe if they've got good hardware for low-rank matrices then it's fine.