Posts

Alignment First, Intelligence Later 2025-03-30T22:26:55.302Z
Softmax, Emmett Shear's new AI startup focused on "Organic Alignment" 2025-03-28T21:23:46.220Z
Examples of self-fulfilling prophecies in AI alignment? 2025-03-03T02:45:51.619Z
Do clients need years of therapy, or can one conversation resolve the issue? 2025-02-28T00:06:29.276Z
The case for pay-on-results coaching 2025-01-03T18:40:22.304Z
Began a pay-on-results coaching experiment, made $40,300 since July 2024-12-29T21:12:02.574Z
Being Present is Not a Skill 2024-12-18T01:11:04.715Z
Is this a better way to do matchmaking? 2024-12-16T19:06:14.574Z
Just one more exposure bro 2024-12-12T21:37:07.069Z
Locally optimal psychology 2024-11-25T18:35:11.985Z
Social events with plausible deniability 2024-11-18T18:25:17.339Z
What is autonomy? Why boundaries are necessary. 2024-10-21T17:56:33.722Z
What is it like to be psychologically healthy? Podcast ft. DaystarEld 2024-10-05T19:14:04.743Z
Eye contact is effortless when you’re no longer emotionally blocked on it 2024-09-27T21:47:01.970Z
Pay-on-results personal growth: first success 2024-09-14T03:39:12.975Z
what becoming more secure did for me 2024-08-22T17:44:48.525Z
I didn't have to avoid you; I was just insecure 2024-08-17T16:41:50.237Z
List of Collective Intelligence Projects 2024-07-02T14:10:41.789Z
Emotional issues often have an immediate payoff 2024-06-10T23:39:40.697Z
Retrospective on Mathematical Boundaries Workshop 2024-05-12T21:58:46.367Z
Boundaries Update #1 2024-04-11T16:07:18.746Z
Plausibility of cyborgism for protecting boundaries? 2024-03-27T18:53:38.615Z
How I turned doing therapy into object-level AI safety research 2024-03-14T01:54:47.290Z
self-fulfilling prophecies when applying for funding 2024-03-01T19:01:40.991Z
Boundary Violations vs Boundary Dissolution 2024-02-26T18:59:08.713Z
The natural boundaries between people 2024-02-23T01:09:28.592Z
What does davidad want from «boundaries»? 2024-02-06T17:45:42.348Z
Protecting agent boundaries 2024-01-25T04:13:50.993Z
What are the most common social insecurities? 2024-01-16T17:24:59.761Z
What technical topics could help with boundaries/membranes? 2024-01-05T18:14:58.795Z
Agent membranes/boundaries and formalizing “safety” 2024-01-03T17:55:21.018Z
Safety First: safety before full alignment. The deontic sufficiency hypothesis. 2024-01-03T17:55:19.825Z
Agent membranes and causal distance 2024-01-02T22:43:41.508Z
Environmental allergies are curable? (Sublingual immunotherapy) 2023-12-26T19:05:08.880Z
The absence of self-rejection is self-acceptance 2023-12-21T21:54:52.116Z
Lessons from massaging myself, others, dogs, and cats 2023-12-17T04:28:40.080Z
Apply to the Conceptual Boundaries Workshop for AI Safety 2023-11-27T21:04:59.037Z
Spaced repetition for teaching two-year olds how to read (Interview) 2023-11-26T16:52:58.412Z
Formalizing «Boundaries» with Markov blankets 2023-09-19T21:01:01.901Z
Coherence Therapy with LLMs - quick demo 2023-08-14T03:34:29.102Z
"Membranes" is better terminology than "boundaries" alone 2023-05-28T22:16:21.404Z
«Boundaries» for formalizing an MVP morality 2023-05-13T19:10:51.833Z
«Boundaries/Membranes» and AI safety compilation 2023-05-03T21:41:19.124Z

Comments

Comment by Chipmonk on Unbendable Arm as Test Case for Religious Belief · 2025-04-14T01:59:49.498Z · LW · GW

tagged this post as self-fulfilling prophecies!

Comment by Chipmonk on Unbendable Arm as Test Case for Religious Belief · 2025-04-14T01:59:30.749Z · LW · GW

yes!! “focusing on what you want!” (i talk a little more about this and self-fulfilling prophecies here)

**aiming** at what you want. vector. (teleology, not etiology)

Comment by Chipmonk on Learned pain as a leading cause of chronic pain · 2025-04-10T20:00:41.438Z · LW · GW

This is also my experience helping people with lifelong anxiety/insecurity

Comment by Chipmonk on Examples of self-fulfilling prophecies in AI alignment? · 2025-04-03T19:39:56.539Z · LW · GW

https://x.com/saffronhuang/status/1907863453009867183

Comment by Chipmonk on VDT: a solution to decision theory · 2025-04-02T19:03:21.287Z · LW · GW

Now we just need to ask Sonnet to formalize VDT

Comment by Chipmonk on Announcing EXP: Experimental Summer Workshop on Collective Cognition · 2025-03-17T21:04:28.230Z · LW · GW

what are the examples of the curriculum/activities you're considering/planning?

Comment by Chipmonk on Announcing EXP: Experimental Summer Workshop on Collective Cognition · 2025-03-15T21:08:20.729Z · LW · GW

@Ivan Vendrov think you'd be interested

Comment by Chipmonk on Self-fulfilling misalignment data might be poisoning our AI models · 2025-03-03T02:54:14.867Z · LW · GW

Can you think of examples like this in the broader AI landscape? - What are the best examples of self-fulfilling prophecies in AI alignment?

Comment by Chipmonk on Examples of self-fulfilling prophecies in AI alignment? · 2025-03-03T02:50:27.022Z · LW · GW

https://x.com/sama/status/1621621724507938816 

Comment by Chipmonk on Examples of self-fulfilling prophecies in AI alignment? · 2025-03-03T02:47:39.289Z · LW · GW

Situational Awareness and race dynamics? h/t Jan Kulveit @Jan_Kulveit 

Comment by Chipmonk on Examples of self-fulfilling prophecies in AI alignment? · 2025-03-03T02:46:34.809Z · LW · GW

Training on Documents About Reward Hacking Induces Reward Hacking

Comment by Chipmonk on Do clients need years of therapy, or can one conversation resolve the issue? · 2025-02-28T20:06:04.954Z · LW · GW

I don't feel like I learned anything new from the post.

This surprises me! Wait so-

  • The "How does one-shotting happen?" section didn't have anything interesting for you? (Have you seen stuff like this elsewhere?)
  • Did you already know one-shotting was possible?

Comment by Chipmonk on Do clients need years of therapy, or can one conversation resolve the issue? · 2025-02-28T20:04:28.037Z · LW · GW

since your bullet-point list in the beginning isn't detailed enough for anyone to try to replicate the method.

Wait I'm confused- this is not the purpose of the post

Also notable is that you only have positive examples for your method

The purpose of this post is not advertisement. It's to discuss one-shots

Especially, how would you be able to distinguish between your approach convincing your customers they were helped, instead of actually changing their behavior?

See above

Comment by Chipmonk on Do clients need years of therapy, or can one conversation resolve the issue? · 2025-02-28T17:33:36.266Z · LW · GW

Would anyone like to help me edit a better version of this?

Comment by Chipmonk on Do clients need years of therapy, or can one conversation resolve the issue? · 2025-02-28T17:28:44.988Z · LW · GW

Oh I like "patients" ("clients"). I'll think about the rest, thanks. I'm just not sure how to write anything useful and legible without talking about my own experience and what I have the most data for?

Also I see the point of your last bullet where "my business" is the subject hm

Comment by Chipmonk on Do clients need years of therapy, or can one conversation resolve the issue? · 2025-02-28T15:39:58.388Z · LW · GW

any suggestions for how to talk about this stuff without having it read like an advertisement? i'm genuinely interested in the idea of one-shotting and legibilizing evidence that quick growth is possible

Comment by Chipmonk on Invest in ACX Grants projects! · 2025-02-28T05:17:04.506Z · LW · GW

any updates on how this is going btw? (doing retroactive funding research)

Comment by Chipmonk on Prizes for ML Safety Benchmark Ideas · 2025-02-25T06:45:53.761Z · LW · GW

what came of this? (doing research on bounties, prizes, and retroactive funding rn)

Comment by Chipmonk on MichaelDickens's Shortform · 2025-02-22T21:18:39.688Z · LW · GW

fwiw, FABRIC was able to get funding in November 2024 (who knows if this date is correct though)

nvm this was an "exit grant" lmao

Comment by Chipmonk on (The) Lightcone is nothing without its people: LW + Lighthaven's big fundraiser · 2025-02-16T05:17:55.497Z · LW · GW

Now that this is "over", I'd be fascinated to see a post about what the fundraising process was like for you and what can be learned. Seems like a big L for retroactive funding for example

https://x.com/ohabryka/status/1882579367110586459 

Comment by Chipmonk on We probably won't just play status games with each other after AGI · 2025-01-26T01:04:43.766Z · LW · GW

Aside: I'm surprised you're suggesting people get validation --> people feel secure? This does not at all seem like the causality to me (though I'm aware most people probably think like this).

Prediction: In the absence of radically improved psychotechnology, a significant fraction of people will always find a way to feel insecure.

Comment by Chipmonk on Began a pay-on-results coaching experiment, made $40,300 since July · 2025-01-24T03:35:09.389Z · LW · GW

hmm i suspect releasing these metrics could make my customers significantly more annoying. like, early adopters are fun and experimental. but if i make it seem not risky then i get risk-averse people who tend to be prickly

so maybe i will compile and release this data but i would need to figure out how to do it in a way that doesn't change the funnel

Comment by Chipmonk on Increasing IQ is trivial · 2025-01-14T05:30:08.300Z · LW · GW

Any updates on this?

Comment by Chipmonk on Increasing IQ by 10 Points is Possible · 2025-01-14T05:29:59.747Z · LW · GW

Any updates on this?

Comment by Chipmonk on (The) Lightcone is nothing without its people: LW + Lighthaven's big fundraiser · 2025-01-13T17:16:37.009Z · LW · GW

I wonder if you could set up a conditional donation? “I donate $X, minus if total donations exceed $3M”

Comment by Chipmonk on Began a pay-on-results coaching experiment, made $40,300 since July · 2024-12-31T02:29:36.285Z · LW · GW

i like this thanks. might take a bit of time to put together but interested

Comment by Chipmonk on Began a pay-on-results coaching experiment, made $40,300 since July · 2024-12-30T18:16:13.946Z · LW · GW

made some light edits because of this comment, thanks

Comment by Chipmonk on Began a pay-on-results coaching experiment, made $40,300 since July · 2024-12-30T17:50:45.217Z · LW · GW

oh ok i might start doing that. knowing my calibration on that would be nice

Comment by Chipmonk on Began a pay-on-results coaching experiment, made $40,300 since July · 2024-12-30T17:27:55.988Z · LW · GW

oh ok hm. i also don't want to be incentivized to not give easy-for-me help to people with low odds of success though

Comment by Chipmonk on Began a pay-on-results coaching experiment, made $40,300 since July · 2024-12-30T01:18:30.880Z · LW · GW

could you give a few examples? 

also seems time-intensive hmmmm

also, i thought about it more and i really like the metric of "results generated per hour"

Comment by Chipmonk on Began a pay-on-results coaching experiment, made $40,300 since July · 2024-12-30T01:16:19.712Z · LW · GW

:D i really hope bounties catch on

Comment by Chipmonk on Began a pay-on-results coaching experiment, made $40,300 since July · 2024-12-30T01:15:23.431Z · LW · GW

wow this is controversial (my own vote is +6)

wonder why

Comment by Chipmonk on Shallow review of technical AI safety, 2024 · 2024-12-29T14:20:47.967Z · LW · GW

boundaries / membranes

  • One-sentence summary: Formalise one piece of morality: the causal separation between agents and their environment. See also Open Agency Architecture.
  • Theory of change: Formalise (part of) morality/safety, solve outer alignment.

Chris Lakin here - this is a very old post, and "What does davidad want from «boundaries»?" should be the canonical link

Comment by Chipmonk on Orienting to 3 year AGI timelines · 2024-12-26T17:07:54.185Z · LW · GW

Why SPY over QQQ?

Comment by Chipmonk on The Deep Lore of LightHaven, with Oliver Habryka (TBC episode 228) · 2024-12-25T23:18:20.373Z · LW · GW

available on the website at least 

Comment by Chipmonk on Pay-on-results personal growth: first success · 2024-12-22T00:41:01.818Z · LW · GW

Update: Bob has recorded a 6-month follow-up here.

Comment by Chipmonk on Walking Sue · 2024-12-18T17:38:11.090Z · LW · GW

Why was this post tagged as boundaries/membranes? I'm inclined to remove the tag.

Comment by Chipmonk on Being Present is Not a Skill · 2024-12-18T01:44:19.038Z · LW · GW

makes sense

Comment by Chipmonk on Sorry for the downtime, looks like we got DDosd · 2024-12-02T04:22:40.197Z · LW · GW

works!

Comment by Chipmonk on Sorry for the downtime, looks like we got DDosd · 2024-12-02T04:17:30.234Z · LW · GW

another weird bug is if i click the link i was just sent in my email, it brings me to a 403 Forbidden page (even though the URLs of this functional page and that 403 page look identical)

Comment by Chipmonk on (The) Lightcone is nothing without its people: LW + Lighthaven's big fundraiser · 2024-11-30T17:41:31.901Z · LW · GW

I've run two workshops at LightHaven and it's pretty unthinkable to run a workshop anywhere else in the Bay Area. Lightcone has really made it easy to run overnight events without setup

Comment by Chipmonk on Hierarchical Agency: A Missing Piece in AI Alignment · 2024-11-29T20:03:59.406Z · LW · GW

Yeah i'm confused about what to name it. we can always change it later i guess.

also let me know if you have any posts you want me to definitely tag for it that you think i might miss otherwise

Comment by Chipmonk on Hierarchical Agency: A Missing Piece in AI Alignment · 2024-11-27T07:23:54.891Z · LW · GW

Do we have a LessWrong tag for "hierarchical agency" or "multi-scale alignment" or something? Should I make one?

Comment by Chipmonk on Hierarchical Agency: A Missing Piece in AI Alignment · 2024-11-27T07:16:07.783Z · LW · GW

I just made a twitter list with accounts interested in hierarchical agency (or what i call "multi-scale alignment"). Lmk who should be added

Comment by Chipmonk on Hierarchical Agency: A Missing Piece in AI Alignment · 2024-11-27T07:14:13.124Z · LW · GW

Random but you might like this graphic I made representing hierarchical agency from my post today on a very similar idea. What would you change about it?

Comment by Chipmonk on Hierarchical Agency: A Missing Piece in AI Alignment · 2024-11-27T07:12:36.151Z · LW · GW

This was an impressive demonstration of Claude for interviews. Was this one take?

(Also what prompt did you use? I like how your Claude speaks.)

Comment by Chipmonk on Hierarchical Agency: A Missing Piece in AI Alignment · 2024-11-27T07:12:25.207Z · LW · GW

I'm glad you wrote this! I've been wanting to tell others about ACS's research and finally have a good link

Comment by Chipmonk on Locally optimal psychology · 2024-11-26T17:35:00.957Z · LW · GW

Great question, thanks!

I think you're correct in pointing towards the existence of basically-all-downside genetic conditions, but I still think these are in the minority. Moreover, even most of those don't create a big issue on the object level, compared to how people might feel about the issue as a result.

This argument doesn't extend to conditions like Huntington's, but if a person is missing a pinky finger, most of the issues the person is going to face are related to social factors and their own emotions, not the physical aspect.

I also just say this from experience helping others

Comment by Chipmonk on Locally optimal psychology · 2024-11-26T00:56:43.552Z · LW · GW

I did not say that depression is always a strategy for everyone.

Comment by Chipmonk on Which things were you surprised to learn are not metaphors? · 2024-11-21T18:59:03.125Z · LW · GW

I wrote about my own experience discovering “feelings in the body” here