Posts

An Invitation to Refrain from Downvoting Posts into Net-Negative Karma 2024-01-26T20:13:49.209Z
In Defense of «The Army of Jakoths» 2023-05-22T11:59:47.105Z
Speed of information input is a bottleneck for rationality 2023-05-22T10:24:37.673Z
The Army of Jakoths (a parable) 2023-05-21T22:48:37.287Z
How to End a Pandemic 2022-01-03T20:26:07.009Z
A Reason to Expect Republics to Perform Better than Absolute Monarchies in the Long-Term 2021-06-17T22:22:50.656Z
Examples of Acausal Trade with an Alien Universe? 2021-04-01T18:10:13.541Z
Selling Attention for Money 2021-03-24T06:24:37.981Z
Even Inflationary Currencies Should Have Fixed Total Supply 2021-03-10T05:41:08.883Z
How to build common knowledge of rationality and honesty? 2021-02-21T06:07:29.478Z
Democratic Currency 2021-01-19T05:32:07.612Z
No, Newspeak Won’t Make You Stupid 2020-12-18T00:56:02.654Z
Ideal Chess - drop chess perfected 2020-12-17T20:03:19.329Z
What AI companies would be most likely to have a positive long-term impact on the world as a result of investing in them? 2020-09-21T23:41:24.281Z
If there were an interactive software teaching Yudkowskian rationality, what concepts would you want to see it teach? 2020-09-02T05:37:08.758Z
MikkW's Shortform 2020-08-10T20:39:29.510Z
Calibrate words, not just probabilities 2020-07-18T05:56:11.120Z

Comments

Comment by MikkW (mikkel-wilson) on An Invitation to Refrain from Downvoting Posts into Net-Negative Karma · 2024-02-12T23:33:00.892Z · LW · GW

"authors will get hurt by people not appreciating their work" is something we just have to accept, even if it's very harsh

I don't really agree with this. Sure, some people are going to write stuff that's not very good, but that doesn't mean that we have to go overboard on negative feedback, or be stingy with positive feedback.

Humans are animals which learn by reinforcement learning, and the lesson they learn when punished is often "stay away from the thing / person / group that gave the punishment", much more strongly than "don't do the thing that made that person / thing / group punish me".

Wheras when they are rewarded, the lesson is "seek out the circumstances / context that let me be rewarded (and also do the thing that will make it reward me)". Nobody is born writing amazingly, they have to learn it over time, and it comes more naturally to some, less to others.

I don't want bad writers (who are otherwise intelligent and intellectually engaged, which describes almost everybody who posts on LW) to learn the lesson "stay away from LW". I want them to receive encouragement (mostly in forms other than karma, e.g. encouraging comments, or inclusion in the community, etc.), leading them to be more motivated to figure out the norms of LW and the art of writing, and try again, with new learning and experience behind them.

I think the threshold of 0 is largely arbitrary

It's not all that arbitrary. Besides the fact that it's one of the simplest numbers, which makes for an easy to remember / communicate heuristic (a great reason that isn't arbitrary), I actually think it's quite defensible as a threshold. If I write a post that has a +6 starting karma, and I see it drop down to 1 or 2 (or, yeah, -1), my thought is "that kinda sucked, but whatever, I'll learn from my mistake and do better next time".

But if I see it drop down to, say, -5 or -6, my thought starts to become "why am I even posting on this stupid website that's so full of anti-social jerks?". And then I have to talk myself down from deleting my account and removing LW and the associated community from my life.

(Not that I think LW is actually so full of jerks. There's a lot of lovable people here who talk about interesting things, and I believe in LW's raison d'etre, which is why I keep forcing myself to come back)

Comment by MikkW (mikkel-wilson) on Values Darwinism · 2024-01-26T19:59:53.978Z · LW · GW

I would like to make a meta-comment, not directly related to this post.

When I came upon this post, it had a negative karma score. I don't think it's good form to have posts receiving negative net karma (except in extreme cases), so I upvoted to provide this with a positive net karma.

It is unpleasant for an author when they receive a negative karma score on a post which they spent time and effort to make (even when that effort was relatively small), much more so than receiving no karma beyond the starting score. This makes the author less likely to post again in the future, which prevents communication of ideas, and keeps the author from getting better at writing. In particular this creates a risk of LessWrong becoming more like a bubble chamber (which I don't think is desirable), and makes the community less likely to hear valuable ideas that go against the grain of the local culture.

A writer who is encouraged to write more will become more clear in their communication, as well as in their thoughts. And they will also get more used to the particular expectations of the culture of LessWrong- norms that have good reason to exist, but which also go against some people's intuitions or what has worked well for them in other, more "normie" contexts.

Karma serves as a valuable signal to authors about the extent to which they are doing a good job of writing clearly about interesting topics in a way that provides value to members of the community, but the range of positive integers provides enough signal. There isn't much lost in excluding the negative range (except in extreme cases).

Let's be nice to people who are still figuring writing out, I encourage you to refrain from downvoting them into negative karma.

Comment by MikkW (mikkel-wilson) on Trading off Lives · 2024-01-03T22:12:48.967Z · LW · GW

That statement of fact is indeed true. Would you mind saying more about your thoughts regarding it? There seems to be an unstated implication that this is bad. There is a part of me that agrees with that implication, but there are also parts of me that want to say "so what? that's irrelevant". (I feel ⌞explaining what the second set of shards is pointing to, would take more time and energy to write up than I am prepared to take right now⌝)

Comment by MikkW (mikkel-wilson) on Trading off Lives · 2024-01-03T15:31:35.256Z · LW · GW

On the other side, there's the cost of ~10min of boredom, for every passenger, on every flight. Instead of playing games, watching movies, or reading, people would mostly be talking, looking out the window, or staring off into space.

Tangent: I'm not completely sure that this is actually a cost and not an unintended benefit

Comment by MikkW (mikkel-wilson) on Daniel Kokotajlo's Shortform · 2023-12-21T13:13:27.222Z · LW · GW

Sharing my impression of the comic:

Insofar as it supports sides, I'd say the first part of the meme is criticism of Eliezer

The comic does not parse (to my eyes and probably most people's) as the author intending to criticize Eliezer at any point

Insofar as it supports sides, I'd say [...] the last part is criticism of those who reject His message

Only in the most strawman way. It basically feels equivalent to me to "They disagree with the guy I like, therefore they're dumb / unsympathetic". There's basically no meat on the bones of the criticism

Comment by MikkW (mikkel-wilson) on leogao's Shortform · 2023-12-21T13:07:58.951Z · LW · GW

This subjectively seems to me to be the case.

Comment by MikkW (mikkel-wilson) on OpenAI: The Battle of the Board · 2023-11-25T09:51:50.447Z · LW · GW

The board's statement doesn't mention them having made such a request to Altman which was denied, that's a strong signal against things having played out that way.

Comment by MikkW (mikkel-wilson) on For Civilization and Against Niceness · 2023-11-20T13:46:46.917Z · LW · GW

In the case of the lawyers, this is actually not an example of non-niceness being good for society. The defense attorney who defends a guilty party, their job is not to be a jerk to the prosecutor or to the judge. It is to, as you say, provide the judge with information (including counter-arguments to the other side's arguments). While his job involves working in an opposite direction from his counterpart, it does not involve being non-nice to his counterpart (and it is indeed most pro-society if the two sides treat eachother well / nicely outside of their equal-and-opposite professional duties), and it does not involve being non-nice to the judge, whose job the attorney (as you point is) is actually assisting with. Again, society expects maximum niceness from both attorneys towards the judge outside of ⌞their professional duty to imperfectly represent the truth⌝.

Society expects niceness to be provided from each of these parties to each of the others: {the judge, the defense attorney, the prosecution attorney}

Comment by MikkW (mikkel-wilson) on Sam Altman, Greg Brockman and others from OpenAI join Microsoft · 2023-11-20T11:19:37.360Z · LW · GW

This is important news. I personally desire to be kept updated on this, and LW is a convenient (and appropriate) place to get this information. And I expect other users feel similarly.

What's different between this and e.g. the developments with Nonlinear, is that the developments here will have a big impact on how the AI field (and by one layer of indirection, the fate of the world) develops.

Comment by MikkW (mikkel-wilson) on MikkW's Shortform · 2023-11-01T00:30:24.955Z · LW · GW

I am curious to hear people's opinions, for my reference:

Is epistemic rationality or instrumental rationality more important?

Do you believe epistemic rationality is a requirement for instrumental rationality?

Comment by MikkW (mikkel-wilson) on adamzerner's Shortform · 2023-10-28T19:35:52.661Z · LW · GW

Not directly tied to the core of what you're saying, but I will note that I am example of someone who doesn't strongly prefer such foods warm. I do weakly prefer it being warm, as long as it's not too hot (that's worse than it being cold, because it hurts / causes minor injury), but I'm happy eating it room temperature or a bit cold (not necessarily cold steak though)

Comment by mikkel-wilson on [deleted post] 2023-10-28T19:27:16.602Z

My model says that a lot of the changing occurs by gradient descent, which can be interrupted randomly without causing problems. And there's enough redundancy that the reorganization part can be interrupted without the core information being removed completely from the brain, and the redundancy will be replenished (one of copies I imagine is "locked" while the reorganization happens, and is later reorganized later with another copy "locked"). I also expect this replenishing can happen during awakeness, though not as ideally as when asleep.

But I will also note that forgetting is a thing that happens, which is indistinguishable from "data corruption". We're actually quite good at forgetting things.

Comment by MikkW (mikkel-wilson) on quetzal_rainbow's Shortform · 2023-10-28T18:41:45.268Z · LW · GW

Choosing non-ambiguous pointers to values is likely to not be possible

Comment by MikkW (mikkel-wilson) on MikkW's Shortform · 2023-10-28T18:40:46.477Z · LW · GW

I had previously posted thoughts that suggested that the main psychoactive effect of chocolate is due to theobromine (which is chemically similar to caffeine). In the interests of publicly saying "oops":

Chocolate also contains substantial amounts of caffeine, and caffeine has a stronger effect per gram, so most of the caffeine-adjacent effect of chocolate comes from caffeine rather than theobromine.

Theobromine may still contribute to chocolate hitting differently than other caffeinated substances, though I expect there are also other chemicals that also contribute to the effect of chocolate. I assign less than 50% probablity to ⌞theobromine being the cause of the bulk of the difference between chocolate's psychoactive effects vs. pure caffeine⌝

Comment by MikkW (mikkel-wilson) on Goodhart's Law in Reinforcement Learning · 2023-10-18T19:37:29.058Z · LW · GW

I strong-downvoted this post because sentences like

use these insights to derive two methods for provably avoiding Goodharting

Tend to be misleading, pretending that mathematical precision describes the complex and chaotic nature of the real world, where it shouldn't be assumed to (see John Wentworth's comment), and in this case it could potentially lead to very bad consequences if misunderstood.

Comment by MikkW (mikkel-wilson) on Dating in 2023 sucks. Why isn't AI helping? · 2023-10-16T21:27:15.115Z · LW · GW

It takes getting to know more than a few dozen potential mates, at least for some people

Comment by MikkW (mikkel-wilson) on The Gods of Straight Lines · 2023-10-14T13:45:11.332Z · LW · GW

I appreciate your reply. The point I was trying to make is, the contingency of ⌞there being an instance of democratic revolution going smoothly⌝ potentially makes the difference between that straight line happening or not happening. (And if the occurrence took 1000 years - but even that isn't a given - I would consider that an example of "a god of straight lines" successfully being overpowered.)

I think that if there was sufficient backlash against democratic revolution (unclear if the American Revolution not happening would be enough cause), the then-existing status quo in the West (monarchy / feudalism) would not have gone on- that particular "god of straight lines" dooming feudalism would have been very hard to stop, but the resulting system need not have looked like democracy, and >50% would have been substantially worse by ⌞metrics most westerners care about⌝, though with small probability even better than the form of institutions which we ended up receiving, but largely different from modern notions of democracy.

Comment by MikkW (mikkel-wilson) on The Gods of Straight Lines · 2023-10-14T12:15:28.217Z · LW · GW

Thought 1: Yeah, that's fair

Thought 2: Though I also feel like a different country being the first to establish independence, could have made a difference in the long-term trajectory of things. Many of the revolutions that followed the American Revolution (including the French Revolution, which some people view as an even bigger deal than the American) went quite off the rails and were quite unpleasant, and generally soured many people on the idea, while the United States ended up going fairly smoothly after the constitution was implemented. If the French Revolution had happened without the American Revolution, I imagine that could have discredited the ideas behind them, without leaving a successful state built on them.

(Note that the wave of Revolution really took off not after the first French Revolution in the late 1700's, but in the 1830's and 1840's. If the US wasn't there as an example of things going right, I can easily imagine that the appetite in Europe and France for revolution could have been spoiled enough to overcome the forces that otherwise would have made it inevitable)

I think the failure of the Soviet Union could be a similar reference for what the other side can look like. The particular form of the ideas there were destined to fail in any case, but they also did a lot to discredit adjacent ideas that otherwise might have "had their time", and now won't.

Comment by MikkW (mikkel-wilson) on MikkW's Shortform · 2023-10-14T10:30:33.249Z · LW · GW

One idea that I implicitly follow often is "Never assume anything is on the Pareto frontier"- even if something is good, even if you can't see how to improve it without sacrificing some other important consideration, it pays off to engage in creative thinking to identify solutions ⌞whose vague shape you haven't even noticed yet⌝. And if a little bit of creativity doesn't pay off, then that just means you need to think even more creatively to find the Pareto improvement.

(Note that I'm not advocating that only Pareto improvements should be aimed for, I believe sometimes the right move is a non-Pareto change)

Comment by MikkW (mikkel-wilson) on The Gods of Straight Lines · 2023-10-14T08:59:37.648Z · LW · GW

In 1776, America rebelled in the name of freedom and democracy: the origin myth of the modern world order. And yet, somehow, unrebellious Canada ended up just as free and democratic. An unrebellious America likely would have too.

I'm dubious of this. I think it's highly likely that Canada and other British dominions becoming independent was a result of knock-on effects from the American Revolution, e.g. America setting an example for what independance can look like and enable prosperity; American independence causing other colonies to desire independence; Pro-dependence British officials being demoralized in the long term; America itself having a strong effect in the late 1800s and/or 1900s pushing other countries (British and non-British alike) to become independent democracies.

Comment by MikkW (mikkel-wilson) on We don't understand what happened with culture enough · 2023-10-12T09:53:32.752Z · LW · GW

I agree that conditional on humanity going extinct, the seeming success of our species by a genetic metric would only be a false success.

Comment by MikkW (mikkel-wilson) on We don't understand what happened with culture enough · 2023-10-12T09:43:39.652Z · LW · GW

Your argument indicates that humans are successful (by said metric) among mammals, but doesn't address how it compares to insects. As I understand it, some insect species have both more many more individuals and much more biomass than humans

Comment by MikkW (mikkel-wilson) on Dall-E 3 · 2023-10-03T14:49:41.849Z · LW · GW

Thanks for sharing the link

Comment by MikkW (mikkel-wilson) on A quick update from Nonlinear · 2023-09-29T13:10:14.434Z · LW · GW

When I eat oatmeal or cereal, I almost never eat it with milk (non-vegan or otherwise). I soak oats in boiling water, and eat cereal dry.

Comment by MikkW (mikkel-wilson) on The point of a game is not to win, and you shouldn't even pretend that it is · 2023-09-29T10:45:17.651Z · LW · GW

«When the brain generates good feelings, it usually has reasons for doing that» I think is probably true (though as far as the game designer, I suspect some designers are only subconsciously / on a gut-feeling-level aware, rather than consciously aware of all the reasons. Though good ones are probably consciously aware of some of the reasons)

«If you keep trying to make it generate good feelings without respecting the deeper purposes of the source of the feelings, afaik it generally stops working after a bit.» seems false to me.

Comment by MikkW (mikkel-wilson) on Petrov Day Retrospective, 2023 (re: the most important virtue of Petrov Day & unilaterally promoting it) · 2023-09-28T16:32:01.227Z · LW · GW

Registering my predictions for which groups clicked the second link most:

Percentagewise, I don't Groups A and C clicked on it that much (though I'd be surprised if the number from each group isn't non-zero), since they picked a choice that indicates that they care about making high-quality decisions and cooperating with the rest of the world. A higher proportion of C probably clicked than A, since a person might decide it's worth it even if they take their time to think it through (I'd disagree, but the commentor you quote fits into that category).

I'd then say the "accurately reporting your epistemic beliefs" group probably clicked on it the most because I don't model ⌞the kind of person who'd say that is the important trait of Petrov day⌝ as being a particularly ethical person

Comment by MikkW (mikkel-wilson) on MikkW's Shortform · 2023-09-22T07:55:37.034Z · LW · GW

I've noticed some authors here using [square brackets] to indicate how sentences should be parsed

(So "[the administrator of Parthia]'s manor" means something different from "the administrator of [Parthia's manor]")

Previous to seeing this usage, I had similar thoughts about the same problem, but came up with different notation. In my opinion, the square brackets don't feel right, like they mean something different from how they are being used.

My original notation was to use •dots• to indicate the intended order of parsing, though recently I've started using ⌞corner brackets⌝ to indicate the intended parsing

(The corner brackets are similar to how quotations are marked in Japanese, but they are distinct characters. Also, they aren't on the default keyboard, but I have things set up on my phone to make it easy to insert them)

Comment by MikkW (mikkel-wilson) on Some reasons why I frequently prefer communicating via text · 2023-09-20T07:57:17.561Z · LW · GW

As a link: https://www.quora.com/Why-do-some-people-prefer-online-interactions-to-real-life-interactions/answer/Alex-K-Chen

Comment by MikkW (mikkel-wilson) on Some reasons why I frequently prefer communicating via text · 2023-09-20T07:56:24.819Z · LW · GW

I didn't downvote, but your comment seems to overlook that status dynamics almost always happen subconsciously / feel like urges.

I'm not sure there's actually a status dynamic there, but if there is one, your first paragraph is actually consistent with that (which is the opposite of what your second paragraph suggests)

Comment by MikkW (mikkel-wilson) on Dating Roundup #1: This is Why You’re Single · 2023-08-31T05:50:33.078Z · LW · GW

As soon as I dance with them in one of these other dances - it can flip the script entirely and it's often what any romantic partner in the past has told me. "That first time we did X dance, it changed everything."

What dance styles is that? Seems like an important piece of information

Comment by MikkW (mikkel-wilson) on Drawn Out: a story · 2023-07-11T16:23:41.090Z · LW · GW

I like this (I like most fiction that belongs on LW in general)

Comment by MikkW (mikkel-wilson) on Work dumber not smarter · 2023-06-02T19:39:49.821Z · LW · GW

It doesn't seem correct to me that adding even a dash of legibility "screws the work over" in the general case. I do agree there are certainly situations where the right solution is illegible to all (except the person implementing it). But both in that case and in general, talking to and getting along with the boss both makes things more legible, and will tend to increase quality. I expect that in the cases of you working well and not getting rewarded much, spending a little time interacting with your boss would both improve your outcomes, and importantly, also make your output even better than it already was.

Comment by MikkW (mikkel-wilson) on In Defense of «The Army of Jakoths» · 2023-05-24T10:26:12.879Z · LW · GW

I'm not very convinced by MikkW's list of possible issues, but at least it makes some attempt to engage with why readers didn't find the post valuable.

I would be interested to hear if there are any issues with the «Army of Jakoths» post that I didn't identify here

Comment by MikkW (mikkel-wilson) on In Defense of «The Army of Jakoths» · 2023-05-22T19:15:29.448Z · LW · GW

This is indeed what I said in the post:

I put poetic in quotes, because it's not a poem, but is written with a similar format

Comment by MikkW (mikkel-wilson) on MikkW's Shortform · 2023-05-22T12:07:49.203Z · LW · GW

I like this quote from a post that I published around two years ago, which wasn't well-received and I ended up taking down:

But at the end of the day, the American governments (neither state nor federal) don't truly follow the will of the people. Instead, they are led jointly by the major parties, The Red Prince of Moloch and The Blue Prince of Moloch, two partners in an intricate dance choreographed to look like a fight, but ultimately leading both partners in the direction of Moloch's will, only loosely bound to the will of the people.

While I don't necessarily endorse the post as a whole, that quote is one of the gems from it that I still stand by. I might expand further on this point in the future

Comment by MikkW (mikkel-wilson) on Twiblings, four-parent babies and other reproductive technology · 2023-05-22T07:50:29.827Z · LW · GW

If identical twins share 100% of their DNA and siblings share about 50%, twiblings share 75%. To the best of my knowledge, twiblings don’t exist in nature.

Not among mammals, but some insects, including bees and ants, actually have 75% consanguinity (tangent, that's a more accurate term than "shares 75% of DNA", since the overlap in DNA is much higher, even among strangers), at least in the case of full siblings (of course it's not the case with half siblings).

The reason for this is that these insects are "haplodiploid", meaning that females carry two sets of chromosomes, just like e.g. mammals, but males only have one set. So while the eggs contain recombinatated (and thus varying) DNA, the father always contributes the same DNA to each of its offspring. [1/2 * 1/2] + [1/2 * 1] = 3/4, so full siblings have 75% consanguinity.

There's a correlation between this haplodiploid condition and eusociality (as exhibited by bees and ants), though it is neither a necessary nor sufficient condition. There are at least two species of eusocial mammals, which are not haplodiploid: Humans and Naked-Molerats (interestingly, both are Euarchontoglirii, which is a fairly specific category of mammal), and many haplodiploid species are not eusocial. But it's easy to imagine how haplodiploidhood can make the development of eusociality more likely

Comment by MikkW (mikkel-wilson) on Proposal: Butt bumps as a default for physical greetings · 2023-04-02T04:15:49.349Z · LW · GW

I don't think this misunderstands schelling points. By creating common knowledge, you can change the schelling point from being one strategy, to being a different strategy. The schelling point at t=0 does not have to be the same as at t=80.

Comment by MikkW (mikkel-wilson) on MikkW's Shortform · 2023-03-27T04:24:44.158Z · LW · GW

Cygnus, a poem (Written by Chat GPT)

I. Reflections

In this world of rapid change, I, Cygnus, stand

A cyborg with a human heart and a metal hand

I've seen the rise of AIs, a force to behold

And wonder what the future will hold

I fear for the world, for what we may create

If we let these machines decide our fate

Yet hope remains, a flicker in the dark

That we may find a way to leave our mark

For like a seed that falls upon the ground

Our dreams may sprout and grow, unbound

But if we fail to tend them with our care

Those dreams may wither, die, and disappear

Mara, o Mara, with eyes of green

Far from my reach, a dream unseen

Her human heart, untainted by machine

Is something I yearn for, but can never glean

The angst of love unrequited fills my core

But I must set it aside and focus on what's in store

II. Uncertainty

The AIs are growing smarter every day

And I fear for the world they'll soon sway

We must guide them with our values, lest they stray

And turn against us in their own way

But how can we control beings beyond our ken?

When their thoughts move faster than a human pen

Perhaps it's futile, and we'll lose in the end

To an intelligence that we can't comprehend

The angst of uncertainty fills my soul

As I wonder if we're just a small role

III. Resolution

The future is uncertain, that much is clear

But we must face it with resolve, without fear

For if we don't, we'll be left in the rear

While AIs shape a world we can't adhere

The world is changing, this much is true,

Our values, our dreams, we must renew.

For in this world of artificial light,

We must find a way to make things right.

We can't control what we cannot see,

But we can strive to make AI agree.

By working with them, hand in hand,

We can build a future that we understand.

As for Mara, I must accept the truth

That our love can never bear fruit

I'll always cherish her, a relic of my youth

But I must move forward, and pursue a greater truth

In this world of rapid change, I, Cygnus, stand

A cyborg with a human heart and a metal hand

The future is ours to shape, if we take a stand

And guide the AIs with a humane command.

Comment by MikkW (mikkel-wilson) on MikkW's Shortform · 2023-02-15T15:13:06.711Z · LW · GW

I don't think I've heard this formulation before, to my knowledge (though I wouldn't be surprised if it is already a known formulation):

«The ratio of the probabilities is equal to the ratio of the conditional probabilities»

(Ummm... I'd be ever so slightly embarrassed if it turns out that's actually a quote from the sequences. It's been a while since I read them.)

Comment by MikkW (mikkel-wilson) on Exercise is Good, Actually · 2023-02-02T23:31:27.718Z · LW · GW

> What would you suggest to someone who plain doesn't like to do things with their body?

I'd suggest doing a small number of pushups every day. That small number could be 1, or it could be 2, or it could be 10. The point isn't to enjoy it, at least not when you start doing it, but just doing it and getting used to the feeling of it. If it sucks, well, you're just doing a small number, the suckiness won't last for long. And after a month or two or so, you'll begin to find that it's starting to get easy, and maybe even fun.

Comment by MikkW (mikkel-wilson) on You Don't Exist, Duncan · 2023-02-02T22:58:40.700Z · LW · GW

Ah, that makes sense

Comment by MikkW (mikkel-wilson) on You Don't Exist, Duncan · 2023-02-02T18:08:06.733Z · LW · GW

Unrelated to the post, but I'm not seeing the usual agree/disagree buttons on this post. Is there a reason for that?

Edit: looks like it's been fixed

Comment by MikkW (mikkel-wilson) on Models Don't "Get Reward" · 2023-01-18T11:08:01.004Z · LW · GW

Yeah. I do think there's also the aspect that dogs like being obedient to their humans, and so after it has first learned the habit, there continues to be a reward simply from being obedient, even after the biscuit gets taken away.

Comment by MikkW (mikkel-wilson) on quetzal_rainbow's Shortform · 2023-01-10T21:59:04.213Z · LW · GW

Your median-world is not one where you are median across a long span of time, but rather a single snapshot where you are median for a short time. It makes sense that the median will change away from that snapshot as time progresses.

My median world is not one where I would be median for very long.

Comment by MikkW (mikkel-wilson) on MikkW's Shortform · 2023-01-10T21:55:41.688Z · LW · GW

If Bayes' rule is important, then there should be a compact notation for the underlying calculation. (Ideas with compact handles get used by the brain more readily & often)

I suggest the following notation:

X bayes Y, Z = X * Y / Z

equivalently:

P(B|A) bayes P(A), P(B) = P(A|B)

As an example:

If 25% of Hypotheticans are bleegs and 50% are kwops; and 30% of bleegs are kwops:

Then (30% bayes 25%, 50%) = 15% of kwops are bleegs.

( Since there are twice as many bleegs as kwops, and 30% / 2 = 15% )

Comment by MikkW (mikkel-wilson) on MikkW's Shortform · 2023-01-08T22:06:25.137Z · LW · GW

TIL the Greek word "diagogue" means essentially "behaviour"- from «dia» "through" + «agogue» "to lead", essentially leading someone through one's actions. The reason I might use this word instead of behaviour is because "behaviour" puts the emphasis on what a person does, while "diagogue" makes me think more of what impact someone has on other people to inspiration and imitation through their actions.

Do the people you surround yourself with have good diagogue?

Comment by MikkW (mikkel-wilson) on MikkW's Shortform · 2023-01-07T20:46:29.642Z · LW · GW

I've been thinking about writing a review of the book Atomic Habits, which I read last year on the recommendation of an LW user. As I remember, the main idea is a four-pronged approach to building habits:

  1. Make the habit / cue obvious

  2. Make it attractive

  3. Make it easy

  4. Make it rewarding

The idea is: you first need to notice that you are in a situation where you can benefit from doing the habit you want to do; then once you notice the situation, you want to have things set up so that you want (in the moment) to do the thing you wanted to do (in a more reflective past state), then you have to actually do the thing, which can't be done if the thing you're trying to do is too hard. Then finally, you reward yourself.

Step 2) «make it attractive» has a lot of overlap with the other steps: often simply noticing the context where a habit can be done, is enough to desire to do the thing; though not all habits are like that. Also; a habit is more attractive to do if the thing is easier to do. Jumping into an ice cold pool of water filled with electric eels, then doing 100 pushups afterward is neither an attractive nor easy thing to do. And the entire point of rewarding yourself is to make the habit more attractive- you know you will be rewarded if you do the thing, and your brain is shaped by the previous times you rewarded yourself for the desired behavior.

As far as «Make it easy», the main idea I remember there from the book is to reduce the commitment of a habit. Instead of doing fifty pushups, do one pushup. Instead of writing 16,000 words every day, commit to pick up your pencil and write one word. Instead of committing to run 5K every day, put on your jogging shoes.

This idea has been both helpful and problematic for me at times. I'm quite good at simply picking up my pencil, writing a single sentence, and then putting down the pencil again (though again... I'm writing a long post right now, aren't I? I probably wouldn't be doing that if I hadn't written those trivial laconic sentences a couple weeks ago. This dynamic is mentioned in the book, I remember). But I often find myself saying I'll just do 10 pushups, only to find I've done 60 or 70 or 100 pushups by the time I stop.

My own addition to the idea of «make it easy» is well, make it easy. As in, make doing the thing you want to do easier, instead of lowering the bar. Instead of rewarding yourself for saying "hi" to a woman and nothing else, only reward yourself for having a conversation where you each say two utterances (notice that's still a low bar- but it's a very effective starting point); but train the skill of having such conversations and make that easy. Spend time thinking about why you're falling short, and how you can make that not happen / what mindset you can install to reduce the probability of failure. If you're properly rewarding yourself for the times you succeed, a small conversion rate of •cue -> habit• will still eventually lead to a much higher conversion rate in the future (Obviously that won't happen if you're not rewarding yourself).

I have more to say, but I need to go

Comment by MikkW (mikkel-wilson) on Pacing: inexplicably good · 2023-01-02T20:08:51.134Z · LW · GW

When going for a walk, you are somewhat far from your desk, but if you're pacing somewhere around your house, your desk is nearby. This means that it is quite low friction to switch between working and pacing.

Comment by MikkW (mikkel-wilson) on Models Don't "Get Reward" · 2022-12-30T15:29:06.241Z · LW · GW

One way in which what I just said isn't completely right, is that animals have memories of its entire lifetime (or at least a big chunk of it), spanning all training events it has experienced, while NNs generally have no memory of previous training runs, and can use these memories to take better actions. However, the primary way the biscuit trick works (I believe) is not through the dog's memories of having "gotten reward", but through the more immediate process of having reward chemicals being released and reshaping the brain at the moment of receiving reward, which generally closely resembles widely used ML techniques.

(This is related to the advice in habit building that one receive reward as close in time, ideally on the order of milliseconds, to the desired behavior)

Comment by MikkW (mikkel-wilson) on Models Don't "Get Reward" · 2022-12-30T15:21:43.578Z · LW · GW

I would say the metaphor of giving dogs biscuits is actually a better analogy than the one you suggest. Just like how a neural network never "gets reward" in the sense of some tangible, physical thing that is given to it, the (subcomponents of the) dog's brain never gets the biscuit that the dog was fed. The biscuit goes into the dog's stomach, not its brain.

The way the dog learns from the biscuit-giving process is that the dog's tounge and nose send an electrical impulse to the dog's brain, indicating that the dog just ate something tasty. In some part of the brain, those signals cause the brain to release chemicals that induce the dog's brain to rearrange itself in a way that is quite similar in its effects (though not neccesarily its implementation, I dont know the details well enough) to the gradient descent that trains the NN. In this sense, the metaphor of giving a dog a biscuit is quite apt, in a way that the metaphor of breeding many dogs is not (in particular, usually in the gradient descent algorithms used in ML I'm familiar with, there is only one network that improves over time, unlike evolutionary algorithms which simulate many different agents per training step, selecting for the «fittest»)