Posts

Precision of Sets of Forecasts 2023-09-19T18:19:18.053Z
Have Attention Spans Been Declining? 2023-09-08T14:11:55.224Z
The Evidence for Question Decomposition is Weak 2023-08-28T15:46:31.529Z
If I Was An Eccentric Trillionaire 2023-08-09T07:56:46.259Z
Self-Blinded Caffeine RCT 2023-06-27T12:38:55.354Z
Properties of Good Textbooks 2023-05-07T08:38:05.243Z
Iqisa: A Library For Handling Forecasting Datasets 2023-04-14T15:16:55.726Z
Subscripts for Probabilities 2023-04-13T18:32:17.267Z
High Status Eschews Quantification of Performance 2023-03-19T22:14:16.523Z
Open & Welcome Thread — March 2023 2023-03-01T09:30:09.639Z
Open & Welcome Thread - December 2022 2022-12-04T15:06:00.579Z
Open & Welcome Thread - Oct 2022 2022-10-02T11:04:06.762Z
Turning Some Inconsistent Preferences into Consistent Ones 2022-07-18T18:40:02.243Z
What is Going On With CFAR? 2022-05-28T15:21:51.397Z
Range and Forecasting Accuracy 2022-05-27T18:47:44.315Z
scipy.optimize.curve_fit Is Awesome 2022-05-07T10:57:28.278Z
Brain-Computer Interfaces and AI Alignment 2021-08-28T19:48:52.614Z
Open & Welcome Thread – April 2021 2021-04-04T19:25:09.049Z
An Exploratory Toy AI Takeoff Model 2021-01-13T18:13:14.237Z
Considerations on Cryonics 2020-08-03T17:30:42.307Z
"Do Nothing" utility function, 3½ years later? 2020-07-20T11:09:36.946Z
niplav's Shortform 2020-06-20T21:15:06.105Z

Comments

Comment by niplav on MikkW's Shortform · 2023-09-22T21:24:11.598Z · LW · GW

I like corner brackets more!

Comment by niplav on Subscripts for Probabilities · 2023-09-12T14:23:08.735Z · LW · GW

so to quantify sentence S, i prefer ur suggestion "I think it'll rain tomorrow". the percentage is supposed to modify "I think" anyway, so it makes more sense to make them adjacent. it's just more work bc it's novel syntax, but that's temporary.

The principles you propose make a lot of sense! Dropping "I think" or "My best guess" is then for the best.

Also, the underset/underbraces stuff is promising but too much to spend weirdness points on.

Comment by niplav on A Bat and Ball made me Sad · 2023-09-11T17:47:53.550Z · LW · GW

See also drossbucket 2018.

Comment by niplav on Have Attention Spans Been Declining? · 2023-09-10T11:18:28.587Z · LW · GW

Better at getting specific information out of a presented piece of information (e.g. becoming skilled at skimming), and better at putting tasks aside when they need some time to be processed in the background.

Comment by niplav on New User's Guide to LessWrong · 2023-09-09T15:07:42.971Z · LW · GW

I agree that, if anything, there is a counternorm to that, and also with the observation that such comments are often (sadly) ignored.

Comment by niplav on Have Attention Spans Been Declining? · 2023-09-09T11:50:25.137Z · LW · GW

Yeah, there's all these damn confounders :-|

Comment by niplav on Have Attention Spans Been Declining? · 2023-09-09T11:47:52.017Z · LW · GW

It looks like movies have been getting longer:

There seems to have been a slight dip/stagnation in the 2000s and 2010s, though, and the text concludes with:

In conclusion, our intuition was wrong. There is no trend in the movies runtime. The differences are too small to be noticed. We can say that for the last 60 years movies on average have the same length. No matter what criteria we take into account, the result is the same.

Comment by niplav on I Converted Book I of The Sequences Into A Zoomer-Readable Format · 2023-09-09T11:41:55.714Z · LW · GW

I can deliver, it just takes a while.

Comment by niplav on Have Attention Spans Been Declining? · 2023-09-08T19:22:31.798Z · LW · GW

Sorry @RomanHauksson! Reversed the downvote. I didn't realize this was a joke 🤦

Comment by niplav on Have Attention Spans Been Declining? · 2023-09-08T19:21:13.653Z · LW · GW

Mark 2023 looks at this (I've summarized the relevant stuff at the end of this section), and the time spent per task has indeed been declining. I don't know about research looking at task classification, but that would be interesting to do.

My current take is that this provides medium evidence—but I think this could also be evidence of higher ability at prioritization.

Comment by niplav on Have Attention Spans Been Declining? · 2023-09-08T16:04:31.226Z · LW · GW

I found this comment both unhelpful and anti-scholarship, and have therefore strong-downvoted it.

Comment by niplav on The Evidence for Question Decomposition is Weak · 2023-09-08T14:01:43.899Z · LW · GW

Maybe to clarify, by question decomposition I mean techniques such as saying "$E$ will happen if and only if $E_1$ and $E_2$ and … and $E_n$ all happen, so we estimate $P(E_1)$ and $P(E_2)$ &c, and then multiply them together to estimate $P(E)$", which is how it is done in the sources I linked.
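
As a toy numeric illustration (numbers made up, and note that the multiplication assumes the sub-events are independent):

```python
# Hypothetical sub-event estimates for "E happens iff E_1, ..., E_n all happen".
import math

subevent_probabilities = [0.9, 0.7, 0.8]  # made-up P(E_1), P(E_2), P(E_3)

# Decomposed estimate: multiply the sub-event probabilities together.
p_e = math.prod(subevent_probabilities)
print(p_e)  # 0.504
```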

Do you by chance have links about how this is done in sports betting? I'd be interested in that.

Comment by niplav on The Evidence for Question Decomposition is Weak · 2023-08-29T11:07:21.926Z · LW · GW

I don't remember anything specific from reading their stuff, but that would of course be useful. Sadly, I haven't been able to find any more recent investigations into decomposition, e.g. Connected Papers for MacGregor 1999 gives nothing worthwhile after 2006 on a first skim, but I'll perhaps look more at it.

Comment by niplav on A list of core AI safety problems and how I hope to solve them · 2023-08-28T15:20:30.636Z · LW · GW

Thanks for the clarification!

Comment by niplav on niplav's Shortform · 2023-08-28T14:54:26.433Z · LW · GW

Comment by niplav on A list of core AI safety problems and how I hope to solve them · 2023-08-28T14:40:03.342Z · LW · GW

One issue I see with this plan is that it seems to rely on some mathematics that appears to me not to be fully worked out, e.g. infrabayesianism and «boundaries» (for which I haven't been able to find a full mathematical description). It's unclear to me whether these will actually be finished in time, and, if they are, whether they lead to algorithms that are efficient enough to be scaled to such an ambitious project.

Comment by niplav on The Game of Dominance · 2023-08-28T14:24:25.083Z · LW · GW

I disagree with this: LLMs seem to be ~equally good at arguing for false and for true positions if you ask them to (as evidenced by the many incorrect proofs produced by Galactica).

Comment by niplav on Using Points to Rate Different Kinds of Evidence · 2023-08-28T14:14:02.452Z · LW · GW

I like this!

I would reduce the point score of "A simple math proof proves X" from 20 down to 16. As far as I know, there is no literature on how often mathematical proofs are faulty, but personal experience and anecdotes from others suggest it's not rare: I have many times found "bugs" in my own proofs (5 points), and someone I know who works in formal verification would often update me on the errors they found in published proofs (8 points). I'd give a higher weight (18 points) to formally verified proofs. I'm not deducting more points because of the curious observation that even if a proof is faulty, the result it shows is usually true (just as neural networks want to learn, mathematical proofs want to be correct).

Additionally, proofs purportedly about real-world phenomena are often importance-hacked and one needs to read the exact result pretty carefully, which is often not done.

  • Randomized Controlled Trial (RCT) results: 25 points
  • Meta-analysis of multiple RCTs: 23 points

I find it amusing that GPT-4 considers meta-analyses to be worsening the results they attempt to pool together.

Comment by niplav on Shortform · 2023-08-21T18:39:29.312Z · LW · GW

I have.

(What is your criterion for "physically fit"?)

Comment by niplav on We Should Prepare for a Larger Representation of Academia in AI Safety · 2023-08-14T00:10:01.800Z · LW · GW

See also Leech 2020.

Comment by niplav on If I Was An Eccentric Trillionaire · 2023-08-09T20:42:22.482Z · LW · GW

Thanks for the heads up! I'll correct it.

Comment by niplav on If I Was An Eccentric Trillionaire · 2023-08-09T09:57:07.052Z · LW · GW

Agreed! I'd have much more to add, but at ~7k words I decided to publish.

Or, in other words,

Comment by niplav on Feedbackloop-first Rationality · 2023-08-09T07:47:27.577Z · LW · GW

Corollary: It'd probably be much better if people used proof assistants like Lean or Coq to teach mathematics. There is some overhead, sure, but they have much better feedback loops than normal mathematics.
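
As a minimal sketch of the feedback loop I mean (a toy Lean 4 example of my own, not anything from the post):

```lean
-- Lean shows the remaining goal after every tactic, so a wrong step
-- fails immediately instead of hiding in a hand-written proof.
example (a b : Nat) : a + b = b + a := by
  exact Nat.add_comm a b
```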

Comment by niplav on Feedbackloop-first Rationality · 2023-08-08T21:24:41.414Z · LW · GW

Oh nice, another post I don't need to write anymore :-D

Some disjointed thoughts on this I had:

Feedback loops can be characterized along at least three axes:

  • Speed: How quickly you get feedback from actions you take. Archery has a very fast feedback loop: You shoot an arrow and one or two seconds later you see what the outcome is.
  • Noise: How noisy the feedback is. High-frequency trading has fast feedback loops, but they have a lot of noise, and finding the signal is the difficult part.
  • Richness: How much information you're getting. Dating is one example: Online dating has extremely poor feedback loops: only a couple of bits (did the other person respond, and what did they respond) per interaction, while talking & flirting with people in person has extremely rich feedback (the entire visual+acoustic+sensory field (plus perhaps smell? Don't know much about human pheromones))—probably kilobytes per minimal motor-action, and megabytes per second.

Fast & low-noise & rich feedback loops are the best, and improving the feedback loop in any of those dimensions is super valuable.

As an example, forecasting has meh feedback loops: the feedback can be very slow (days at least, but more likely months or years (!)) and is kind of poor (only a few bits per forecast), but at least there's not that much noise (you forecast what the question says—though maybe this is why forecasters really don't like questions resolving on technicalities: they're the closest thing to noise).

But one can improve the richness of the forecasting feedback loop by writing out one's reasoning, so one can update on the entire chain of thought once the resolution comes. Similarly, programming has much better feedback loops than mathematics, which is why I'd recommend that someone learn programming before math (in general, learn things with fast & rich feedback loops earlier and slow & poor ones later).

Also, feedback loops feel to me like they're in the neighbourhood of both flow & addiction? Maybe flow is a feedback loop with a constant or increasing gradient, while addiction is a feedback loop with a decreasing gradient (leading into a local & shallow minimum).

When I started reading the Sequences, I started doing forecasting on Metaculus within 3 months (while still reading them). I think being grounded at that time in actually having to do reasoning with probabilities & receiving feedback in the span of weeks made the experience of reading the Sequences much more lasting to me. I also think that the lack of focus on any rationality verification made it significantly harder to develop an art of rationality. If you have a metric you have something to grind on, even if you abandon it later.

Comment by niplav on The Benevolent Ruler’s Handbook (Part 1): The Policy Problem · 2023-08-06T11:11:29.034Z · LW · GW

See also Building Blocks of Politics: An Overview of Selectorate Theory (Yoav Ravid, 2021).

Comment by niplav on niplav's Shortform · 2023-08-04T13:49:07.673Z · LW · GW

I would simply select a Turing machine in which I am the output of the empty program as the basis of the universal distribution.
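
(For reference, the universal distribution relative to a universal prefix machine $U$ is standardly defined as

```latex
m_U(x) = \sum_{p \,:\, U(p) = x} 2^{-|p|}
```

so a $U$ on which the empty program outputs me gives that output the maximal possible weight $2^{-0} = 1$.)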

Comment by niplav on Open Thread - July 2023 · 2023-07-17T11:07:33.675Z · LW · GW

In case you were wondering: In my personal data I find no relation between recent masturbation and meditation quality, contrary to the claims of many meditative traditions.

Comment by niplav on Self-experiment Protocol: Effect of Chocolate on Sleep · 2023-07-11T11:54:07.177Z · LW · GW

Hi, did anything come out of this experiment?

Comment by niplav on Mati_Roy's Shortform · 2023-07-08T23:25:16.660Z · LW · GW

I believe VR/AR are not going to be as big of a deal as smartphones, and will not produce >$200 billion of revenue in 2030.

Comment by niplav on niplav's Shortform · 2023-07-07T05:31:24.005Z · LW · GW

There are Pareto-improvements: everyone is made better off by their own standards. There are, similarly, Pareto-worsenings: Everyone is made worse off by their own standard, or their welfare is unchanged.

Then there are Kaldor-Hicks improvements, which happen if one e.g. reallocates the available goods so that the better-off could compensate the now worse-off to create a Pareto improvement. This compensation need not occur, it needs to be merely possible.

Now can there be a Kaldor-Hicks-worsening?

The naive version (everyone is worse (or equally well) off, and there is no way of making a single person better off through redistribution) seems too strong: there is probably always a redistribution that gives all the available resources to a single agent.

A simple negation of the criterion then perhaps makes more sense: A change is a Kaldor-Hicks-worsening if and only if everyone is worse (or equally well) off and there is no way of creating a Pareto-improvement through reallocation.

This implies an anti-Kaldor-Hicks-worsening: A change makes everyone worse off, but there is some reallocation that creates a Pareto improvement.

Example: We have a Sadist and a Masochist. The Masochist starts hurting the Sadist, thus creating opportunity cost for them both. Switching the roles creates a Pareto improvement.
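
As a toy sketch of these definitions over per-agent utility vectors (all numbers hypothetical):

```python
# Utility vectors with one entry per agent; all values made up for illustration.

def pareto_improvement(before, after):
    """Everyone at least as well off, and someone strictly better off."""
    return (all(a >= b for b, a in zip(before, after))
            and any(a > b for b, a in zip(before, after)))

def weakly_worse_for_all(before, after):
    """Everyone worse off or unchanged, by their own standards."""
    return all(a <= b for b, a in zip(before, after))

def kaldor_hicks_worsening(before, after, reallocations):
    """Everyone weakly worse off, and no considered reallocation of the new
    situation is a Pareto improvement over the old one."""
    return (weakly_worse_for_all(before, after)
            and not any(pareto_improvement(before, r) for r in reallocations))

before, after = [3, 3], [2, 2]
print(kaldor_hicks_worsening(before, after, [[1, 3], [3, 1]]))  # True
print(kaldor_hicks_worsening(before, after, [[4, 4]]))  # False: an anti-Kaldor-Hicks-worsening
```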

Comment by niplav on Self-Blinded Caffeine RCT · 2023-06-28T15:35:58.144Z · LW · GW

That's a good point, I'll keep it in mind.

Comment by niplav on Self-Blinded Caffeine RCT · 2023-06-28T15:22:26.216Z · LW · GW

Ah, I see. In the table is the log score of my pill predictions (-0.6). I made predictions about the accuracy of my predictions—confusing, perhaps.
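
(For reference, a sketch of how that number is computed—assuming the natural log and binary outcomes; the predictions below are made up:)

```python
import math

def mean_log_score(probs, outcomes):
    # probs: predicted probability that the pill contains caffeine;
    # outcomes: True iff it actually did. Closer to 0 is better.
    return sum(math.log(p if o else 1 - p)
               for p, o in zip(probs, outcomes)) / len(probs)

print(mean_log_score([0.7, 0.6, 0.8, 0.4], [True, True, True, False]))  # ≈ -0.40
```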

Comment by niplav on Self-Blinded Caffeine RCT · 2023-06-28T06:41:13.599Z · LW · GW

Blinding is powerful! Not sure where you get 80% from—do you mean the number of times when I was directionally right in my predictions?

Comment by niplav on Self-Blinded Caffeine RCT · 2023-06-28T06:39:26.533Z · LW · GW

That's a good point. I have, but shelved them for sometime later™. If it were true that withdrawal effects are important, you'd expect the difference in placebo-caffeine scores to drop as the experiment progresses, but I haven't checked yet.
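
(The check itself would be simple—a sketch with made-up data, using scipy:)

```python
# If withdrawal drives the effect, the caffeine-minus-placebo score
# difference should shrink as the experiment progresses.
import numpy as np
from scipy import stats

day = np.arange(10)  # experiment day (hypothetical)
score_diff = np.array([1.2, 0.9, 1.1, 0.8, 0.7, 0.9, 0.5, 0.6, 0.4, 0.5])

result = stats.linregress(day, score_diff)
print(result.slope, result.pvalue)  # a clearly negative slope would support the withdrawal story
```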

Comment by niplav on Did Bengio and Tegmark lose a debate about AI x-risk against LeCun and Mitchell? · 2023-06-27T12:19:46.448Z · LW · GW

Ah, okay. That makes much more sense.

Comment by niplav on Did Bengio and Tegmark lose a debate about AI x-risk against LeCun and Mitchell? · 2023-06-27T10:38:05.066Z · LW · GW

I am of course heavily biased. I would have counted myself among the 8% of people who were unwilling to change their minds

Excuse me but what the fuck‽ Maybe kudos for being honest about your disposition.

But this is LessWrong! We consider this kind of stance to be deeply flawed and corrosive to figuring out what facts constitute the world. The rest of the post is okay, but I hope that people can see that this is a warning sign.

Comment by niplav on [$10k bounty] Read and compile Robin Hanson’s best posts · 2023-06-23T10:11:44.469Z · LW · GW

Does anyone know whether this bounty is still standing?

Comment by niplav on AXRP Episode 22 - Shard Theory with Quintin Pope · 2023-06-16T18:06:48.621Z · LW · GW

This podcast does not appear on my feed on Google Podcasts.

Comment by niplav on Dalcy Bremin's Shortform · 2023-06-16T18:04:52.791Z · LW · GW

The MIRI Research Guide recommends An Introduction to Decision Theory and Game Theory: An Introduction. I have read neither and am simply relaying the recommendation.

Comment by niplav on Open Thread: June 2023 (Inline Reacts!) · 2023-06-12T10:55:19.471Z · LW · GW

Courtesy of the programming language checklist.

So you're proposing a new economic policy. Here's why your policy will not work.

Your policy depends on science/theorizing that:
    ☐ has been replicated only once
    ☐ has failed to replicate
    ☐ for which there exist no replication attempts
    ☐ was last taken seriously sometime around 1900
    ☐ requires a dynamic stochastic general equilibrium model with 200 free parameters to track reality perfectly
Your policy would:
    ☐ disincentivize good things
    ☐ incentivize bad things
    ☐ both
    ☐ be wildly unpopular, even though you think it's the best thing since sliced bread (it's not)
    ☐ You seem to think that taking options away from people helps them
Your policy just reinvents
    ☐ land-value taxes, but worse
    ☐ universal basic income, but worse
    ☐ price discrimination, but worse
    ☐ demand subsidy, but worse
    ☐ demand subsidy, better, but that's still no excuse
    ☐ Your policy sneakily redistributes money from poor to rich people
    ☐ Your policy only works if every country on earth accepts it at the same time
    ☐ You actually have no idea what failure/success of your policy would look like
You claim it fixes
    ☐ climate change
    ☐ godlessness
    ☐ police violence
    ☐ wet socks
    ☐ teenage depression
    ☐ rising rents
    ☐ war
    ☐ falling/rising sperm counts/testosterone levels
You seem to assume that
    ☐ privatization always works
    ☐ privatization never works
    ☐ your country will never become a dictatorship
    ☐ your country will always stay a dictatorship
    the cost of coordination is
        ☐ negligible
        ☐ zero
        ☐ negative
    ☐ Your policy is a Pareto-worsening
In conclusion,
    ☐ You have copied and mashed together some good ideas with some mediocre ideas
    ☐ You have not even tried to understand basic economics/political science/sociology concepts
    ☐ Living under your policy is an adequate punishment for inventing it

Comment by niplav on Czynski's Shortform · 2023-06-09T18:35:32.004Z · LW · GW

One could introduce 🌵 for such users.

Comment by niplav on Quinn's Shortform · 2023-06-09T18:34:08.961Z · LW · GW

I like more.

Comment by niplav on Why is violence against AI labs a taboo? · 2023-05-27T14:46:16.861Z · LW · GW

Some people definitely say they believe climate change will kill all humans.

Comment by niplav on How could I measure the nootropic benefits testosterone injections may have? · 2023-05-19T00:01:01.654Z · LW · GW

In a current caffeine self-experiment I'm tracking the following variables:

  • Meditation performance using Meditavo (unfortunately only exportable using the premium version)
    • Mindfulness
    • Concentration
  • Cognition using Anki flashcard scores, exported from Collection.anki2 using sqlite3 (see the sketch after this list)
  • Mood via Moodpatterns
    • Happiness, Contentment and Stress
    • I use the additional interested/disinterested spectrum for tracking horniness
  • Creativity and productivity for the day via a simple self-written script that activates via scron
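
For the Anki export, a minimal sketch (this assumes Anki's standard revlog table; the collection path is illustrative):

```python
# Pull review timestamp, ease (button pressed, 1=Again..4=Easy) and answer
# time from Anki's SQLite database.
import sqlite3

con = sqlite3.connect("Collection.anki2")  # adjust to your collection's path
rows = con.execute("SELECT id, ease, time FROM revlog").fetchall()
con.close()

for review_epoch_ms, ease, time_ms in rows[:5]:
    print(review_epoch_ms, ease, time_ms)
```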

More here. Maybe you could try to self-blind yourself, e.g. have half of your injections be saline and the other half testosterone? (I don't know much about how testosterone is administered—maybe the benefits accrue slowly enough that randomization doesn't make sense.)

Happy to respond to more questions as well, but I haven't finished my experiment yet so I can't provide any code for analysis.

Comment by niplav on New User's Guide to LessWrong · 2023-05-17T07:44:26.428Z · LW · GW

Historically, LessWrong was seeded by the writings of Eliezer Yudkowsky, an artificial intelligence researcher.

He usually describes himself as a decision theorist if asked for a description of his job.

Comment by niplav on adamzerner's Shortform · 2023-05-15T00:08:22.374Z · LW · GW

Man I think I am providing value to the world by posting and commenting here. If it cost money I would simply stop posting here, and not post anywhere else.

The value flows in both directions. I'm fine not getting paid, but paying would send a signal of "what you do here isn't appreciated".

(Maybe I'd feel differently if the money were reimbursed to particularly good posters? But then, Goodhart's law.)

Comment by niplav on TurnTrout's shortform feed · 2023-05-11T20:20:59.193Z · LW · GW

I encourage you to fix the mistake. (I can't guarantee that the fix will be incorporated, but for something this important it's worth a try).

Comment by niplav on DragonGod's Shortform · 2023-05-10T06:05:01.031Z · LW · GW

I notice I am confused. How do you violate an axiom (completeness) without behaving in a way that violates completeness? I don't think you need an internal representation.

Elaborating more, I am not sure how you even display a behavior that violates completeness. If you're given a choice between only universe-histories $h_1$ and $h_2$, and your preferences are incomplete over them, what do you do? As soon as you reliably act to choose one over the other, for any such pair, you have algorithmically-revealed complete preferences.

If you don't reliably choose one over the other, what do you do then?

  • Choose randomly? But then I'd guess you are again Dutch-bookable. And according to which distribution?
  • Your choice is undefined? That seems both kinda bad and also Dutch-bookable to me tbh. Also, I don't see the difference between this and random choice (short of going up in flames, which would constitute a third, hitherto unassumed option).
  • Go away/refuse the trade &c? But this is denying the premise! You only have universe-histories $h_1$ and $h_2$ to choose between!

I think what happens with humans is that they are often incomplete over very low-ranking worlds and are instead searching for policies to find high-ranking worlds while not choosing. I think incompleteness might be fine if there are two options you can guarantee to avoid, but with adversarial dynamics that becomes more and more difficult.

Comment by niplav on Could A Superintelligence Out-Argue A Doomer? · 2023-05-10T05:46:56.359Z · LW · GW

To clarify the comment for @tjaffee, a superintelligence could do the following:

  • Use invalid but extremely convincing arguments that make the "doomer"[1] change his mind. This appears realistic because sometimes people become convinced of false things through invalid argumentation[2].
  • Give a complete plan for an aligned superintelligence, shutting itself down in the process, creating a true and probably convincing argument (this is vanishingly unlikely).

  1. Ugh. ↩︎

  2. Like maybe once or twice. ↩︎

Comment by niplav on Properties of Good Textbooks · 2023-05-09T22:43:19.096Z · LW · GW

Nice. Would you mind if I took inspiration from your list (crediting you, of course)?