Posts

Is a near-term, self-sustaining Mars colony impossible? 2020-06-03T22:43:08.501Z
ESRogs's Shortform 2020-04-29T08:03:28.820Z
Dominic Cummings: "we’re hiring data scientists, project managers, policy experts, assorted weirdos" 2020-01-03T00:33:09.994Z
'Longtermism' definitional discussion on EA Forum 2019-08-02T23:53:03.731Z
Henry Kissinger: AI Could Mean the End of Human History 2018-05-15T20:11:11.136Z
AskReddit: Hard Pills to Swallow 2018-05-14T11:20:37.470Z
Predicting Future Morality 2018-05-06T07:17:16.548Z
AI Safety via Debate 2018-05-05T02:11:25.655Z
FLI awards prize to Arkhipov’s relatives 2017-10-28T19:40:43.928Z
Functional Decision Theory: A New Theory of Instrumental Rationality 2017-10-20T08:09:25.645Z
A Software Agent Illustrating Some Features of an Illusionist Account of Consciousness 2017-10-17T07:42:28.822Z
Neuralink and the Brain’s Magical Future 2017-04-23T07:27:30.817Z
Request for help with economic analysis related to AI forecasting 2016-02-06T01:27:39.810Z
[Link] AlphaGo: Mastering the ancient game of Go with Machine Learning 2016-01-27T21:04:55.183Z
[LINK] Deep Learning Machine Teaches Itself Chess in 72 Hours 2015-09-14T19:38:11.447Z
[Link] First almost fully-formed human [foetus] brain grown in lab, researchers claim 2015-08-19T06:37:21.049Z
[Link] Neural networks trained on expert Go games have just made a major leap 2015-01-02T15:48:16.283Z
[LINK] Attention Schema Theory of Consciousness 2013-08-25T22:30:01.903Z
[LINK] Well-written article on the Future of Humanity Institute and Existential Risk 2013-03-02T12:36:39.402Z
The Center for Sustainable Nanotechnology 2013-02-26T06:55:18.542Z

Comments

Comment by ESRogs on Finite Factored Sets · 2021-05-24T03:08:22.177Z · LW · GW

Let , where  and 

[...] The second rule says that  is orthogonal to itself

Should that be "is not orthogonal to itself"? I thought the $\not\perp$ meant non-orthogonal, so would think the second rule means that it is not orthogonal to itself.

(The transcript accurately reflects what was said in the talk, but I'm asking whether Scott misspoke.)

Comment by ESRogs on Challenge: know everything that the best go bot knows about go · 2021-05-11T07:20:07.457Z · LW · GW

But once you let it do more computation, then it doesn't have to know anything at all, right? Like, maybe the best go bot is, "Train an AlphaZero-like algorithm for a million years, and then use it to play."

I know more about go than that bot starts out knowing, but less than it will know after it does computation.

I wonder if, when you use the word "know", you mean some kind of distilled, compressed, easily explained knowledge?

Comment by ESRogs on Challenge: know everything that the best go bot knows about go · 2021-05-11T07:13:50.544Z · LW · GW

You have to be able to know literally everything that the best go bot that you have access to knows about go.

In your mind, is this well-defined? Or are you thinking of a major part of the challenge as being to operationalize what this means?

(I don't know what it means.)

Comment by ESRogs on MIRI location optimization (and related topics) discussion · 2021-05-08T23:57:49.892Z · LW · GW

(I don't expect to live on or immediately next to the proto-campus, but it would be cool to be somewhat nearby.)

Comment by ESRogs on MIRI location optimization (and related topics) discussion · 2021-05-08T23:56:14.112Z · LW · GW

I am expecting to "settle down" in either the Bay Area or Seattle. So I like the Bellingham option.

Comment by ESRogs on The AI Timelines Scam · 2021-05-08T03:59:22.401Z · LW · GW

Thanks!

Comment by ESRogs on The irrelevance of test scores is greatly exaggerated · 2021-04-17T21:04:55.176Z · LW · GW

Here, there's minimal dependence on ACT, but a negative dependence on ACT², meaning that extreme ACT scores (high or low) both lead to lower likely-to-graduate scores.

Does that seem counterintuitive to you? Remember, we are taking a student who is already enrolled in a particular known college and predicting how likely they are to graduate from that college.

Sounds like a classic example of Simpson's paradox, no?

Comment by ESRogs on People Will Listen · 2021-04-15T22:30:55.544Z · LW · GW

My current theory for what happened is that everyone bought into this delusion about the value of bitcoin, but that unlike other bubbles it didn't burst because Bitcoin has a limited supply and there is literally nothing to anchor its value. So there's no point where investors give up and sell because there is literally no point at which it's overpriced.

This actually sounds pretty close to what you might call the "bubble theory of money": that money is a bubble that doesn't pop, that certain (relatively) useless commodities can become money if enough people think of them that way, and when that happens their price is inflated, relative to their use value.

This isn't something that will happen to every commodity. Whether it happens depends both on the properties of the commodity, and also on things like memes and Schelling points.

Bitcoin has enough useful properties (it's like gold, but digital), and, because of its first-mover advantage, is the Schelling point for digital store-of-value (not that it couldn't be replaced, but it's a very uphill battle), so it has become money, in this sense.

(On the memes-and-Schelling-points thing, see also: The Most Important Scarce Resource is Legitimacy, by Vitalik Buterin.)

Comment by ESRogs on "AI and Compute" trend isn't predictive of what is happening · 2021-04-04T18:12:19.215Z · LW · GW

the first 5-12 million dollar tab

You mean GPT-3? Are you asking whether it's made enough money to pay for itself yet?

Comment by ESRogs on What is the VIX? · 2021-02-26T16:49:03.011Z · LW · GW

I believe that you (and the Twitter thread) are saying something meaningful, but I'm having trouble parsing it.

I had thought of the difference between variance and volatility as just that one is the square of the other. So saying that the VIX is "variance in vol units, but not volatility" doesn't mean anything to me.

I think these are the critical tweets:

VIX is an index that measures the market implied level of 1-month variance on the S&P 500, or the square root thereof (to put it back in units we are used to).

This is not the same as volatility. A variance swap’s payoff is proportional to volatility squared. If you are short a variance swap at 10%, and then realized volatility turns out to be 40%, you lose your notional vega exposure times 16 (= 40^2 / 10^2 ).

To compensate for this, an equity index variance swap level is usually 2-3 points above the corresponding at the money implied volatility. So don’t look at VIX versus realized vol and make statements about risk premium without recognizing this extreme tail risk.

I was with him at "a variance swap's payoff is proportional to volatility squared". That matches my understanding of volatility as the square root of variance. But then I don't get the next point about realized volatility needing to be "compensated for".
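To check that I'm at least parsing the part I do follow, here's a toy numeric sketch of the quadratic-payoff point (the function name and numbers are mine, purely for illustration):

```python
# A variance swap's P&L is linear in realized *variance*, hence quadratic in realized vol.
def short_var_swap_pnl(strike_vol, realized_vol, var_notional=1.0):
    """P&L of a short variance swap, per unit of variance notional (toy version)."""
    return var_notional * (strike_vol**2 - realized_vol**2)

strike = 0.10
for realized in (0.10, 0.20, 0.40):
    print(realized, round(short_var_swap_pnl(strike, realized), 4))
# 0.1   0.0    (realized vol = strike)
# 0.2  -0.03   (2x the strike vol -> 4x the variance)
# 0.4  -0.15   (4x the strike vol -> 16x the variance -- the "times 16" in the tweet)
```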

Anybody care to explain?

Comment by ESRogs on The Future of Nuclear Arms Control? · 2021-02-26T16:38:54.342Z · LW · GW

Thanks!

Comment by ESRogs on How Should We Respond to Cade Metz? · 2021-02-15T22:48:03.974Z · LW · GW

Link for the curious: https://www.newyorker.com/culture/annals-of-inquiry/slate-star-codex-and-silicon-valleys-war-against-the-media

Comment by ESRogs on The ecology of conviction · 2021-02-15T22:28:27.355Z · LW · GW

Which all make them even easier targets for criticism, and make confident enthusiasm for an idea increasingly correlated with being some kind of arrogant fool.

But it also means conviction is undervalued, and it might be a good time to buy low!

Comment by ESRogs on Bitcoin and ESG Investing · 2021-02-15T22:09:12.630Z · LW · GW

I hold positions in Bitcoin, Ethereum, and Tesla through Exchange Traded Funds.

For Bitcoin and Ether, do you mean the Grayscale trusts, GBTC and ETHE? My impression is that these are similar to ETFs, but not exactly the same thing, and I'm not aware of other ETFs that give you exposure to crypto (except for the small amount of exposure you'd get from owning shares in companies that have a little BTC on their balance sheet, like Tesla, Square, or MicroStrategy).

Comment by ESRogs on The Future of Nuclear Arms Control? · 2021-02-15T22:01:11.882Z · LW · GW

The difference between a TSAR bomb (or its modern equivalent) and the lowest settings of a mini-nuke is still an order of magnitude larger than the difference between the conventional “mother of all bombs” and a hand grenade. The Beirut explosion last year was the size of the hand grenade blast in this analogy

I didn't quite understand the last sentence here. Are you saying A) that the Beirut explosion was about the same size as a mini-nuke blast would be, or that B) MOAB : hand grenade :: TSAR bomb : Beirut explosion? (In which case the Beirut explosion would be larger than a mini-nuke explosion, if your claim about relative differences in the first sentence is correct.)

In other words, I take the first part of what you wrote to be saying that (TSAR bomb / mini-nuke) > (MOAB / grenade), but then I'm not sure whether the second part is saying that A) (TSAR bomb / Beirut explosion) = (TSAR bomb / mini-nuke), or B) (TSAR bomb / Beirut explosion) = (MOAB / grenade).

Is one of either A or B correct? (Or did you mean something else entirely?)
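(To make the two readings concrete, here are the ratios with some rough public yield figures plugged in. The specific numbers are my own approximations, not anything from the parent comment.)

```python
# Very rough TNT-equivalent yields, in tonnes (all approximate, for illustration only).
yields_t = {
    "tsar_bomba":    50_000_000,   # ~50 Mt
    "mini_nuke_low": 10,           # lowest dial-a-yield settings, order of ~10 t
    "moab":          11,           # GBU-43/B, ~11 t
    "hand_grenade":  0.0002,       # ~0.2 kg of explosive
    "beirut_2020":   1_000,        # estimates cluster around ~1 kt
}

print(yields_t["tsar_bomba"] / yields_t["mini_nuke_low"])   # ~5e6
print(yields_t["moab"] / yields_t["hand_grenade"])          # ~5.5e4
print(yields_t["tsar_bomba"] / yields_t["beirut_2020"])     # ~5e4
print(yields_t["beirut_2020"] / yields_t["mini_nuke_low"])  # ~1e2
```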

Comment by ESRogs on Expressive Vocabulary · 2021-01-31T11:09:41.107Z · LW · GW

sometimes people think of things as being either X or Y, and then learn an argument for why this dichotomy doesn't make sense. As a result, they might reject the dichotomy entirely

This reminds me of the Fallacy of Gray.

Comment by ESRogs on Dario Amodei leaves OpenAI · 2021-01-31T00:00:55.529Z · LW · GW

I'm definitely left wondering what AI Alignment research is left at OpenAI

You may be interested to know that Jan Leike recently joined OpenAI and will lead their alignment team.

Comment by ESRogs on ESRogs's Shortform · 2021-01-30T23:55:39.505Z · LW · GW

Suppose you want to bet on interest rates rising -- would buying value stocks and shorting growth stocks be a good way to do it? (With the idea being that, if rates rise, future earnings will be discounted more and present earnings valued relatively more highly.)

And separately from whether long-value-short-growth would work, is there a more canonical or better way to bet on rates rising?

Just shorting bonds, perhaps? Is that the best you can do?
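To spell out the discounting intuition from the first paragraph, here's a toy DCF sketch (the cash-flow profiles are made up purely for illustration):

```python
# A rate rise hurts back-loaded ("growth") cash flows more than front-loaded ("value")
# ones, because later cash flows get discounted more heavily.
def present_value(cash_flows, rate):
    return sum(cf / (1 + rate) ** t for t, cf in enumerate(cash_flows, start=1))

value_like  = [10, 10, 10, 10, 10]   # earnings mostly now (assumed profile)
growth_like = [0, 0, 0, 0, 50]       # earnings mostly later (assumed profile)

for name, cfs in [("value", value_like), ("growth", growth_like)]:
    pv_low, pv_high = present_value(cfs, 0.02), present_value(cfs, 0.05)
    print(name, f"{pv_high / pv_low - 1:+.1%}")
# value  -8.1%   <- loses less when rates go from 2% to 5%
# growth -13.5%  <- longer "duration", loses more
```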

(Crossposted from Twitter)

Comment by ESRogs on How likely is it that SARS-CoV-2 originated in a laboratory? · 2021-01-26T00:25:54.735Z · LW · GW

Got it, thanks for the clarification.

Comment by ESRogs on Grokking illusionism · 2021-01-26T00:24:55.768Z · LW · GW

Hmm, maybe it's worth distinguishing two things that "mental states" might mean:

  1. intermediate states in the process of executing some cognitive algorithm, which have some data associated with them
  2. phenomenological states of conscious experience

I guess you could believe that a p-zombie could have #1, but not #2.

Comment by ESRogs on Grokking illusionism · 2021-01-26T00:16:57.218Z · LW · GW

Consciousness/subjective experience describes something that is fundamentally non-material.

More non-material than "love" or "three"?

It makes sense to me to think of "three" as being "real" in some sense independently from the existence of any collection of three physical objects, and in that sense having a non-material existence. (And maybe you could say the same thing for abstract concepts like "love".)

And also, three-ness is a pattern that collections of physical things might correspond to.

Do you think of consciousness as being non-material in a similar way? (Where the concept is not fundamentally a material thing, but you can identify it with collections of particles.)

Comment by ESRogs on Grokking illusionism · 2021-01-26T00:02:01.158Z · LW · GW

If you just assume that there's no primitive for consciousness, I would agree that the argument for illusionism is extremely strong since [unconscious matter spontaneously spawning consciousness] is extremely implausible.

How is this implausible at all? All kinds of totally real phenomena are emergent. There's no primitive for temperature, yet it emerges out of the motions of many particles. There's no primitive for wheel, but round things that roll still exist.

Maybe I've misunderstood your point though?

Comment by ESRogs on Grokking illusionism · 2021-01-25T23:52:49.172Z · LW · GW

This is a familiar dialectic in philosophical debates about whether some domain X can be reduced to Y (meta-ethics is a salient comparison to me). The anti-reductionist (A) will argue that our core intuitions/concepts/practices related to X make clear that it cannot be reduced to Y, and that since X must exist (as we intuitively think it does), we should expand our metaphysics to include more than Y. The reductionist (R) will argue that X can in fact be reduced to Y, and that this is compatible with our intuitions/concepts/everyday practices with respect to X, and hence that X exists but it’s nothing over and above Y. The nihilist (N), by contrast, agrees with A that it follows from our intuitions/concepts/practices related to X that it cannot be reduced to Y, but agrees with R that there is in fact nothing over and above Y, and so concludes that there is no X, and that our intuitions/concepts/practices related to X are correspondingly misguided. Here, the disagreement between A vs. R/N is about whether more than Y exists; the disagreement between R vs. A/N is about whether a world of only Y “counts” as a world with X. This latter often begins to seem a matter of terminology; the substantive questions have already been settled.

Is this a well-known phenomenon? I think I've observed this dynamic before and found it very frustrating. It seems like philosophers keep executing the following procedure:

  1. Take a sensible, but perhaps vague, everyday concept (e.g. consciousness, or free will), and give it a precise philosophical definition, but bake some dubious, anti-reductionist assumptions into the definition.
  2. Discuss the concept in ways that conflate the everyday concept and the precise philosophical one. (Failing to make clear that the philosophical concept may or may not be the best formalization of the folk concept.)
  3. Realize that the anti-reductionist assumptions were false.
  4. Claim that the everyday concept is an illusion.
  5. Generate confusion (along with full employment for philosophers?).

If you'd just said that the precisely defined philosophical concept was a provisional formalization of the everyday concept in the first place, then you wouldn't have to claim that the everyday concept was an illusion once you realized that your formalization was wrong!

Comment by ESRogs on Grokking illusionism · 2021-01-25T23:32:10.900Z · LW · GW

No one ever thought that phenomenal zombies lacked introspective access to their own mental states

I'm surprised by this. I thought p-zombies were thought not to have mental states.

I thought the idea was that they replicated human input-output behavior while having "no one home". Which sounds to me like not having mental states.

If they actually have mental states, then what separates them from the rest of us?

Comment by ESRogs on How likely is it that SARS-CoV-2 originated in a laboratory? · 2021-01-25T22:09:19.474Z · LW · GW

This may be a bit of a pedantic comment, but I'm a bit confused by how your comment starts:

I've done over 200 hours of research on this topic and have read basically all the sources the article cites. That said, I don't agree with all of the claims.

The "That said, ..." part seems to imply that what follows is surprising. As though the reader expects you to agree with all the claims. But isn't the default presumption that, if you've done a whole bunch of research into some controversial question, the evidence is mixed?

In other words, when I hear, "I've done over 200 hours of research ... and have read ... all the sources", I think, "Of course you don't agree with all the claims!" And it kind of throws me off that you seem to expect your readers to think that you would agree with all the claims.

Is the presumption that someone would only spend a whole bunch of hours researching these claims if they thought they were highly likely to be true? Or that only an uncritical, conspiracy-theory true believer would put so much time into looking into it?

Comment by ESRogs on The Box Spread Trick: Get rich slightly faster · 2021-01-21T23:21:09.778Z · LW · GW

I used SPX Dec '22, 2700/3000 (S&P was closer to those prices when I entered the position). And smart routing I think. Whatever the default is. I didn't manually choose an exchange.

Comment by ESRogs on The Box Spread Trick: Get rich slightly faster · 2021-01-21T17:01:46.080Z · LW · GW

I've been able to get closer to 0.6% on IB. I've done that by entering the order at a favorable price and then manually adjusting it by a small amount once a day until it gets filled. There's probably a better way to do it, but that's what's worked for me.

Comment by ESRogs on Coherent decisions imply consistent utilities · 2021-01-14T21:33:42.180Z · LW · GW

That makes a lot of sense to me. Good points!

Comment by ESRogs on Coherent decisions imply consistent utilities · 2021-01-13T20:13:53.944Z · LW · GW

It seems to me that there has been enough unanswered criticism of the implications of coherence theorems for making predictions about AGI that it would be quite misleading to include this post in the 2019 review.

If the post is the best articulation of a line of reasoning that has been influential in people's thinking about alignment, then even if there are strong arguments against it, I don't see why that means the post is not significant, at least from a historical perspective.

By analogy, I think Searle's Chinese Room argument is wrong and misleading, but I wouldn't argue that it shouldn't be included in a list of important works on philosophy of mind.

Would you (assuming you disagreed with it)? If not, what's the difference here?

(Put another way, I wouldn't think of the review as a collection of "correct" posts, but rather as a collection of posts that were important contributions to our thinking. To me this certainly qualifies as that.)

Comment by ESRogs on Coherent decisions imply consistent utilities · 2021-01-13T20:04:10.484Z · LW · GW

On the review: I don't think this post should be in the Alignment section of the review, without a significant rewrite / addition clarifying why exactly coherence arguments are useful or important for AI alignment.

Assuming that one accepts the arguments against coherence arguments being important for alignment (as I tentatively do), I don't see why that means this shouldn't be included in the Alignment section.

The motivation for this post was its relevance to alignment. People think about it in the context of alignment. If subsequent arguments indicate that it's misguided, I don't see why that means it shouldn't be considered (from a historical perspective) to have been in the alignment stream of work (along with the arguments against it).

(Though, I suppose if there's another category that seems like a more exact match, that seems like a fine reason to put it in that section rather than the Alignment section.)

Does that make sense? Is your concern that people will see this in the Alignment section, and not see the arguments against the connection, and continue to be misled?

Comment by ESRogs on ESRogs's Shortform · 2021-01-13T19:33:00.759Z · LW · GW

The workflow I've imagined is something like:

  1. human specifies function in English
  2. AI generates several candidate code functions
  3. AI generates test cases for its candidate functions, and computes their results
  4. AI formally analyzes its candidate functions and looks for simple interesting guarantees it can make about their behavior
  5. AI displays its candidate functions to the user, along with a summary of the test results and any guarantees about the input output behavior, and the user selects the one they want (which they can also edit, as necessary)

In this version, you go straight from English to code, which I think might be easier than from English to formal specification, because we have lots of examples of code with comments. (And I've seen demos of GPT-3 doing it for simple functions.)

I think some (actually useful) version of the above is probably within reach today, or in the very near future.
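A minimal skeleton of what I have in mind, with every interesting step stubbed out (the function names are mine and the model calls are placeholders, not a real API):

```python
# Skeleton of the workflow above; every step is a stub (placeholder names, no real model calls).
from dataclasses import dataclass

@dataclass
class Candidate:
    code: str
    test_results: list   # (input, output) pairs from step 3
    guarantees: list     # human-readable properties found in step 4

def generate_candidates(english_spec: str, n: int = 3) -> list:
    """Step 2: ask a code model for n candidate implementations (stubbed)."""
    return [f"def f(x):  # candidate {i} for: {english_spec}\n    ..." for i in range(n)]

def generate_and_run_tests(code: str) -> list:
    """Step 3: have the model propose test inputs, then run the candidate on them (stubbed)."""
    return []

def analyze(code: str) -> list:
    """Step 4: look for simple formal guarantees about input/output behavior (stubbed)."""
    return []

def assist(english_spec: str) -> Candidate:
    candidates = [
        Candidate(code, generate_and_run_tests(code), analyze(code))
        for code in generate_candidates(english_spec)
    ]
    # Step 5: in a real tool the user would inspect these and pick one; here we just take the first.
    return candidates[0]

print(assist("return the largest element of a list").code)
```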

Comment by ESRogs on ESRogs's Shortform · 2021-01-13T18:38:55.948Z · LW · GW

Mostly it just seems significant in the grand scheme of things. Our mathematics is going to become formally verified.

In terms of actual consequences, it's maybe not so important on its own. But putting a couple pieces together (this, Dan Selsam's work, GPT), it seems like we're going to get much better AI-driven automated theorem proving, formal verification, code generation, etc relatively soon.

I'd expect these things to start meaningfully changing how we do programming sometime in the next decade.

Comment by ESRogs on ESRogs's Shortform · 2021-01-13T07:04:22.530Z · LW · GW

One of the most important things going on right now, that people aren't paying attention to: Kevin Buzzard is (with others) formalizing the entire undergraduate mathematics curriculum in Lean. (So that all the proofs will be formally verified.)
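For a sense of what this formalization looks like concretely, here's a toy Lean snippet (my own example, not from Buzzard's project): the proof is a small term that the system checks against the statement.

```lean
-- A trivial statement plus a machine-checked proof term (Lean 3 syntax).
theorem my_add_comm (a b : ℕ) : a + b = b + a :=
nat.add_comm a b
```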

See one of his talks here: 

Comment by ESRogs on Imitative Generalisation (AKA 'Learning the Prior') · 2021-01-13T00:24:19.010Z · LW · GW

FYI it looks like the footnote links are broken. (Linking to "about:blank...")

Comment by ESRogs on Science in a High-Dimensional World · 2021-01-12T23:08:24.784Z · LW · GW

https://www.preposterousuniverse.com/blog/2016/07/18/space-emerging-from-quantum-mechanics/

Comment by ESRogs on Science in a High-Dimensional World · 2021-01-12T23:07:40.382Z · LW · GW

I'm not sure whether it's the standard view in physics, but Sean Carroll has suggested that we should think of locality in space as deriving from entanglement. (With space itself as basically an emergent phenomenon.) And I believe he considers this a driving principle in his quantum gravity work.

Comment by ESRogs on Fourth Wave Covid Toy Modeling · 2021-01-10T08:38:55.516Z · LW · GW

Based on what you've said, Rt never goes below one

You're saying nostalgebraist says Rt never goes below 1?

I interpreted "R is always ~1 with noise/oscillations" to mean that it could go below 1 temporarily. And that seems consistent with the current London data. No?

Comment by ESRogs on Fourth Wave Covid Toy Modeling · 2021-01-08T05:29:50.658Z · LW · GW

So you're saying that you think that a more infectious virus will not increase infections by as high a percentage of otherwise expected infections under conditions with more precautions, versus conditions with less precautions? What's the physical mechanism there?

Wouldn't "the fractal nature of risk taking" cause this? If some people are taking lots of risk, but they comply with actually strict lockdowns, then those lockdowns would work better than might otherwise be expected. No?

Comment by ESRogs on ESRogs's Shortform · 2021-01-02T03:19:17.843Z · LW · GW

See also his recent paper, which seems like an important contribution towards using ML for symbolic / logical reasoning: Universal Policies for Software-Defined MDPs.

Comment by ESRogs on ESRogs's Shortform · 2021-01-02T03:16:08.402Z · LW · GW

If I've heard him right, it sounds like Dan Selsam (NeuroSAT, IMO Grand Challenge) thinks ML systems will be able to solve IMO geometry problems (though not other kinds of problems) by the next IMO.

(See comments starting around 38:56.)

 

Comment by ESRogs on SpaceX will have massive impact in the next decade · 2021-01-01T04:01:12.698Z · LW · GW

Sounds like you're thinking along the same lines as I was.

Comment by ESRogs on SpaceX will have massive impact in the next decade · 2020-12-31T23:54:23.665Z · LW · GW

When Tungsten rods are dropped from space onto earth they manage to store a lot of kinetic energy because they have a very high boiling point. Dropping tungsten rods from space can release as much energy as nuclear weapons without the nuclear fallout.

Doesn't that energy ultimately come from the propellant used to get the rods to orbit? Wouldn't it be more cost effective to just use the propellant itself as the explosive?

Is the advantage of the rod that it's easier to get it to the target than it would be to get the propellant there?
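A back-of-the-envelope comparison, with rough numbers of my own (not from the parent comment), of the energy delivered by the rod vs. the energy spent getting it up there:

```python
# Rough energy accounting per kg of rod; all figures are order-of-magnitude assumptions.
v_impact = 8_000                       # m/s, roughly orbital speed if the rod isn't slowed much
ke_per_kg = 0.5 * v_impact**2          # J delivered on impact, per kg of rod
tnt_per_kg = 4.2e6                     # J per kg of TNT
propellant_per_kg_to_orbit = 30        # kg of propellant per kg delivered to LEO, very roughly
chem_energy_per_kg_propellant = 1e7    # J per kg, rough figure for kerosene + LOX

print(ke_per_kg / 1e6)                 # ~32 MJ per kg of rod
print(ke_per_kg / tnt_per_kg)          # ~7.6x TNT, per kg of rod
print(propellant_per_kg_to_orbit * chem_energy_per_kg_propellant / ke_per_kg)  # ~9x
```

So on these (very rough) numbers the rod delivers a few times TNT per kilogram, but the propellant burned to get it there contains several times more chemical energy again.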

Comment by ESRogs on My Model of the New COVID Strain and US Response · 2020-12-27T18:34:55.557Z · LW · GW

Rt_new = 1.7 is an entirely different case. Suppressing it would require the sort of lockdown that would yield Rt = 0.6 for the old strain, a number that has never been reached by any US state for any amount of time. I see no way in hell that Americans would agree to a lockdown much stricter than any we’ve had so far, especially after they’ve been promised that the worst is behind them.

As mentioned on Twitter, I don't buy this. I think we'd get more infections and deaths, but once hospitals are overwhelmed, society's negative feedback loop will kick in and we'll get R back close to 1.

I believe that lots of individuals could be a lot more cautious than they already are, and I don't think people will stand for hospitals being overwhelmed.
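(For reference, the arithmetic behind the quoted numbers, on the assumption that the new strain is about 1.7x as transmissible under identical conditions:

$R_t^{new} \approx 1.7 \, R_t^{old} \;\Rightarrow\; R_t^{new} < 1$ requires $R_t^{old} < 1/1.7 \approx 0.6$,

which is where the "Rt = 0.6 for the old strain" figure comes from.)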

Comment by ESRogs on 2020 AI Alignment Literature Review and Charity Comparison · 2020-12-22T17:43:09.095Z · LW · GW

This is commonly said on the basis of his $1b pledge

Wasn't it supposed to be a total of $1b pledged, from a variety of sources, including Reid Hoffman and Peter Thiel, rather than $1b just from Musk?

EDIT: yes, it was.

Sam, Greg, Elon, Reid Hoffman, Jessica Livingston, Peter Thiel, Amazon Web Services (AWS), Infosys, and YC Research are donating to support OpenAI. In total, these funders have committed $1 billion, although we expect to only spend a tiny fraction of this in the next few years.

https://openai.com/blog/introducing-openai/

Comment by ESRogs on 2020 AI Alignment Literature Review and Charity Comparison · 2020-12-22T17:41:43.378Z · LW · GW

the only entities that are listed as contributing money or loans are Sam Altman, Y Combinator Research, and OpenAI LP

Possible that he funded OpenAI LP? Or was that only created later, and funded by Microsoft and other non-founding investors?

Comment by ESRogs on Extrapolating GPT-N performance · 2020-12-20T23:19:10.403Z · LW · GW

Ah, gotcha. Thanks!

Comment by ESRogs on Homogeneity vs. heterogeneity in AI takeoff scenarios · 2020-12-20T19:14:02.326Z · LW · GW

Key point for those who don't click through (that I didn't realize at first) -- both types turned out to work and were in fact used. The gun-type "Little Boy" was dropped on Hiroshima, and the implosion-type "Fat Man" was dropped on Nagasaki.

Comment by ESRogs on Homogeneity vs. heterogeneity in AI takeoff scenarios · 2020-12-20T18:59:40.700Z · LW · GW

For those organizations that do choose to compete... I think it is highly likely that they will attempt to build competing systems in basically the exact same way as the first organization did

...

It's unlikely for there to exist both aligned and misaligned AI systems at the same time

If the first group sunk some cost into aligning their system, but that wasn't integral to its everyday task performance, wouldn't a second competing group be somewhat likely to skimp on the alignment part?

It seems like this calls into question the claim that we wouldn't get a mix of aligned and misaligned systems.

Do you expect it to be difficult to disentangle the alignment from the training, such that the path of least resistance for the second group will necessarily include doing a similar amount of alignment?

Comment by ESRogs on Extrapolating GPT-N performance · 2020-12-20T17:57:44.930Z · LW · GW

Thanks. Still not sure I understand though:

> It's just a round-about way of saying that the upper end of s-curves (on a linear-log scale) eventually look roughly like power laws (on a linear-linear scale).

Doesn't the upper end of an s-curve plateau to an asymptote (on any scale), which a power law does not (on any scale)?

Comment by ESRogs on Extrapolating GPT-N performance · 2020-12-20T07:47:55.131Z · LW · GW

Note that, if the network converges towards the irreducible error like a negative exponential (on a plot with reducible error on the y-axis), it would be a straight line on a plot with the logarithm of the reducible error on the y-axis.

Was a little confused by this note. This does not apply to any of the graphs in the post, right? (Since you plot the straight reducible error on the y-axis, and not its logarithm, as I understand.)
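(Spelling out the quoted claim, with $x$ standing for whatever is on the x-axis: if the reducible error decays as

$e(x) = A e^{-kx}$, then $\log e(x) = \log A - kx$,

i.e. a straight line only when the y-axis shows the logarithm of the reducible error rather than the reducible error itself.)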