Posts

Comments

Comment by Icehawk78 on Replicating the replication crisis with GPT-3? · 2020-07-23T00:15:22.086Z · LW · GW

The main thing I've noticed about most of the posts talking about its capabilities (or even what theoretical future entities might be capable of, based on a biased assumption of this version's capabilities) is that people are trying to figure out how to get it to succeed, rather than trying to get it to fail in interesting and informative ways.

For example, one of the evaluations I've seen was having it do multi-digit addition, and discussing various tricks to improve its success rate, going off the assumption that if it can fairly regularly do 1-3 digit addition, that's evidence of it learning arithmetic. One null hypothesis against this would be "in its 350-700GB model, it has stored lookup tables for 1-3 digit addition, which it will semi-frequently end up engaging."
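As a rough way to see how large such a lookup table would have to be, here is a minimal sketch that counts the entries needed for addition of operands up to a given number of digits. The function name is made up for illustration, and it only counts operand pairs; how many bytes a model would need per "entry" is not something this sketch tries to estimate.

```python
def lookup_table_entries(max_digits: int) -> int:
    """Count ordered (a, b) pairs where both operands have at most max_digits digits."""
    operands = 10 ** max_digits  # the values 0 .. 10**max_digits - 1
    return operands * operands


for digits in (1, 2, 3, 5):
    print(f"up to {digits} digits: {lookup_table_entries(digits):,} entries")
```

The count grows as 10^(2n), so a table for up to 3 digits has a million entries, while one for up to 5 digits has ten billion.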

The evaluation against a lookup table was to compare its success rate at 5+ digit numbers, and show that storing a lookup table for those numbers would take up an increasingly large portion of its model, which is then taken to imply that it must be capable, sometimes, of doing math (and thus the real trick is in convincing it to actually do that). However, this ignores significantly more probable outcomes, and also doesn't look terribly closely at what the incorrect outputs are for large-digit addition, to try and evaluate *what* exactly it is the model did wrong (because the outputs obviously aren't random).
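One way to look more closely at the wrong answers would be to sort them into buckets instead of only counting them. The sketch below is purely illustrative: the categories (dropped carries, wrong length, number of wrong digits) are my own guesses at failure modes worth checking, not results from any actual evaluation, and `classify` is a hypothetical helper.

```python
def add_without_carry(a: int, b: int) -> int:
    """Add digit by digit, discarding every carry (e.g. 58 + 67 -> 15)."""
    result, place = 0, 1
    while a > 0 or b > 0:
        result += ((a % 10 + b % 10) % 10) * place
        a //= 10
        b //= 10
        place *= 10
    return result


def classify(a: int, b: int, answer: int) -> str:
    """Label a model's answer to a + b with a coarse error category."""
    expected = a + b
    if answer == expected:
        return "correct"
    if answer == add_without_carry(a, b):
        return "dropped all carries"
    if len(str(answer)) != len(str(expected)):
        return "wrong length"
    # Same length as the true sum: count the positions that differ.
    wrong = sum(x != y for x, y in zip(str(answer), str(expected)))
    return f"{wrong} digit(s) wrong"


print(classify(12345, 67890, 80235))  # correct
print(classify(12345, 67890, 79135))  # dropped all carries
print(classify(12345, 67890, 80245))  # 1 digit(s) wrong
```

If most errors landed in one bucket, that would say a lot more about what the model is doing than a bare success rate.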

I've also seen very little by the way of discussing what the architectural limitations of its capabilities are, despite them being publicly known; for example, any problem requiring deep symbolic recursion is almost certainly not possible simply due to the architecture of the model - it's doing a fixed number of matrix multiplications, and can't, as the result of any of those, step backwards through the transformer and reapply a particular set of steps again. On the plus side, this also means you can't get it stuck in an infinite loop before receiving the output.
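To make the contrast concrete, here is a toy sketch, not a model of GPT-3 itself: a "forward pass" that applies a fixed list of layers exactly once each, next to an ordinary loop whose iteration count depends on its input. The layers are arbitrary stand-in functions chosen for illustration, and the comparison only covers a single pass through the stack.

```python
from typing import Callable, List

Layer = Callable[[float], float]


def fixed_depth_forward(x: float, layers: List[Layer]) -> float:
    """Apply each layer exactly once, in order; nothing can send control backwards."""
    for layer in layers:
        x = layer(x)
    return x


def collatz_steps(n: int) -> int:
    """An input-dependent loop: the number of iterations is not known in advance."""
    steps = 0
    while n != 1:
        n = n // 2 if n % 2 == 0 else 3 * n + 1
        steps += 1
    return steps


layers: List[Layer] = [lambda v: v * 2, lambda v: v + 1, lambda v: v * v]
print(fixed_depth_forward(3.0, layers))  # always exactly len(layers) steps
print(collatz_steps(6), collatz_steps(27))  # step count varies with the input
```

The first function always terminates after `len(layers)` steps regardless of input, which is both the limitation and the "can't get stuck" upside.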

Comment by Icehawk78 on Fresh Bread · 2020-07-21T23:27:48.923Z · LW · GW

One recommendation I would add to your recipe - the first batch of steps for the sponge says "start with the water" and then uses a numbered list. It would likely be more readable and easier to follow if you added a new first step of "put the water in the bowl". It took me a few passes where I was trying to figure out when you added the water before I realized I missed it outside of the steps.

Comment by Icehawk78 on Why I haven't signed up for cryonics · 2014-01-16T14:56:02.889Z · LW · GW

I can't say on behalf of advancedatheist, but others who I've heard make similar statements generally seem to base them on a manner of factor analysis; namely, assuming that you're evaluating a statement by a self-proclaimed transhumanist predicting the future development of some technology that currently does not exist, the factor which best predicts what date that technology will be predicted as is the current age of the predictor.

As I've not read much transhumanist writing, I have no real way to evaluate whether this is an accurate analysis, or simply cherry picking examples of the most egregious/popularly published examples (I frequently see Kurzweil and... mostly just Kurzweil, really, popping up when I've heard this argument before).

[As an aside, I just now, after finishing this comment, made the connection that you're the author that he cited as the example, rather than just a random commenter, so I'd assume you're much more familiar with the topic at hand than me.]

Comment by Icehawk78 on Why I haven't signed up for cryonics · 2014-01-14T18:31:48.777Z · LW · GW

Presumably, the implication is that these predictions are not based on facts, but had their bottom line written first, and then everything else added later.

[I make no endorsement in support or rejection of this being a valid conclusion, having given it very little personal thought, but this being the issue that advancedatheist was implying seems fairly obvious to me.]

Comment by Icehawk78 on Prisoner's Dilemma (with visible source code) Tournament · 2013-06-06T14:47:06.377Z · LW · GW

Except that over some threshold, any Anti-Absolutism bots (which have some way of "escaping" while still containing the same first 117 characters, like having a C preprocessor directive that redefines TRUE to equal FALSE) would necessarily be superior.

Comment by Icehawk78 on LW Women- Minimizing the Inferential Distance · 2012-11-27T13:54:42.160Z · LW · GW

Personally, I (and I assume many others) would have a drastically different response than any of these four.

Parent: You need to [cook/clean, job/dress well], or what person would want to marry you? Child: Why should I learn these skills for the benefit of someone else, rather than for myself?

Regardless of the interest or not in marriage, these are skills/actions that are useful for anyone, marriage-oriented or not, to have, simply to live as a socially well-rounded adult. (Obviously, alternate options are available, such as getting such a well-paying job that you can pay for a maid/chef, or some alternate situation in which "getting a good job" is unnecessary to your well-being, as well.)

Comment by Icehawk78 on Kurzweil's predictions: good accuracy, poor self-calibration · 2012-07-12T13:55:59.584Z · LW · GW

These don't use any form of natural language recognition - they work by having very rigidly defined responses that they can interpret (i.e. "say 'one' for hard to recognize or easily obfuscated department").

Comment by Icehawk78 on Kurzweil's predictions: good accuracy, poor self-calibration · 2012-07-11T14:06:59.195Z · LW · GW

It seems strange to call Siri ubiquitous when smartphone penetration among teenagers is less than 50%.

It also seems strange to call Siri ubiquitous when, on top of that, iOS only has (as of March 2012) between 30% and 45% market share (depending on how you measure it), which includes numerous models of iPhone that do not have/support Siri, as well as the numerous people who have access to, but don't primarily use, Siri on their iPhones. (In my biased sample of software developer/cubicle dweller coworkers, as well as friends and family, I'd estimate maybe 5-10% of those I know who have iPhones with Siri actually use Siri on a daily basis.)

Does this mean virtual experience software is more popular than the others, or that it's the most popular type of digital entertainment when you look beyond the others?

By my reading, the statement is saying that music, pictures and movies are more popular than "virtual experience software", and that VES is the next most popular.

Additionally, to respond to Stuart_Armstrong below, without a direct reference, I'd imagine that the Economist simply took into account popularity by sales data, which would ignore things like Pandora/Spotify/YouTube/Reddit usage/browsing that may happen significantly more than paid consumption of music/video (at least for certain segments of society with ubiquitous internet access).

Comment by Icehawk78 on [deleted post] 2012-05-16T15:34:04.615Z

This would be close to ideal, regardless of whether it was the intended meaning or not. (I'd prefer simply removing the "This will be deleted" aspect, unless after calming down he no longer feels apologetic.)

Comment by Icehawk78 on [deleted post] 2012-05-16T15:25:07.516Z

For reference: It is currently 11:20 EST, Wednesday, May 16th. The comment to the main article saying "I've been a jerk, will delete this in 48 hours" was posted at 04:05 EST, Saturday, May 12th, approximately 103 hours ago.
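The gap between those two timestamps can be checked directly. This is just a quick sketch using the times quoted above; both are in the same timezone, so naive datetimes are enough, and the variable names are mine.

```python
from datetime import datetime

posted = datetime(2012, 5, 12, 4, 5)         # 04:05, Saturday, May 12th
checked = datetime(2012, 5, 16, 11, 20)      # 11:20, Wednesday, May 16th

elapsed_hours = (checked - posted).total_seconds() / 3600
print(elapsed_hours)  # 103.25
```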

I do not support deleting articles which have been posted and to which you've received a negative response, but I also wanted to point out that the last statement made is another factual error. My preference, if anything is done to correct this, would be simply to remove the "This will be deleted in 48 hours" and possibly replace it with an apology, if Aurini feels apologetic, or just leave the "I've acted like a jerk" on its own.

Comment by Icehawk78 on [Poll] Method of Recruitment · 2012-02-07T04:00:17.746Z · LW · GW

I came via MoR which was posted in an IRC chat by [random internet person] for [random unrelated activity]. I've since gotten at least three others (two females, one male) to read MoR, of whom one female (my SO) has come to an LW meetup but doesn't read LW itself much, and one other may start reading the Sequences in the somewhat near future.

Comment by Icehawk78 on Meetup : Interest in Reason Rally meetup? · 2012-02-05T01:41:29.217Z · LW · GW

Oh, that's right, I forgot that a few people were already going.

Comment by Icehawk78 on Meetup : Interest in Reason Rally meetup? · 2012-02-04T17:03:16.317Z · LW · GW

For reference to other commenters, I think the majority of the Ohio LW meetup group decided against going ourselves, due to a lack of interest in this as a specific event, though you're obviously still free to go.

Comment by Icehawk78 on Help! Name suggestions needed for Rationality-Inst! · 2012-01-29T23:43:57.783Z · LW · GW

I'm not sure that "Bayes" or "Bayesian" has a strong public association with anything unless you're already interested in statistics. I've used it in several discussions and every time had to give a quick explanation of what it meant. (Good practice for honing my explanations and reinforcing the concept in my own brain, as well.)

Comment by Icehawk78 on Meetup : Columbus or Cincinnati Meetup · 2011-12-27T15:53:52.429Z · LW · GW

I mostly read, rather than comment, but I'm also in the greater Cincinnati area.

Comment by Icehawk78 on Welcome to Less Wrong! · 2011-12-19T13:30:52.140Z · LW · GW

I'm curious which modifications EY has proposed (specifically) that you don't want made, unless it's just generically the suggestion that people could be improved in any ways whatsoever and your preference is to not have any modifications made to yourself (in a "be true to yourself" manner, perhaps?) that you didn't "choose".

If you could be convinced that a given change to "who you are" would necessarily be an improvement (by your own standards, not externally imposed standards, since you sound very averse to such restrictions) such as "being able to think faster" or "having taste preferences for foods which are most healthy for you" (to use very primitive off-the-cuff examples), and then given the means to effect these changes on yourself, would you choose to do so, or would you be averse simply on the grounds of "then I wouldn't be 'me' anymore" or something similar?

Comment by Icehawk78 on An akrasia case study · 2011-12-19T13:17:13.587Z · LW · GW

Good to know, thanks.

And as a follow up, in the spirit of "what worked for me may not work for anyone else, but I publish it here in the hope that we can pull some good ideas out of it", as a mid-20s adult who has dealt with similar situations with undesirable projects and the like, I would add that it may be worth talking to a psychiatrist to see if it could be helpful for you again.

Comment by Icehawk78 on An akrasia case study · 2011-12-12T14:01:45.369Z · LW · GW

Were you taking any sort of ADD-type medication (Ritalin/Adderall/Strattera) during this time or during the "lost" three weeks? This sounds very similar to a situation I was in earlier this year, and while medication was helpful for me, I'm curious if this is the sort of issue that can "creep" back in if you're not starting fresh (and thus not getting the "full kick", due to tolerance or something similar)?

Comment by Icehawk78 on Against WBE (Whole Brain Emulation) · 2011-11-28T22:36:29.556Z · LW · GW

Ah, the RSS feed did not display that. I too was unsure at first what this was until I read the first paragraph or two.

Comment by Icehawk78 on An Alien God · 2010-12-30T15:43:06.438Z · LW · GW

That's kind of the point of this article. Evolution doesn't "choose" something, it just has changes happen, and if, like a rattle happening to scare off threats or reduce lethal damage, it aids survival, then it increases in the population.