[prize] Spaced Repetition literature review

post by jsalvatier · 2011-06-07T03:28:55.842Z · LW · GW · Legacy · 35 comments

Contents

35 comments

EDIT: I am canceling this contest because I feel that the structure of the incentives were poorly thought out (see gwern's comments). I will be posting a new better structured contest in the near future (now posted). If you feel this is unfair or otherwise feel slighted, please contact me at my username at gmail.com.

 

I'm interested in making projects happen on Less Wrong. In order to find out what works and to inspire others to try things too, I'm sponsoring the following small project:

Spaced Repetition is often mentioned on on Less Wrong as a technique for remembering things. I've started using Anki and it certainly seems to be useful. However, I haven't seen a good summary of evidence on Spaced Repetition. 

I hereby offer a prize to the first person to submit a good summary of the evidence on Spaced Repetition to the main page. The winner will get the prize, currently: $265 + 40 to charity (see comments)

The summary should address at least the following questions:

  • What spacing is best?
  • How much does spaced repetition actually help memory?
  • Does spaced repetition have hidden benefits or costs?
  • Does the effectiveness vary across domains? How much?
  • Is there research on the kinds of questions that work best?
  • What questions do researchers think are most important?
  • Is there any interesting ongoing research? If so, what is it on?
  • What, if any, questions do researchers think it is important to answer? Are there other unanswered questions that would jump out at a smart person?
  • What does spaced repetition not do that people might expect it to?
The post should summarize the state of current evidence and provide citations to back up the claims in the article.
If you think you would benefit from the result of this project, please add to the prize! You can contribute to the prize on the ChipIn page.
Whether the summary is 'good' will be judged by me. If there is a serious dispute, I'll agree to dispute resolution by any uninvolved party with more than 5k karma. 
If you have suggestions, questions or comments, please leave them in the comments. 
If you would like to work on this project, please say so in the comments below. Collaboration is encouraged. 
This project is tagged with the 'project' tag and listed on the  Projects wiki  page.

35 comments

Comments sorted by top scores.

comment by Dr_Manhattan · 2011-06-07T12:29:03.235Z · LW(p) · GW(p)

summary of the evidence on Spaced Repetition

Would probably help to quantify that somewhat - as in "evidence that spaced repetition does X" (X being what you want to get from it).

With this framework evidence that SRS does not do X should also get the prize.

Putting it as "claims are made that SRS does X, here is the pro/con evidence" would also be a good format.

As I also consider this topic as very valuable I'll follow Joshua's suggestion and offer $20 to SIAI for this post.

comment by gwern · 2011-06-07T14:25:21.259Z · LW(p) · GW(p)

I'd like to work on this (be paid to substantially improve my http://www.gwern.net/Mnemosyne ? Where's the downside?), but you ask hard questions which would require a lot of research to be done to my satisfaction and during which I could be scooped. I'm not sure a race is the best way to run this.

Replies from: Duke, jsalvatier
comment by Duke · 2011-06-07T15:18:01.509Z · LW(p) · GW(p)

Why do you care if it is done to your satisfaction when the prize is awarded based on other's satisfaction with it?

This is an expected value problem. Decide how much a unit of your time is worth, how much time you are willing to invest and then (the hard part) estimate your likelihood for success.

So, if you value your time at $10/hr, are willing to invest 10 hours and estimate you will win the $155 X% of the time then we get this equation:

P(winning)$amount won - P(losing)$amount not won = initial investment

x$155 - (1-x)$0 = $10/hr * 10 hours

solve for x

x = 64.5%, in other words, to "break even" you need to be sure that if you invest 10 hours of your time in this project that you will win it at least 64.5% of the time.

A race may not be the best way to run this for you, since I suspect that you value your time a high rate relative to the potential payoff. But someone who values their time less (or is more productive than you per unit time) may think a race is a wonderful idea.

Replies from: gwern
comment by gwern · 2011-06-07T16:19:00.582Z · LW(p) · GW(p)

Why do you care if it is done to your satisfaction when the prize is awarded based on other's satisfaction with it?

Why do I care... what I care? Seriously?

But I do agree with your expected utility calculations. The problem is, I can't motivate myself to write under conditions of uncertainty like that. I understand that shouldn't matter, especially since I consider such a project worthwhile even in the absence of $155, but it does.

Replies from: Duke
comment by Duke · 2011-06-07T16:40:57.003Z · LW(p) · GW(p)

Getting it done to other's satisfaction and getting it done to your own are not mutually exclusive. You can work quickly to win the prize and then go back and expand.

Replies from: gwern
comment by gwern · 2011-06-07T19:20:30.465Z · LW(p) · GW(p)

You can work quickly to win the prize and then go back and expand.

No, I can't. That's my problem, as I've been explaining in my comments.

Replies from: jsalvatier
comment by jsalvatier · 2011-06-07T21:59:02.666Z · LW(p) · GW(p)

Perhaps you two should collaborate and split the prize in some way.

Replies from: gwern
comment by gwern · 2011-06-08T00:57:16.875Z · LW(p) · GW(p)

I think it's a bit late for that. But I've PMed Duke.

comment by jsalvatier · 2011-06-07T14:59:08.907Z · LW(p) · GW(p)

Could you specify the drawbacks you see to a race? Does something else seem like it would work better? Does a dated prize (where the best entry is selected on a specific date) or a work contract seem like it would work significantly better? I'm Just Trying Things right now, and I'm not especially attached to this format.

Which are the especially hard questions? What is the primary source of difficulty?

Replies from: steven0461, gwern
comment by steven0461 · 2011-06-07T19:29:33.390Z · LW(p) · GW(p)

Could you specify the drawbacks you see to a race?

Duplicated effort, some of which is presumably invisible.

comment by gwern · 2011-06-07T16:31:22.852Z · LW(p) · GW(p)

From what I remember of the prize literature, prizes work best for well-delineated, objectively judgeable accomplishments. Like putting a rocket into orbit - you can plan out what is necessary, and it's clear when you succeeded. A literature review isn't really either. So I'm not sure a prize in general is succeeded for the task.


My personal problems are that:

  1. I can't bring myself to start a new project like intensively researching spaced repetition to establish exactly how good the evidence is, how effective it is, and what domains it does and does not work in, without having a good idea of what I might receive in turn. Uncertainty is paralyzing for me.
  2. This is especially true because your prize is large - I felt guilty taking a bitcoin from Kiba for Bitcoin is Worse is Better because I knew it was incomplete in several respects, although it's one of my best essays and gave him a lot of traffic; and a bitcoin is only worth $20, so how would I feel delivering something bad for $155? I'd feel really terrible!

The difficult questions are:

  • How much does spaced repetition actually help memory?

    Quantifying the advantage in any more than an ad hoc way like http://www.gwern.net/Mnemosyne#fn3 would be difficult since it requires delving into the specifics of the data, and forgetting curves are non-linear so it's not clear what is even meant by 'how much' - memories per review at 1 year? At 10 years? At 50 years? Integrated over an actuarially projected lifespan?

  • Does spaced repetition have hidden benefits or costs?

    There's probably no direct research on this, so now one has to delve into related research fields and look for very weak evidence or correlations, like maybe high memory correlated with longevity in old people.

  • Is there any interesting ongoing research? If so, what is it on?

    New research tends to be hard to get (= wasted time going around paywalls) and requires following a lot of citations in bibliographies and checking out blogs and emails to really get a sense of where a field is now.

The best way is probably to informally or formally contract with someone like lukeprog (who does this sort of literature review routinely) to conduct the review.

Replies from: jsalvatier, jsalvatier
comment by jsalvatier · 2011-06-07T17:23:51.214Z · LW(p) · GW(p)

Great comment. Perhaps I should be reading the prize literature! I'll think about how to structure this better.

On your personal problems: consider finding someone who has more time and acting as a consultant for them. For example, Duke, who comments elsewhere in this thread.

Just to make sure I'm clear, I'm not trying to gain a detailed or very specific understanding of the literature on SR, but to figure out how confident I should be in recommending SR to others and to myself and what I should recommend others and I do and what I should expect it to help and how much.

I actually hadn't read your page before, so thanks. I actually think your page is a fairly good start.

comment by jsalvatier · 2011-06-07T18:25:07.442Z · LW(p) · GW(p)

Do you have a good starting point for reading on the prize literature?

Replies from: gwern
comment by gwern · 2011-06-07T19:19:25.664Z · LW(p) · GW(p)

Hm, not really. What I would do is google Robin Hanson, Anders Broberg, and LW for mentions of 'prize' and use them as entrance into the academic literature, and spider authors & citations & bibliographies from there.

Replies from: jsalvatier
comment by jsalvatier · 2011-06-07T19:51:07.146Z · LW(p) · GW(p)

ok, thanks.

Replies from: gwern
comment by gwern · 2011-06-18T21:04:28.862Z · LW(p) · GW(p)

It's not a literature review, but http://33bits.org/2011/06/06/the-surprising-effectiveness-of-prizes-as-catalysts-of-innovation/ does link to a McKinsey report on prizes, which ought to be good reading if their reputation can be relied upon.

Replies from: randomwalker
comment by randomwalker · 2011-06-18T22:35:06.007Z · LW(p) · GW(p)

It's funny you linked to my post on prizes, which brought me here, because I'm actually very interested in spaced repetition. In fact just a couple of weeks ago I mentioned it's potentially worth millions of dollars to me: http://arvindn.livejournal.com/132233.html

You guys need to get more rational about the value and importance of what you're doing in this community. You're arguing over trivial amounts of money here. I'd say that any post that's good enough to get voted up to the main page is worth tens of thousands of dollars to mankind. Easily.

I'm putting my money where my mouth is chipping in $100 for the new prize. [Edit. Hmm, I got a paypal receipt but the total on chipin hasn't changed. Maybe just need to wait for a bit.]

Finally, I'm only a sporadic reader, but allow me to express my deep gratitude and admiration for the community.

Replies from: gwern, Bongo
comment by gwern · 2011-06-18T23:11:03.778Z · LW(p) · GW(p)

! You're Arvind? I had no idea. Gosh, the past few days have been interesting for me - first Piotr Wozniak tells me he's incrementally reading 8 articles mentioning or by me, then Danny Hillis replies to a comment of mine over at the Technium, and now here's Arvind Narayanan.

I'd say that any post that's good enough to get voted up to the main page is worth tens of thousands of dollars to mankind. Easily.

On one hand, I agree - if a main page post improves one high-value person's time, that alone is probably a few tens of thousands' worth. The other part of me is cynically remarking that Wozniak and psychologists in general have been advocating spaced repetition for the last >2 decades / centuries (respectively), the research is unassailable, and yet... So what's the marginal return on another literature survey?

A few comments:

  • You mention creatine on your blog. Creatine only works in specific demographics; I don't know whether they include you.
  • You mention dual n-back; the results seem to keep getting weaker since Jaeggi 2008, and Jaeggi 2011 seems extremely weak. I used to be extremely enthusiastic, but as the studies keep coming in... At this point, I'm not sure how much time is worth putting into DNB for just the WM benefits. Probably not much past D5B though.
Replies from: randomwalker
comment by randomwalker · 2011-06-18T23:49:30.217Z · LW(p) · GW(p)

Thanks for the information! That does look pretty damning for dual n-back. As for creatine, I'm not vegetarian but my meat consumption is lower than average, especially when I'm not weight training. At any rate, I stopped taking it after a couple of months because I decided the unknown long-term health risk from increased renal stress wasn't worth it.

comment by Bongo · 2011-06-24T15:27:46.532Z · LW(p) · GW(p)

voted up to the main page

Note that posts aren't voted up to anywhere; they're promoted by editors.

Replies from: randomwalker
comment by randomwalker · 2011-06-26T19:44:46.010Z · LW(p) · GW(p)

Ah, I see, thanks.

comment by lukeprog · 2011-06-08T07:18:21.810Z · LW(p) · GW(p)

For relevant research, see all the sources I listed here.

comment by JoshuaZ · 2011-06-07T04:23:06.423Z · LW(p) · GW(p)

I'm not quite willing to chipin for the individual in question. But I'm happy to donate $20 to the charity of their choice.

Replies from: jsalvatier
comment by jsalvatier · 2011-06-07T14:55:23.893Z · LW(p) · GW(p)

Can you explain your reasons?

Replies from: JoshuaZ
comment by JoshuaZ · 2011-06-07T15:27:03.579Z · LW(p) · GW(p)

It is a compromise between fuzzies v. utilons because I am unable to purchase them separately along with some other constraints. If the money goes to a charity I get more fuzzies, and given that whoever would do this would likely pick a charity that I consider to be a worthwhile charity (given the general framework of LW) there's a high chance that it will produce heavy utilons. I also don't have that much money so I can convince myself to part with more if I know that it is going to a charity rather than going to someone's personal use.

Replies from: jsalvatier
comment by jsalvatier · 2011-06-07T16:10:38.665Z · LW(p) · GW(p)

Thanks :)

comment by Nick_Roy · 2011-06-07T20:40:52.760Z · LW(p) · GW(p)

I chipped in $10 for the winner of the prize.

My sincere thanks to anyone who participates!

comment by Duke · 2011-06-07T12:36:39.284Z · LW(p) · GW(p)

I am an Anki user and I am interested in working on this project.

comment by Duke · 2011-06-07T18:05:56.023Z · LW(p) · GW(p)

I am working on this vigorously and expect to produce a submitted product soon. Proceed at your own risk if you are considering competing for this prize.

Replies from: jsalvatier
comment by jsalvatier · 2011-06-07T22:35:34.661Z · LW(p) · GW(p)

Can you describe what does and does not increase your motivation? Does more money increase/decrease? The race aspect? The competitive aspect?

comment by nazgulnarsil · 2011-06-07T05:46:24.513Z · LW(p) · GW(p)

the answers to your first few questions seem to unfortunately be "it depends". the point of SRS is that you review when you are on the cusp of forgetting a piece of knowledge, which varies drastically depending on individual, conditions, and topic.

comment by Duke · 2011-06-11T21:04:17.837Z · LW(p) · GW(p)

Cross-commented from submission thread:

I think it is unfair of you to post a public critique of my submission since this is a contest. I have effectively been penalized for being first. Every submission that follows will have the benefit of seeing this critique.

I am also concerned that you have decided to change the contest format immediately following my submission. In my estimation, you had either already decided to change the format prior to my submission (clearly a major disadvantage to me), or you decided to change the format based on my submission, which, again, effectively penalizes me for being first.


Additionally, I request arbitration. I think this contest has been mismanaged to my detriment; as such, I am entitled to financial compensation.

Replies from: jsalvatier
comment by jsalvatier · 2011-06-11T21:45:35.486Z · LW(p) · GW(p)

Link

Replies from: Duke
comment by Duke · 2011-06-14T20:30:47.041Z · LW(p) · GW(p)

After some deliberation, I've decided to withdraw this request. I am content with my submission. I am also content not to receive the prize or any portion thereof.