LessWrong analytics (February 2009 to January 2017)
post by riceissa · 2017-04-16T22:45:35.807Z · 8 comments
Introduction
In January 2017, Vipul Naik obtained Google Analytics daily sessions and pageviews data for LessWrong from Kaj Sotala. Vipul asked me to write a short post giving an overview of the data, so here it is.
This post covers just the basics. Vipul and I are eager to hear thoughts on what sort of deeper analysis people are interested in; we may incorporate these ideas in future posts.
Pageviews and sessions
The data for both sessions and pageviews span from February 26, 2009 to January 3, 2017. LessWrong seems to have launched in February 2009, so this is close to the full duration for which LessWrong has existed.
Pageviews plot:
Google Analytics recorded a total of 52.2 million pageviews for this period.
Sessions plot:
Google Analytics recorded a total of 19.7 million sessions for this period.
Both plots end with an upward swing, coinciding with the effort to revive LessWrong that began in late November 2016. However, as of early January 2017 (the latest period for which we have data) the scale of any recent increase in LessWrong usage is small in the context of the general decline starting in early 2012.
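For readers who want to reproduce totals like these from a daily export, here is a minimal sketch. It assumes a hypothetical CSV layout with `date`, `sessions`, and `pageviews` columns, which may not match the actual exported file, and the numbers in it are invented for illustration.

```python
import csv
import io
from collections import defaultdict

# Hypothetical export layout: one row per day with "date", "sessions", "pageviews".
# The numbers below are invented for illustration; they are not LessWrong data.
DAILY_CSV = """date,sessions,pageviews
2016-12-30,100,250
2016-12-31,120,300
2017-01-01,150,400
2017-01-02,160,420
"""

def totals_by_year(csv_text):
    """Return {year: {"sessions": ..., "pageviews": ...}} summed over daily rows."""
    totals = defaultdict(lambda: {"sessions": 0, "pageviews": 0})
    for row in csv.DictReader(io.StringIO(csv_text)):
        year = row["date"][:4]  # dates are assumed to be in YYYY-MM-DD form
        totals[year]["sessions"] += int(row["sessions"])
        totals[year]["pageviews"] += int(row["pageviews"])
    return {year: dict(counts) for year, counts in totals.items()}

yearly = totals_by_year(DAILY_CSV)
overall_pageviews = sum(counts["pageviews"] for counts in yearly.values())
overall_sessions = sum(counts["sessions"] for counts in yearly.values())
print(yearly)
print(overall_sessions, overall_pageviews)
```

Grouping by year rather than only summing everything makes it easier to compare one period against another, which is the kind of comparison the decline since early 2012 calls for.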
Top posts
The top 20 posts of all time (by total pageviews), with pageviews and unique pageviews rounded to the nearest thousand, are as follows:
Title | Pageviews (thousands) | Unique Pageviews (thousands) |
---|---|---|
Don’t Get Offended | 681 | 128 |
How to Be Happy | 551 | 482 |
How to Beat Procrastination | 378 | 342 |
The Best Textbooks on Every Subject | 266 | 233 |
Do you have High-Functioning Asperger’s Syndrome? | 188 | 168 |
Superhero Bias | 169 | 154 |
The Quantum Physics Sequence | 157 | 130 |
Bayesian Judo | 140 | 126 |
An Alien God | 125 | 113 |
An Intuitive Explanation of Quantum Mechanics | 123 | 106 |
Three Worlds Collide (0/8) | 121 | 93 |
Bayes’ Theorem Illustrated (My Way) | 121 | 112 |
9/26 is Petrov Day | 121 | 115 |
The Baby-Eating Aliens (1/8) | 109 | 98 |
The noncentral fallacy - the worst argument in the world? | 107 | 99 |
Advanced Placement exam cutoffs and superficial knowledge over deep knowledge | 107 | 94 |
Guessing the Teacher’s Password | 102 | 96 |
The Fun Theory Sequence | 102 | 90 |
Optimal Employment | 102 | 97 |
Ugh fields | 95 | 86 |
Note that Google Analytics reports are subject to sampling when the number of sessions is large (as it is here), so the input numbers are not exact. More details can be found in a post at LunaMetrics. This doesn’t affect the estimates for the top posts, but those wishing to work with the exported data should be aware of it.
Each post on LessWrong can have numerous URLs. In the case of posts that were renamed, a significant number of pageviews could be recorded at both the old and new URL. To take an example, the following URLs all point to lukeprog’s post “How to Be Happy”:
- http://lesswrong.com/lw/4su/the_science_of_happiness/
- http://lesswrong.com/lw/4su/how_to_be_happy/
- http://lesswrong.com/lw/4su
- http://lesswrong.com/lw/4su/foo
- http://lesswrong.com/r/lesswrong/lw/4su/how_to_be_happy/
- http://lesswrong.com/r/lesswrong/lw/4su/foo/
- http://lesswrong.com/r/discussion/lw/4su/foo/
All that matters for identifying this particular post is that we have the substring “/lw/4su” in the URL. In the above table, I have grouped the URLs by this identifying substring and summed to get the pageview counts.
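The grouping step can be sketched roughly as follows. This is an illustration, not the script from the Gist: the function names, the regular expression, the `/lw/abc/` post, and all pageview counts are assumptions made up for the example.

```python
import re
from collections import defaultdict

# Matches the post identifier that follows "/lw/", e.g. "4su" in
# "/lw/4su/how_to_be_happy/". The identifier alphabet is an assumption.
POST_ID = re.compile(r"/lw/([0-9a-z]+)")

def pageviews_by_post(rows):
    """Sum pageviews over all URLs that share the same "/lw/<id>" substring.

    rows: iterable of (url_path, pageviews) pairs.
    Returns a dict mapping post id -> total pageviews.
    """
    totals = defaultdict(int)
    for path, views in rows:
        match = POST_ID.search(path)
        if match is None:
            continue  # not a post URL (for example, the front page)
        totals[match.group(1)] += views
    return dict(totals)

def top_posts(totals, n=20):
    """Return the n posts with the most pageviews, as (id, thousands) pairs."""
    ranked = sorted(totals.items(), key=lambda item: item[1], reverse=True)
    return [(post_id, round(views / 1000)) for post_id, views in ranked[:n]]

# Made-up counts, for illustration only; "/lw/abc/" is a hypothetical post.
sample = [
    ("/lw/4su/the_science_of_happiness/", 1200),
    ("/lw/4su/how_to_be_happy/", 3400),
    ("/r/lesswrong/lw/4su/how_to_be_happy/", 400),
    ("/lw/abc/another_post/", 2600),
    ("/", 9000),
]
totals = pageviews_by_post(sample)
print(top_posts(totals, n=2))
```

Keying on the identifier rather than the full path means that renamed posts and the `/r/lesswrong/` and `/r/discussion/` variants all count toward the same post.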
In addition, each post has two “canonical” URLs that can be obtained by clicking on the post titles: one that begins with either “/r/lesswrong/lw” or “/r/discussion/lw” and one that begins with just “/lw”. I have used the latter in linking to the posts from my table.
Source code
The data, the source code used to generate the plots, and the Markdown source of this post are available in a GitHub Gist.
Clone the Git repository with:
git clone https://gist.github.com/cbdd400180417c689b2befbfbe2158fc.git
Further reading
- Alexa profile for lesswrong.com
- SimilarWeb profile for lesswrong.com
- “Effective Altruism Forum web traffic from Google Analytics”, a post by Vipul
- gwern.net analytics by gwern
Here are a few related PredictionBook predictions:
- Total LessWrong Google Analytics Sessions for 2017 will be higher than for 2016
- Total LessWrong Google Analytics Pageviews for 2017 will be higher than for 2016
- A LessWrong post published during 2017 will become a top-10 post by pageviews on LW by 2018-01-01
Acknowledgments
Thanks to Kaj for providing the data used in this post. Thanks to Vipul for asking around for the data, for the idea of this post, and for sponsoring my work on this post.
8 comments
comment by Kaj_Sotala · 2017-04-21T15:09:51.740Z
Huh, some of the top articles are totally not what I'd have expected. "Don't Get Offended" is non-promoted and currently only has an upvote total of 32. "Advanced Placement exam cutoffs and superficial knowledge over deep knowledge" is also not promoted and has an upvote total of 4.
Would be interesting for someone to run an analysis to see how closely upvotes and page views correlate. Apparently not as much as I'd have guessed.
reply by ChristianKl · 2017-04-21T15:38:45.835Z
"Don't Get Offended" seems to rank highly on Google for the term "Don't Get Offended" which has search volume.
comment by scarcegreengrass · 2017-04-20T21:33:29.043Z
I notice a contradiction that I don't yet understand. This post and the wiki page (https://wiki.lesswrong.com/wiki/History_of_Less_Wrong) say that LessWrong started in 2009. However, there are comments here with earlier timestamps (arbitrary example: http://lesswrong.com/lw/qd/science_isnt_strict_enough/k2t). I was under the impression lesswrong.com was an active community at least since 2007. Is the wiki's "2009" a typo?
Also, I am updating my PoV on recent LW history based on the analytics charts. I take it that pageviews have not yet dropped below 2010 levels, even if commenting rates did?
reply by entirelyuseless · 2017-04-21T00:52:02.707Z
Comments and posts were ported over from Overcoming Bias and so they preceded the Less Wrong website.
reply by scarcegreengrass · 2017-04-23T03:23:04.945Z
Ah, the comments too! Okay, now I understand.
comment by riceissa · 2018-11-24T07:43:02.644Z
There are some more data (post count, comment count, vote count, etc., but not pageviews) at "History of LessWrong: Some Data Graphics".