Covariance in your sample vs covariance in the general population
post by RomeoStevens · 2012-05-16T00:17:34.009Z · LW · GW · Legacy · 3 commentsContents
3 comments
A popular-media take on a subtle problem in sampling. I found the graph quite illustrative.
3 comments
Comments sorted by top scores.
comment by othercriteria · 2012-05-16T02:16:32.593Z · LW(p) · GW(p)
Sampling effects like this can be really pernicious for network data (and I imagine similarly for other dependent data). It can be difficult to tell if a network is scale-free from observing a subnetwork [1] or impossible to learn an ERGM (basically, a maximum entropy distribution with graph properties as its statistics) from a subnetwork [2].
[1] M. P. H. Stumpf, C. Wiuf, and R. M. May, “Subnets of scale-free networks are not scale-free: sampling properties of networks,” Proceedings of the National Academy of Sciences of the United States of America, vol. 102, no. 12, p. 4221, 2005.
[2] C. Shalizi, “Consistency under Sampling of Exponential Random Graph Models,” arXiv.org. 2011.
comment by jsalvatier · 2012-05-16T02:39:30.413Z · LW(p) · GW(p)
That was quite good.