Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Counterintuitively, if the sample is truly randomly distributed, you gain very little additional information as you go beyond 300 samples. This is why every political poll has an error margin of + or - 3%.


Right, but that doesn't mean that 300 (or 3000) samples total is enough. You can't make the detailed map about burning the national flag with 3000 samples. More data is helpful until you have 300 samples per pixel.


The real problem is most samples are not random. So, you are bound by the bias of your methods and you can't really get all that accurate. In theory when you double your sample size you do reduce your margin of error by a reasonable degree, but reality does not mesh until you start taking a large percentage of the population.

Think of it like a coin, that has a 1% bias you want the percentage to some accuracy (say 4 digits) how many flips do you need?. Now what if the problem is not the coin but the person doing the flipping. At some point more testers help more than more flips.


Glad someone pointed this out.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: