NetNews Usenet Archive 1992 #18

home *** CD-ROM | disk | FTP | other *** search

/ NetNews Usenet Archive 1992 #18 / NN_1992_18.iso / spool / sci / math / stat / 1658 < prev next >

Wrap

Text File | 1992-08-14 | 2.2 KB | 45 lines

Newsgroups: sci.math.stat Path: sparky!uunet!stanford.edu!leland.Stanford.EDU!dhinds From: dhinds@leland.Stanford.EDU (David Hinds) Subject: Re: Fwd: Standard Deviation. Message-ID: <1992Aug15.000049.29790@leland.Stanford.EDU> Sender: news@leland.Stanford.EDU (Mr News) Organization: DSG, Stanford University, CA 94305, USA References: <1992Aug14.172833.11844@cbfsb.cb.att.com> <seX2yRq00Uh785H2EB@andre <1992Aug14.231916.23479@magnus.acs.ohio-state.edu> Date: Sat, 15 Aug 92 00:00:49 GMT Lines: 33 In article <1992Aug14.231916.23479@magnus.acs.ohio-state.edu> regeorge@magnus.acs.ohio-state.edu (Robert E George) writes: > >More intuitively, if we take a very sample, we are less likely to get >extreme values and so our notion of what the population variance is (note >that I am not proposing some particular estimator) will be unrealistic. >For instance, I give an exam to two students. Their scores are 67 and >71. I think, "Gee, there's not a lot of variability in these scores." >But then 8 more students take the exam: > 60 78 100 38 50 88 99 39 > >and it now is clear there *is* more variability in these scores. > >But let me reiterate that T will *always* have a negative bias for the >population variance whatever the sample size is This is wrong. If this set of 8 values is the "population", sure, for your particular sample, the predicted variance happens to be less than the population variance. However, the mean variance of all samples from this population will equal the population variance (i.e., the sample variance is an unbiased estimator). Note that most samples of size 2 from this population will have much larger variances than the one you picked, and many will have variances larger than the parent population (i.e., you are just as likely to get [100 38] as [67 71]). The reason the N-weighted variance is biased, is because it measures variability around the sample mean rather than the population mean, and this estimate absorbs some of the variability in the sample. The N-weighted variance of the sample around the population mean, and the N-1 weighted variance of the sample around the sample mean, are both unbiased estimates of the variance in the population. - David Hinds dhinds@allegro.stanford.edu