sample v. population confidence intervals - gov1000-list

Jason Lakin

24 Sep 24 Sep

10:58 p.m.

Mike, et al: My intepretation of the events in question is that you always should use the n-1 because you never know the actual population mean. However, on the homework, we were told (at least tonight we were told- it doesn't say this on the homework) to assume that we did know the population mean, and were working backwards. In that case, you can use the population sd. The difference seems to have to do with the presumption about what the population mean is. if you assume that your estimate is right (a false assumption, but the one we were supposed to use in the homework), then you can use n. otherwise, its n-1. However, i would note that i have never heard of anyone actually doing this, probably because the difference between n and n-1 is so small as to be irrelevant in general. So most people just round the n-1 to n, and forget about it. This is what i have learned in the past... best jason ----- Original Message ----- From: "Michael Richard Kellermann" <kellerm(a)fas.harvard.edu> To: <gov1000-list(a)fas.harvard.edu> Sent: Wednesday, September 24, 2003 10:39 PM Subject: [gov1000-list] sample v. population confidence intervals

...

Hi all - I have a question about the discussion we had tonight regarding the confidence interval versus the sample confidence interval. I understand that when you have to estimate both the mean and the standard deviation to calculate the confidence interval, you have to use n-1 rather than n. The formula that we are using to calculate the confidence interval only requires us to estimate one parameter, p. Using this method for calculating confidence intervals, when would we need to use n-1? Cheers, Mike _______________________________________________ gov1000-list mailing list gov1000-list(a)fas.harvard.edu http://www.fas.harvard.edu/mailman/listinfo/gov1000-list

Reply

Jacob M. Kline

2:15 p.m.

You really don't need to use the n-1, not the least reason for this is that the sample is quite large are the difference is neglible. Let this go for now, because we have not discussed it in the lecture and because there are many more important things to study in detail. On Thu, 25 Sep 2003, Michael Richard Kellermann wrote:

...

Hi - I buy that argument for most of the things for which we would want to estimate the mean and standard deviation of the sample in order to construct a confidence interval. Say we were using a thermometer score for feelings about the president (leaving aside the problems with such a measure). In that case there are many different sets of n responses that would yield the same estimate for the mean. To calculate the confidence interval we would also have to estimate how tightly the particular sample that we drew clustered around the mean. Since we would be estimating two separate parameters, we burn an additional degree of freedom. In this case, however, since the responses are approve/disapprove (success/failure), there is only one possible set of n responses for any given estimate of the proportion of approvals. It seems to me that the estimated sample variance follows directly from the estimated mean and does not have to be estimated separately. I still don't see why we would need to ever use n-1 for this particular type of question. Cheers, Mike On Wed, 24 Sep 2003, Jason Lakin wrote:

Mike, et al: My intepretation of the events in question is that you always should use the n-1 because you never know the actual population mean. However, on the homework, we were told (at least tonight we were told- it doesn't say this on the homework) to assume that we did know the population mean, and were working backwards. In that case, you can use the population sd. The difference seems to have to do with the presumption about what the population mean is. if you assume that your estimate is right (a false assumption, but the one we were supposed to use in the homework), then you can use n. otherwise, its n-1. However, i would note that i have never heard of anyone actually doing this, probably because the difference between n and n-1 is so small as to be irrelevant in general. So most people just round the n-1 to n, and forget about it. This is what i have learned in the past... best jason

_______________________________________________ gov1000-list mailing list gov1000-list(a)fas.harvard.edu http://www.fas.harvard.edu/mailman/listinfo/gov1000-list

Reply

Zoe Turner VanderWolk

7:55 a.m.

Hi Mike, I think you use n when you're estimating a population parameter, and you use n-1 when you're estimating a sample parameter. So if we're trying to get a confidence interval for the *sample* mean, we use sqrt(p(1-p)/n, but when we're talking about the mean of the entire population that the sample was drawn from, you use n-1 instead. It just depends on whether the thing in the middle of your CI is a statistic or a population parameter (and they usually tell you in the question which one they want a CI for!) Cheers, Zoe On Wed, 24 Sep 2003, Michael Richard Kellermann wrote:

...

Hi all - I have a question about the discussion we had tonight regarding the confidence interval versus the sample confidence interval. I understand that when you have to estimate both the mean and the standard deviation to calculate the confidence interval, you have to use n-1 rather than n. The formula that we are using to calculate the confidence interval only requires us to estimate one parameter, p. Using this method for calculating confidence intervals, when would we need to use n-1? Cheers, Mike _______________________________________________ gov1000-list mailing list gov1000-list(a)fas.harvard.edu http://www.fas.harvard.edu/mailman/listinfo/gov1000-list

Reply

Zoe Turner VanderWolk

11:38 a.m.

New subject: samgov1000-list@fas.harvard.edu <gov1000-list@fas.harvard.edu>ple v. population confidence intervals

Yes, you're right...this is what happens when you reply to emails at 7.30am :-) Zoe On Thu, 25 Sep 2003, Jason Lakin wrote:

...

greetings. i think you have the first sentence right, but the equations backwards in the rest of the paragraph... jason ----- Original Message ----- From: "Zoe Turner VanderWolk" <vanderw(a)fas.harvard.edu> To: "Michael Richard Kellermann" <kellerm(a)fas.harvard.edu> Cc: <gov1000-list(a)fas.harvard.edu> Sent: Thursday, September 25, 2003 7:55 AM Subject: Re: [gov1000-list] sample v. population confidence intervals

Hi Mike, I think you use n when you're estimating a population parameter, and you use n-1 when you're estimating a sample parameter. So if we're trying to get a confidence interval for the *sample* mean, we use sqrt(p(1-p)/n, but when we're talking about the mean of the entire population that the sample was drawn from, you use n-1 instead. It just depends on whether the thing in the middle of your CI is a statistic or a population parameter (and they usually tell you in the question which one they want a CI for!) Cheers, Zoe On Wed, 24 Sep 2003, Michael Richard Kellermann wrote: > > Hi all - > > I have a question about the discussion we had tonight regarding the > confidence interval versus the sample confidence interval. I understand > that when you have to estimate both the mean and the standard deviation

to

calculate the confidence interval, you have to use n-1 rather than n. The formula that we are using to calculate the confidence interval only requires us to estimate one parameter, p. Using this method for calculating confidence intervals, when would we need to use n-1? Cheers, Mike _______________________________________________ gov1000-list mailing list gov1000-list(a)fas.harvard.edu http://www.fas.harvard.edu/mailman/listinfo/gov1000-list

_______________________________________________ gov1000-list mailing list gov1000-list(a)fas.harvard.edu http://www.fas.harvard.edu/mailman/listinfo/gov1000-list

Reply