The Central Limit Theorem says that, for large samples (samples of size \(n \ge 30\)), when viewed as a random variable the sample mean \(\overline\) is normally distributed with mean \(\mu_< \overline>=\mu\) and standard deviation \(\sigma_<\overline>=\frac>\). The Empirical Rule says that we must go about two standard deviations from the mean to capture \(95\%\) of the values of \(\overline\) generated by sample after sample. A more precise distance based on the normality of \(\overline\) is \(1.960\) standard deviations, which is \( E=\frac>\).
The key idea in the construction of the \(95\%\) confidence interval is this, as illustrated in Figure \(\PageIndex\), because in sample after sample \(95\%\) of the values of \(\overline\) lie in the interval \([\mu -E,\mu +E]\), if we adjoin to each side of the point estimate \( x-a\) “wing” of length \(E\), \(95\%\) of the intervals formed by the winged dots contain \(\mu\). The \(95\%\) confidence interval is thus \(\bar\pm 1.960\frac>\). For a different level of confidence, say \(90\%\) or \(99\%\), the number \(1.960\) will change, but the idea is the same.
Figure \(\PageIndex\) shows the intervals generated by a computer simulation of drawing \(40\) samples from a normally distributed population and constructing the \(95\%\) confidence interval for each one. We expect that about \( (0.05)(40)=2\) of the intervals so constructed would fail to contain the population mean \(\mu \), and in this simulation two of the intervals, shown in red, do.
It is standard practice to identify the level of confidence in terms of the area \(α\) in the two tails of the distribution of \(\overline\) when the middle part specified by the level of confidence is taken out. This is shown in Figure \(\PageIndex\), drawn for the general situation, and in Figure \(\PageIndex\), drawn for \(95\%\) confidence.
Remember from Section 5.4 that the \(z\)-value that cuts off a right tail of area \(c\) is denoted \(z_c\). Thus the number \(1.960\) in the example is \( z_\), which is \(z_>\) for \(\alpha =1-0.95=0.05\).
For \(95\%\) confidence the area in each tail is \(\alpha /2=0.025\).
The level of confidence can be any number between \(0\) and \(100\%\), but the most common values are probably \(90\%\) \((\alpha =0.10)\), \(95\%\) \((\alpha =0.05)\), and \(99\%\) \((\alpha =0.01)\).
Thus in general for a \(100(1-\alpha )\%\) confidence interval, \(E=z_(\sigma /\sqrt)\), so the formula for the confidence interval is \(\bar\pm z_(\sigma /\sqrt)\). While sometimes the population standard deviation \(\sigma\) is known, typically it is not. If not, for \(n\geq 30\) it is generally safe to approximate \(\sigma\) by the sample standard deviation \(s\).
A sample is considered large when \(n\geq 30\).
As mentioned earlier, the number
is called the margin of error of the estimate.
Find the number \(z_\) needed in construction of a confidence interval:
using the tables in Figure \(\PageIndex\) below.
Use Figure \(\PageIndex\) below to find the number \(z_\) needed in construction of a confidence interval:
Figure \(\PageIndex\) can be used to find \(z_c\) only for those values of \(c\) for which there is a column with the heading \(t_c\) appearing in the table; otherwise we must use Figure \(\PageIndex\) in reverse. But when it can be done it is both faster and more accurate to use the last line of Figure \(\PageIndex\) to find \(z_c\) than it is to do so using Figure \(\PageIndex\) in reverse.
A sample of size \(49\) has sample mean \(35\) and sample standard deviation \(14\). Construct a \(98\%\) confidence interval for the population mean using this information. Interpret its meaning.
For confidence level \(98\%\), \(\alpha =1-0.98=0.02\), so \(z_=z_\). From Figure \(\PageIndex\) we read directly that \(z_=2.326\).Thus
\[\bar\pm z_\frac>=35\pm 2.326\left ( \frac> \right )=35\pm 4.652\approx 35\pm 4.7 \nonumber \]
We are \(98\%\) confident that the population mean \(\mu\) lies in the interval \([30.3,39.7]\), in the sense that in repeated sampling \(98\%\) of all intervals constructed from the sample data in this manner will contain \(\mu\).
A random sample of \(120\) students from a large university yields mean GPA \(2.71\) with sample standard deviation \(0.51\). Construct a \(90\%\) confidence interval for the mean GPA of all students at the university.
For confidence level \(90\%\), \(\alpha =1-0.90=0.10\), so \(z_=z_\). From Figure \(\PageIndex\) we read directly that \(z_=1.645\). Since \(n=120\), \(\bar=2.71\), and \(s=0.51\),
One may be \(90\%\) confident that the true average GPA of all students at the university is contained in the interval \((2.71-0.08,2.71+0.08)=(2.63,2.79)\).
This page titled 7.1: Large Sample Estimation of a Population Mean is shared under a CC BY-NC-SA 3.0 license and was authored, remixed, and/or curated by Anonymous via source content that was edited to the style and standards of the LibreTexts platform.