9.1: Point Estimates
Learning Objectives
- Determine point estimates in simple cases, and make the connection between the sampling distribution of a statistic and its properties as a point estimator.
Point estimation is the form of statistical inference in which, based on the sample data, we estimate the unknown parameter of interest using a single value (hence the name point estimation). As the following two examples illustrate, this form of inference is quite intuitive.
Example
Suppose that we are interested in studying the IQ levels of students at Smart University (SU). In particular (since IQ level is a quantitative variable), we are interested in estimating μ, the mean IQ level of all the students at SU.
A random sample of 100 SU students was chosen, and their (sample) mean IQ level was found to be ¯x = 115.
If we wanted to estimate μ, the population mean IQ level, by a single number based on the sample, it would make intuitive sense to use the corresponding quantity in the sample: the sample mean ¯x = 115. We say that 115 is the point estimate for μ, and in general, we’ll always use ¯x as the point estimator for μ. (Note that when we talk about the specific value, 115, we use the term estimate; when we talk in general about the statistic ¯x, we use the term estimator.) The following figure summarizes this example:
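To make the estimate/estimator distinction concrete, here is a minimal Python sketch. The example does not give the 100 raw IQ scores, so the sample below is simulated purely for illustration; the point is that the statistic ¯x, computed from whatever random sample we happen to draw, is the point estimator, and the number it produces is the point estimate.

```python
import random
import statistics

# Hypothetical data: the example doesn't list the 100 raw IQ scores,
# so we simulate a random sample for illustration only.
random.seed(1)
iq_sample = [random.gauss(115, 15) for _ in range(100)]

# The estimator is the statistic x-bar; the estimate is its computed value.
x_bar = statistics.mean(iq_sample)
print(round(x_bar, 1))
```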
Here is another example.
Example
Suppose that we are interested in the opinions of U.S. adults regarding legalizing the use of marijuana. In particular, we are interested in the parameter p, the proportion of U.S. adults who believe marijuana should be legalized.
Suppose a poll of 1,000 U.S. adults finds that 560 of them believe marijuana should be legalized. If we wanted to estimate p, the population proportion, using a single number based on the sample, it would make intuitive sense to use the corresponding quantity in the sample: the sample proportion ˆp = 560/1000 = .56. We say in this case that .56 is the point estimate for p, and in general, we’ll always use ˆp as the point estimator for p. (Note, again, that when we talk about the specific value, .56, we use the term estimate; when we talk in general about the statistic ˆp, we use the term estimator.) Here is a visual summary of this example:
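Since a sample proportion is just a sample mean of 0/1 responses, the point estimate from this poll can be sketched in a few lines (Python used for illustration):

```python
# Poll results from the example: 560 "yes" answers out of 1,000 adults.
n, successes = 1000, 560

# The sample proportion p-hat is the point estimate for p.
p_hat = successes / n

# Equivalently, p-hat is the sample mean of the 0/1 (no/yes) responses.
responses = [1] * successes + [0] * (n - successes)
assert sum(responses) / n == p_hat

print(p_hat)  # 0.56
```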
Comment 1
You may feel that since it is so intuitive, you could have figured out point estimation on your own, even without the benefit of an entire course in statistics. Certainly, our intuition tells us that the best estimator for μ should be ¯x, and the best estimator for p should be ˆp.
Probability theory does more than confirm this intuition; it actually gives an explanation (beyond intuition) of why ¯x and ˆp are good choices as point estimators for μ and p, respectively. In the Sampling Distributions module of the Probability unit, we learned about the sampling distribution of ¯X and found that as long as a sample is taken at random, the distribution of sample means is exactly centered at the value of the population mean.
¯X is therefore said to be an unbiased estimator for μ. Any particular sample mean might turn out to be less than the actual population mean, or it might turn out to be more. But in the long run, such sample means are “on target” in that they underestimate no more often than they overestimate, and vice versa.
Likewise, we learned that the sampling distribution of the sample proportion, ˆp, is centered at the population proportion p (as long as the sample is taken at random), thus making ˆp an unbiased estimator for p.
As stated in the introduction, probability theory plays an essential role as we establish results for statistical inference. Our assertion above that sample mean and sample proportion are unbiased estimators is the first such instance.
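The unbiasedness claim can be checked empirically: draw many random samples from a population with known μ and p, and the long-run average of the sample means and sample proportions should land on the parameters themselves. A small simulation sketch (the population values below are assumed, for illustration only):

```python
import random

random.seed(0)
mu, sigma = 115, 15   # assumed population mean and SD (illustrative)
p = 0.56              # assumed population proportion (illustrative)
n, reps = 100, 2000

means, props = [], []
for _ in range(reps):
    # One random sample of size n for each statistic.
    iqs = [random.gauss(mu, sigma) for _ in range(n)]
    votes = [1 if random.random() < p else 0 for _ in range(n)]
    means.append(sum(iqs) / n)
    props.append(sum(votes) / n)

# Individual samples miss high or low, but the long-run averages of the
# statistics sit at the parameters: no systematic bias in either direction.
print(round(sum(means) / reps, 1))
print(round(sum(props) / reps, 2))
```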
Comment 2
Notice how important the principles of sampling and design are for our above results: if the sample of U.S. adults in (example 2 on the previous page) was not random, but instead included predominantly college students, then .56 would be a biased estimate for p, the proportion of all U.S. adults who believe marijuana should be legalized. If the survey design were flawed, such as loading the question with a reminder about the dangers of marijuana leading to hard drugs, or a reminder about the benefits of marijuana for cancer patients, then .56 would be biased on the low or high side, respectively. Our point estimates are truly unbiased estimates for the population parameter only if the sample is random and the study design is not flawed.
Comment 3
Not only are sample mean and sample proportion on target as long as the samples are random, but their accuracy improves as sample size increases. Again, there are two “layers” here for explaining this.
Intuitively, larger sample sizes give us more information with which to pin down the true nature of the population. We can therefore expect the sample mean and sample proportion obtained from a larger sample to be closer to the population mean and proportion, respectively. In the extreme, when we sample the whole population (which is called a census), the sample mean and sample proportion will exactly coincide with the population mean and population proportion.
There is another layer here that, again, comes from what we learned about the sampling distributions of the sample mean and the sample proportion. Let’s use the sample mean for the explanation.
Recall that the sampling distribution of the sample mean ¯X is, as we mentioned before, centered at the population mean μ and has a standard deviation of σ/√n. As a result, as the sample size n increases, the sampling distribution of ¯X gets less spread out. This means that values of ¯X that are based on a larger sample are more likely to be closer to μ (as the figure below illustrates):
Similarly, since the sampling distribution of ˆp is centered at p and has a standard deviation of √(p(1−p)/n), which decreases as the sample size gets larger, values of ˆp are more likely to be closer to p when the sample size is larger.
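The shrinking spread is easy to see by simulation. The sketch below (with assumed, illustrative population values) estimates the standard deviation of ¯X at two sample sizes and compares it with the theoretical value σ/√n:

```python
import random
import statistics

random.seed(0)
mu, sigma = 115, 15  # assumed population parameters (illustrative)

def sd_of_sample_means(n, reps=2000):
    """Simulate the spread of the sampling distribution of x-bar for size n."""
    means = [statistics.mean(random.gauss(mu, sigma) for _ in range(n))
             for _ in range(reps)]
    return statistics.stdev(means)

# Theory: the SD of x-bar is sigma / sqrt(n), so quadrupling n halves the spread.
print(round(sd_of_sample_means(25), 2))   # close to 15 / sqrt(25)  = 3.0
print(round(sd_of_sample_means(100), 2))  # close to 15 / sqrt(100) = 1.5
```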
Comment 4
Another example of a point estimate is using the sample variance, s² = [(x₁ − ¯x)² + ... + (xₙ − ¯x)²] / (n − 1), to estimate the population variance, σ².
In this course, we will not be concerned with estimating σ² for its own sake, but since we will often substitute s for σ when standardizing the sample mean, it is worth pointing out that s² is an unbiased estimator for σ². If we had divided by n instead of n − 1 in our estimator for the population variance, then in the long run our sample variance would be guilty of a slight underestimation. Division by n − 1 accomplishes the goal of making this point estimator unbiased. Making unbiased estimators a top priority is, in fact, the reason that our formula for s, introduced in the Exploratory Data Analysis unit, involves division by n − 1 instead of by n.
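The effect of the divisor can also be checked by simulation: average many sample variances computed each way, and division by n falls short of σ² while division by n − 1 stays on target. A sketch with assumed, illustrative population values:

```python
import random

random.seed(0)
mu, sigma = 115, 15      # assumed population; true variance sigma^2 = 225
n, reps = 10, 20000

biased_total = 0.0       # running sum of variances computed with divisor n
unbiased_total = 0.0     # running sum of variances computed with divisor n - 1
for _ in range(reps):
    x = [random.gauss(mu, sigma) for _ in range(n)]
    x_bar = sum(x) / n
    ss = sum((xi - x_bar) ** 2 for xi in x)
    biased_total += ss / n
    unbiased_total += ss / (n - 1)

# Long-run averages: divisor n underestimates 225 (by a factor of (n-1)/n),
# while divisor n - 1 is on target, i.e. unbiased.
print(round(biased_total / reps, 1))
print(round(unbiased_total / reps, 1))
```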
Let’s Summarize
We use ˆp (sample proportion) as a point estimator for p (population proportion). It is an unbiased estimator: its long-run distribution is centered at p as long as the sample is random.
We use ¯x (sample mean) as a point estimator for μ (population mean). It is an unbiased estimator: its long-run distribution is centered at μ as long as the sample is random.
In both cases, the larger the sample size, the more accurate the point estimator is. In other words, the larger the sample size, the more likely it is that the sample mean (proportion) is close to the unknown population mean (proportion).