{"id":565,"date":"2024-10-18T02:44:18","date_gmt":"2024-10-18T02:44:18","guid":{"rendered":"https:\/\/pressbooks.ccconline.org\/mat1260\/?post_type=chapter&#038;p=565"},"modified":"2025-01-20T20:55:35","modified_gmt":"2025-01-20T20:55:35","slug":"9-2-introduction-to-confidence-intervals","status":"publish","type":"chapter","link":"https:\/\/pressbooks.ccconline.org\/mat1260\/chapter\/9-2-introduction-to-confidence-intervals\/","title":{"raw":"9.2: Introduction to Confidence Intervals","rendered":"9.2: Introduction to Confidence Intervals"},"content":{"raw":"<div id=\"lobjh\" class=\"\">\r\n<div class=\"textbox textbox--learning-objectives\"><header class=\"textbox__header\">\r\n<h2 class=\"textbox__title\">Learning Objectives<\/h2>\r\n<\/header>\r\n<div class=\"textbox__content\">\r\n<ul>\r\n \t<li id=\"explain_confidence_interval\">Explain what a confidence interval represents and determine how changes in sample size and confidence level affect the precision of the confidence interval.<\/li>\r\n<\/ul>\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<p id=\"N10AF1\">Point estimation is simple and intuitive, but also a bit problematic. Here is why:<\/p>\r\n<p id=\"N10AF4\">When we estimate, say, \u03bc by the sample mean\u00a0[latex]\\overline{x}[\/latex], we are almost guaranteed to make some kind of error. Even though we know that the values of\u00a0[latex]\\overline{x}[\/latex]\u00a0fall around \u03bc, it is very unlikely that the value of\u00a0[latex]\\overline{x}[\/latex]\u00a0will fall exactly at \u03bc .<\/p>\r\n<p id=\"N10B27\">Given that such errors are a fact of life for point estimates (by the mere fact that we are basing our estimate on one sample that is a small fraction of the population), these estimates are in themselves of limited usefulness, unless we are able to quantify the extent of the estimation error. Interval estimation addresses this issue. The idea behind\u00a0<em>interval estimation<\/em>\u00a0is, therefore, to enhance the simple point estimates by supplying information about the size of the error attached.<\/p>\r\n<p id=\"N10B2F\">In this introduction, we\u2019ll provide examples that will give you a solid intuition about the basic idea behind interval estimation.<\/p>\r\n\r\n<div class=\"examplewrap\">\r\n<div class=\"example clearfix\">\r\n<div class=\"textbox textbox--examples\"><header class=\"textbox__header\">\r\n<h3 class=\"textbox__title\">Example<\/h3>\r\n<\/header>\r\n<div class=\"textbox__content\">\r\n<p id=\"N10B34\">Consider the example that we discussed in the point estimation section:<\/p>\r\n<p id=\"N10B37\">Suppose that we are interested in studying the IQ levels of students in a Smart University (SU). In particular (since IQ level is a quantitative variable), we are interested in estimating \u03bc, the mean IQ level of all the students in SU. A random sample of 100 SU students was chosen, and their (sample) mean IQ level was found to be\u00a0[latex]\\overline{x}=115[\/latex].<\/p>\r\n<span class=\"imagewrap\"><span class=\"image\"><img id=\"_i_0\" class=\"img-responsive popimg aligncenter\" title=\"A large circle represents the Population of all Students at SU. We are interested in the variable IQ, and the parameter is \u03bc, the population mean IQ level. From this population we create a sample of size n=100, represented by a smaller circle. In this sample, we find that x bar is 115.\" src=\"https:\/\/oli.cmu.edu\/repository\/webcontent\/72712ec00a0001dc418a87e73e8ebb77\/_u5_inference\/_m1_inference_one_variable\/webcontent\/image029.gif\" alt=\"A large circle represents the Population of all Students at SU. We are interested in the variable IQ, and the parameter is \u03bc, the population mean IQ level. From this population we create a sample of size n=100, represented by a smaller circle. In this sample, we find that x bar is 115.\" \/><\/span><\/span>\r\n<p id=\"N10B59\">In point estimation we used\u00a0[latex]\\overline{x}=115[\/latex]\u00a0as the point estimate for \u03bc. However, we had no idea of what the estimation error involved in such an estimation might be. Interval estimation takes point estimation a step further and says something like:<\/p>\r\n<p id=\"N10B72\">\u201cI am 95% confident that by using the point estimate\u00a0[latex]\\overline{x}=115[\/latex]\u00a0to estimate \u03bc, I am off by no more than 3 IQ points. In other words, I am 95% confident that \u03bc is within 3 of 115, or between 112 (115 \u2013 3) and 118 (115 + 3).\u201d<\/p>\r\n<p id=\"N10B8B\">Yet another way to say the same thing is: I am 95% confident that \u03bc is somewhere in (or covered by) the interval (112,118). (<em>Comment:<\/em>\u00a0At this point you should not worry about, or try to figure out, how we got these numbers. We\u2019ll do that later. All we want to do here is make sure you understand the idea.)<\/p>\r\n<p id=\"N10B91\">Note that while point estimation provided just one number as an estimate for \u03bc (115), interval estimation provides a whole interval of \u201cplausible values\u201d for \u03bc (between 112 and 118), and also attaches the level of our confidence that this interval indeed includes the value of \u03bc to our estimation (in our example, 95% confidence). The interval (112,118) is therefore called \u201ca 95% confidence interval for \u03bc.\u201d<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\nLet\u2019s look at another example:\r\n\r\n<\/div>\r\n<\/div>\r\n<div class=\"examplewrap\">\r\n<div class=\"example clearfix\">\r\n<div class=\"textbox textbox--examples\"><header class=\"textbox__header\">\r\n<h3 class=\"textbox__title\">Example<\/h3>\r\n<\/header>\r\n<div class=\"textbox__content\">\r\n<p id=\"N10B9A\">Let\u2019s consider the second example from the point estimation section.<\/p>\r\n<p id=\"N10B9D\">Suppose that we are interested in the opinions of U.S. adults regarding legalizing the use of marijuana. In particular, we are interested in the parameter p, the proportion of U.S. adults who believe marijuana should be legalized.<\/p>\r\n<p id=\"N10BA0\">Suppose a poll of 1,000 U.S. adults finds that 560 of them believe marijuana should be legalized.<\/p>\r\n<span class=\"imagewrap\"><span class=\"image\"><img id=\"_i_1\" class=\"img-responsive popimg aligncenter\" title=\"A large circle represents the population of all US Adults. We are interested in the parameter p, the proportion who believe marijuana should be legalized. From this population we create a sample of size n=1000, represented by a smaller circle. In this sample, we find that p hat is .56 .\" src=\"https:\/\/oli.cmu.edu\/repository\/webcontent\/72712ec00a0001dc418a87e73e8ebb77\/_u5_inference\/_m1_inference_one_variable\/webcontent\/image030.gif\" alt=\"A large circle represents the population of all US Adults. We are interested in the parameter p, the proportion who believe marijuana should be legalized. From this population we create a sample of size n=1000, represented by a smaller circle. In this sample, we find that p hat is .56 .\" \/><\/span><\/span>\r\n<p id=\"N10BA9\">If we wanted to estimate p, the population proportion, by a single number based on the sample, it would make intuitive sense to use the corresponding quantity in the sample, the sample proportion [latex]\\hat{\\mathcal{p}}=\\frac{560}{1000}=.56[\/latex]<\/p>\r\n<p id=\"N10BD2\">Interval estimation would take this a step further and say something like:<\/p>\r\n<p id=\"N10BD5\">\u201cI am 90% sure that by using .56 to estimate the true population proportion, p, I am off by (or, I have an error of) no more than .03 (or 3 percentage points). In other words, I am 90% confident that the actual value of p is somewhere between .53 (.56 \u2013 .03) and .59 (.56 + .03).\u201d<\/p>\r\n<p id=\"N10BD8\">Yet another way of saying this is: \u201cI am 90% confident that p is covered by the interval (.53, .59).\u201d<\/p>\r\n<p id=\"N10BDB\">In this example, (.53, .59) is a 90% confidence interval for p.<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<h2>Let\u2019s summarize<\/h2>\r\n<\/div>\r\n<\/div>\r\n<div id=\"N10BDF\" class=\"section\">\r\n<div class=\"sectionContain\">\r\n<p id=\"N10BE6\">The two examples showed us that the idea behind interval estimation is, instead of providing just one number for estimating an unknown parameter of interest, to provide an interval of plausible values of the parameter plus a level of confidence that the value of the parameter is covered by this interval.<\/p>\r\n<p id=\"N10BE9\">We are now going to go into more detail and learn how these confidence intervals are created. As you\u2019ll see, the ideas that were developed in the \u201cSampling Distributions\u201d module of the Probability unit will, again, be very important (as they were in point estimation).<\/p>\r\n<p id=\"N10BEC\">We\u2019ll start by discussing confidence intervals for the population mean \u03bc, and later discuss confidence intervals for the population proportion p.<\/p>\r\n\r\n<\/div>\r\n<\/div>","rendered":"<div id=\"lobjh\" class=\"\">\n<div class=\"textbox textbox--learning-objectives\">\n<header class=\"textbox__header\">\n<h2 class=\"textbox__title\">Learning Objectives<\/h2>\n<\/header>\n<div class=\"textbox__content\">\n<ul>\n<li id=\"explain_confidence_interval\">Explain what a confidence interval represents and determine how changes in sample size and confidence level affect the precision of the confidence interval.<\/li>\n<\/ul>\n<\/div>\n<\/div>\n<\/div>\n<p id=\"N10AF1\">Point estimation is simple and intuitive, but also a bit problematic. Here is why:<\/p>\n<p id=\"N10AF4\">When we estimate, say, \u03bc by the sample mean\u00a0[latex]\\overline{x}[\/latex], we are almost guaranteed to make some kind of error. Even though we know that the values of\u00a0[latex]\\overline{x}[\/latex]\u00a0fall around \u03bc, it is very unlikely that the value of\u00a0[latex]\\overline{x}[\/latex]\u00a0will fall exactly at \u03bc .<\/p>\n<p id=\"N10B27\">Given that such errors are a fact of life for point estimates (by the mere fact that we are basing our estimate on one sample that is a small fraction of the population), these estimates are in themselves of limited usefulness, unless we are able to quantify the extent of the estimation error. Interval estimation addresses this issue. The idea behind\u00a0<em>interval estimation<\/em>\u00a0is, therefore, to enhance the simple point estimates by supplying information about the size of the error attached.<\/p>\n<p id=\"N10B2F\">In this introduction, we\u2019ll provide examples that will give you a solid intuition about the basic idea behind interval estimation.<\/p>\n<div class=\"examplewrap\">\n<div class=\"example clearfix\">\n<div class=\"textbox textbox--examples\">\n<header class=\"textbox__header\">\n<h3 class=\"textbox__title\">Example<\/h3>\n<\/header>\n<div class=\"textbox__content\">\n<p id=\"N10B34\">Consider the example that we discussed in the point estimation section:<\/p>\n<p id=\"N10B37\">Suppose that we are interested in studying the IQ levels of students in a Smart University (SU). In particular (since IQ level is a quantitative variable), we are interested in estimating \u03bc, the mean IQ level of all the students in SU. A random sample of 100 SU students was chosen, and their (sample) mean IQ level was found to be\u00a0[latex]\\overline{x}=115[\/latex].<\/p>\n<p><span class=\"imagewrap\"><span class=\"image\"><img decoding=\"async\" id=\"_i_0\" class=\"img-responsive popimg aligncenter\" title=\"A large circle represents the Population of all Students at SU. We are interested in the variable IQ, and the parameter is \u03bc, the population mean IQ level. From this population we create a sample of size n=100, represented by a smaller circle. In this sample, we find that x bar is 115.\" src=\"https:\/\/oli.cmu.edu\/repository\/webcontent\/72712ec00a0001dc418a87e73e8ebb77\/_u5_inference\/_m1_inference_one_variable\/webcontent\/image029.gif\" alt=\"A large circle represents the Population of all Students at SU. We are interested in the variable IQ, and the parameter is \u03bc, the population mean IQ level. From this population we create a sample of size n=100, represented by a smaller circle. In this sample, we find that x bar is 115.\" \/><\/span><\/span><\/p>\n<p id=\"N10B59\">In point estimation we used\u00a0[latex]\\overline{x}=115[\/latex]\u00a0as the point estimate for \u03bc. However, we had no idea of what the estimation error involved in such an estimation might be. Interval estimation takes point estimation a step further and says something like:<\/p>\n<p id=\"N10B72\">\u201cI am 95% confident that by using the point estimate\u00a0[latex]\\overline{x}=115[\/latex]\u00a0to estimate \u03bc, I am off by no more than 3 IQ points. In other words, I am 95% confident that \u03bc is within 3 of 115, or between 112 (115 \u2013 3) and 118 (115 + 3).\u201d<\/p>\n<p id=\"N10B8B\">Yet another way to say the same thing is: I am 95% confident that \u03bc is somewhere in (or covered by) the interval (112,118). (<em>Comment:<\/em>\u00a0At this point you should not worry about, or try to figure out, how we got these numbers. We\u2019ll do that later. All we want to do here is make sure you understand the idea.)<\/p>\n<p id=\"N10B91\">Note that while point estimation provided just one number as an estimate for \u03bc (115), interval estimation provides a whole interval of \u201cplausible values\u201d for \u03bc (between 112 and 118), and also attaches the level of our confidence that this interval indeed includes the value of \u03bc to our estimation (in our example, 95% confidence). The interval (112,118) is therefore called \u201ca 95% confidence interval for \u03bc.\u201d<\/p>\n<\/div>\n<\/div>\n<p>Let\u2019s look at another example:<\/p>\n<\/div>\n<\/div>\n<div class=\"examplewrap\">\n<div class=\"example clearfix\">\n<div class=\"textbox textbox--examples\">\n<header class=\"textbox__header\">\n<h3 class=\"textbox__title\">Example<\/h3>\n<\/header>\n<div class=\"textbox__content\">\n<p id=\"N10B9A\">Let\u2019s consider the second example from the point estimation section.<\/p>\n<p id=\"N10B9D\">Suppose that we are interested in the opinions of U.S. adults regarding legalizing the use of marijuana. In particular, we are interested in the parameter p, the proportion of U.S. adults who believe marijuana should be legalized.<\/p>\n<p id=\"N10BA0\">Suppose a poll of 1,000 U.S. adults finds that 560 of them believe marijuana should be legalized.<\/p>\n<p><span class=\"imagewrap\"><span class=\"image\"><img decoding=\"async\" id=\"_i_1\" class=\"img-responsive popimg aligncenter\" title=\"A large circle represents the population of all US Adults. We are interested in the parameter p, the proportion who believe marijuana should be legalized. From this population we create a sample of size n=1000, represented by a smaller circle. In this sample, we find that p hat is .56 .\" src=\"https:\/\/oli.cmu.edu\/repository\/webcontent\/72712ec00a0001dc418a87e73e8ebb77\/_u5_inference\/_m1_inference_one_variable\/webcontent\/image030.gif\" alt=\"A large circle represents the population of all US Adults. We are interested in the parameter p, the proportion who believe marijuana should be legalized. From this population we create a sample of size n=1000, represented by a smaller circle. In this sample, we find that p hat is .56 .\" \/><\/span><\/span><\/p>\n<p id=\"N10BA9\">If we wanted to estimate p, the population proportion, by a single number based on the sample, it would make intuitive sense to use the corresponding quantity in the sample, the sample proportion [latex]\\hat{\\mathcal{p}}=\\frac{560}{1000}=.56[\/latex]<\/p>\n<p id=\"N10BD2\">Interval estimation would take this a step further and say something like:<\/p>\n<p id=\"N10BD5\">\u201cI am 90% sure that by using .56 to estimate the true population proportion, p, I am off by (or, I have an error of) no more than .03 (or 3 percentage points). In other words, I am 90% confident that the actual value of p is somewhere between .53 (.56 \u2013 .03) and .59 (.56 + .03).\u201d<\/p>\n<p id=\"N10BD8\">Yet another way of saying this is: \u201cI am 90% confident that p is covered by the interval (.53, .59).\u201d<\/p>\n<p id=\"N10BDB\">In this example, (.53, .59) is a 90% confidence interval for p.<\/p>\n<\/div>\n<\/div>\n<h2>Let\u2019s summarize<\/h2>\n<\/div>\n<\/div>\n<div id=\"N10BDF\" class=\"section\">\n<div class=\"sectionContain\">\n<p id=\"N10BE6\">The two examples showed us that the idea behind interval estimation is, instead of providing just one number for estimating an unknown parameter of interest, to provide an interval of plausible values of the parameter plus a level of confidence that the value of the parameter is covered by this interval.<\/p>\n<p id=\"N10BE9\">We are now going to go into more detail and learn how these confidence intervals are created. As you\u2019ll see, the ideas that were developed in the \u201cSampling Distributions\u201d module of the Probability unit will, again, be very important (as they were in point estimation).<\/p>\n<p id=\"N10BEC\">We\u2019ll start by discussing confidence intervals for the population mean \u03bc, and later discuss confidence intervals for the population proportion p.<\/p>\n<\/div>\n<\/div>\n","protected":false},"author":150,"menu_order":9,"template":"","meta":{"pb_show_title":"on","pb_short_title":"","pb_subtitle":"","pb_authors":[],"pb_section_license":""},"chapter-type":[48],"contributor":[],"license":[],"class_list":["post-565","chapter","type-chapter","status-publish","hentry","chapter-type-numberless"],"part":421,"_links":{"self":[{"href":"https:\/\/pressbooks.ccconline.org\/mat1260\/wp-json\/pressbooks\/v2\/chapters\/565","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/pressbooks.ccconline.org\/mat1260\/wp-json\/pressbooks\/v2\/chapters"}],"about":[{"href":"https:\/\/pressbooks.ccconline.org\/mat1260\/wp-json\/wp\/v2\/types\/chapter"}],"author":[{"embeddable":true,"href":"https:\/\/pressbooks.ccconline.org\/mat1260\/wp-json\/wp\/v2\/users\/150"}],"version-history":[{"count":5,"href":"https:\/\/pressbooks.ccconline.org\/mat1260\/wp-json\/pressbooks\/v2\/chapters\/565\/revisions"}],"predecessor-version":[{"id":1055,"href":"https:\/\/pressbooks.ccconline.org\/mat1260\/wp-json\/pressbooks\/v2\/chapters\/565\/revisions\/1055"}],"part":[{"href":"https:\/\/pressbooks.ccconline.org\/mat1260\/wp-json\/pressbooks\/v2\/parts\/421"}],"metadata":[{"href":"https:\/\/pressbooks.ccconline.org\/mat1260\/wp-json\/pressbooks\/v2\/chapters\/565\/metadata\/"}],"wp:attachment":[{"href":"https:\/\/pressbooks.ccconline.org\/mat1260\/wp-json\/wp\/v2\/media?parent=565"}],"wp:term":[{"taxonomy":"chapter-type","embeddable":true,"href":"https:\/\/pressbooks.ccconline.org\/mat1260\/wp-json\/pressbooks\/v2\/chapter-type?post=565"},{"taxonomy":"contributor","embeddable":true,"href":"https:\/\/pressbooks.ccconline.org\/mat1260\/wp-json\/wp\/v2\/contributor?post=565"},{"taxonomy":"license","embeddable":true,"href":"https:\/\/pressbooks.ccconline.org\/mat1260\/wp-json\/wp\/v2\/license?post=565"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}