{"id":81,"date":"2022-05-18T16:36:27","date_gmt":"2022-05-18T16:36:27","guid":{"rendered":"https:\/\/pressbooks.ccconline.org\/accintrostats\/chapter\/measures-of-the-location-of-the-data\/"},"modified":"2022-11-09T16:37:09","modified_gmt":"2022-11-09T16:37:09","slug":"measures-of-the-location-of-the-data","status":"publish","type":"chapter","link":"https:\/\/pressbooks.ccconline.org\/accintrostats\/chapter\/measures-of-the-location-of-the-data\/","title":{"raw":"Chapter 2.7: Measures of Position","rendered":"Chapter 2.7: Measures of Position"},"content":{"raw":"&nbsp;\r\n<p id=\"element-280\">The common measures of position or location are <span data-type=\"term\">quartiles<\/span> and <span data-type=\"term\">percentiles<\/span><\/p>\r\n<p id=\"fs-idp16986528\">Quartiles are special percentiles. The first quartile, <em data-effect=\"italics\">Q<\/em><sub>1<\/sub>, is the same as the 25<sup>th<\/sup> percentile, and the third quartile, <em data-effect=\"italics\">Q<\/em><sub>3<\/sub>, is the same as the 75<sup>th<\/sup> percentile. The median, <em data-effect=\"italics\">M<\/em>, is called both the second quartile and the 50<sup>th<\/sup> percentile.<\/p>\r\n<p id=\"element-105\">To calculate quartiles and percentiles, the data must be ordered from smallest to largest. Quartiles divide ordered data into quarters. Percentiles divide ordered data into hundredths. To score in the 90<sup>th<\/sup> percentile of an exam does not mean, necessarily, that you received 90% on a test. It means that 90% of test scores are the same or less than your score and 10% of the test scores are the same or greater than your test score.<\/p>\r\n<p id=\"fs-idm12500320\">Percentiles are useful for comparing values. For this reason, universities and colleges use percentiles extensively. One instance in which colleges and universities use percentiles is when SAT results are used to determine a minimum testing score that will be used as an acceptance factor. For example, suppose Duke accepts SAT scores at or above the 75<sup>th<\/sup> percentile. That translates into a score of at least 1220.<\/p>\r\n<p id=\"fs-idp48110304\">Percentiles are mostly used with very large populations. Therefore, if you were to say that 90% of the test scores are less (and not the same or less) than your score, it would be acceptable because removing one particular data value is not significant.<\/p>\r\n<p id=\"element-681\">The <span data-type=\"term\">median<\/span> is a number that measures the \"center\" of the data. You can think of the median as the \"middle value,\" but it does not actually have to be one of the observed values. It is a number that separates ordered data into halves. Half the values are the same number or smaller than the median, and half the values are the same number or larger. For example, consider the following data. <span data-type=\"newline\">\r\n<\/span>1;\u00a0 11.5;\u00a0 6;\u00a0 7.2;\u00a0 4;\u00a0 8;\u00a0 9;\u00a0 10;\u00a0 6.8;\u00a0 8.3;\u00a0 2;\u00a0 2;\u00a0 10;\u00a0 1 <span data-type=\"newline\">\r\n<\/span>Ordered from smallest to largest: <span data-type=\"newline\">\r\n<\/span>1;\u00a0 1;\u00a0 2;\u00a0 2;\u00a0 4;\u00a0 6;\u00a0 6. 8;\u00a0 7.2;\u00a0 8;\u00a0 8.3;\u00a0 9;\u00a0 10;\u00a0 10;\u00a0 11.5<\/p>\r\n<p id=\"element-546\">Since there are 14 observations, the median is between the seventh value, 6.8, and the eighth value, 7.2. To find the median, add the two values together and divide by two.<\/p>\r\n\r\n<div data-type=\"equation\">\\(\\frac{6.8+7.2}{2}=7\\)<\/div>\r\n<p id=\"element-995\">The median is seven. Half of the values are smaller than seven and half of the values are larger than seven.<\/p>\r\n<p id=\"element-308\"><span data-type=\"term\">Quartiles<\/span> are numbers that separate the data into quarters. Quartiles may or may not be part of the data. To find the quartiles, first find the median or second quartile. The first quartile, <em data-effect=\"italics\">Q<\/em><sub>1<\/sub>, is the middle value of the lower half of the data, and the third quartile, <em data-effect=\"italics\">Q<\/em><sub>3<\/sub>, is the middle value, or median, of the upper half of the data. To get the idea, consider the same data set: <span data-type=\"newline\">\r\n<\/span>1;\u00a0 1;\u00a0 2;\u00a0 2;\u00a0 4;\u00a0 6;\u00a0 6.8;\u00a0 7.2;\u00a0 8;\u00a0 8.3;\u00a0 9;\u00a0 10;\u00a0 10;\u00a0 11.5<\/p>\r\n<p id=\"element-805\">The median or <strong>second quartile<\/strong> is seven. The lower half of the data are 1,\u00a0 1,\u00a0 2,\u00a0 2,\u00a0 4,\u00a0 6,\u00a0 6.8. The middle value of the lower half is two. <span data-type=\"newline\">\r\n<\/span>1;\u00a0 1;\u00a0 2;\u00a0 2;\u00a0 4;\u00a0 6;\u00a0 6.8<\/p>\r\n<p id=\"element-227\">The number two, which is part of the data, is the <span data-type=\"term\">first quartile<\/span>. One-fourth of the entire sets of values are the same as or less than two and three-fourths of the values are more than two.<\/p>\r\nThe upper half of the data is 7.2,\u00a0 8,\u00a0 8.3,\u00a0 9,\u00a0 10,\u00a0 10,\u00a0 11.5. The middle value of the upper half is nine.\r\n<p id=\"element-386\">The <span data-type=\"term\">third quartile<\/span>, <em data-effect=\"italics\">Q<\/em>3, is nine. Three-fourths (75%) of the ordered data set are less than nine. One-fourth (25%) of the ordered data set are greater than nine. The third quartile is part of the data set in this example.<\/p>\r\n<p id=\"element-716\">The <span data-type=\"term\">interquartile range<\/span> is a number that indicates the spread of the middle half or the middle 50% of the data. It is the difference between the third quartile (<em data-effect=\"italics\">Q<\/em><sub>3<\/sub>) and the first quartile (<em data-effect=\"italics\">Q<\/em><sub>1<\/sub>).<\/p>\r\n<p id=\"delete_me\"><em data-effect=\"italics\">IQR<\/em> = <em data-effect=\"italics\">Q<\/em><sub>3<\/sub> \u2013 <em data-effect=\"italics\">Q<\/em><sub>1<\/sub><\/p>\r\nThe <em data-effect=\"italics\">IQR<\/em> can help to determine potential <strong>outliers<\/strong>. <strong>A value is suspected to be a potential outlier if it is less than (1.5)(<em data-effect=\"italics\">IQR<\/em>) below the first quartile or more than (1.5)(<em data-effect=\"italics\">IQR<\/em>) above the third quartile<\/strong>. Potential outliers always require further investigation.\r\n<div id=\"fs-idm10803744\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\r\n<div data-type=\"title\">NOTE<\/div>\r\n<p id=\"fs-idp4345696\">A potential outlier is a data point that is significantly different from the other data points. These special data points may be errors or some kind of abnormality or they may be a key to understanding the data.<\/p>\r\n\r\n<\/div>\r\n<div id=\"element-826\" class=\"textbox textbox--examples\" data-type=\"example\">\r\n<div id=\"exer5\" data-type=\"exercise\">\r\n<div id=\"id45036025\" data-type=\"problem\">\r\n<p id=\"element-720\">For the following 13 real estate prices, calculate the <em data-effect=\"italics\">IQR<\/em> and determine if any prices are potential outliers. Prices are in dollars. <span data-type=\"newline\">\r\n<\/span>389,950;\u00a0 230,500;\u00a0 158,000;\u00a0 479,000;\u00a0 639,000;\u00a0 114,950;\u00a0 5, 500,000;\u00a0 387,000;\u00a0 659,000;\u00a0 529,000;\u00a0 575,000;\u00a0 488,800;\u00a0 1,095,000<\/p>\r\n\r\n<\/div>\r\n<div id=\"id45746296\" data-type=\"solution\">\r\n<p id=\"element-939\">Order the data from smallest to largest. <span data-type=\"newline\">\r\n<\/span>114,950;\u00a0 158,000;\u00a0 230,500;\u00a0 387,000;\u00a0 389,950;\u00a0 479,000;\u00a0 488,800;\u00a0 529,000;\u00a0 575,000; 639,000; 659,000; 1,095,000; 5,500,000<\/p>\r\n<p id=\"element-170\"><em data-effect=\"italics\">M<\/em> = 488, 800<\/p>\r\n<em data-effect=\"italics\">Q<\/em><sub>1<\/sub> = \\(\\frac{\\text{230,500 + 387,000}}{2}\\) = 308,750\r\n\r\n<em data-effect=\"italics\">Q<\/em><sub>3<\/sub> = \\(\\frac{\\text{639,000 + 659,000}}{2}\\) = 649,000\r\n<p id=\"element-290\"><em data-effect=\"italics\">IQR<\/em> = 649,000 \u2013 308,750 = 340,250<\/p>\r\n<p id=\"element-166\">(1.5)(<em data-effect=\"italics\">IQR<\/em>) = (1.5)(340,250) = 510,375<\/p>\r\n<p id=\"element-348\"><em data-effect=\"italics\">Q<\/em><sub>1<\/sub> \u2013 (1.5)(<em data-effect=\"italics\">IQR<\/em>) = 308,750 \u2013 510,375 = \u2013201,625<\/p>\r\n<p id=\"element-211\"><em data-effect=\"italics\">Q<\/em><sub>3<\/sub> + (1.5)(<em data-effect=\"italics\">IQR<\/em>) = 649,000 + 510,375 = 1,159,375<\/p>\r\n<p id=\"element-109\">No house price is less than \u2013201,625. However, 5,500,000 is more than 1,159,375. Therefore, 5,500,000 is a potential <span data-type=\"term\">outlier<\/span>.<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idp16250528\" class=\"statistics try\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\r\n<div data-type=\"title\">Try It<\/div>\r\n<div id=\"fs-idp63302352\" data-type=\"exercise\">\r\n<div id=\"fs-idm22548992\" data-type=\"problem\">\r\n<p id=\"fs-idp42507600\">For the following 11 salaries, calculate the <em data-effect=\"italics\">IQR<\/em> and determine if any salaries are outliers. The salaries are in dollars.<\/p>\r\n<p id=\"fs-idm25187088\"><span data-type=\"list\" data-list-type=\"labeled-item\" data-display=\"inline\"><span data-type=\"item\">\\$33,000\u00a0 \u00a0\\$<\/span><span data-type=\"item\">64,500\u00a0 \u00a0\\$<\/span><span data-type=\"item\">28,000\u00a0 \u00a0\\$<\/span><span data-type=\"item\">54,000\u00a0 \u00a0\\$<\/span><span data-type=\"item\">72,000\u00a0 \u00a0\\$<\/span><span data-type=\"item\">68,500\u00a0 \u00a0\\$<\/span><span data-type=\"item\">69,000\u00a0 \u00a0\\$<\/span><span data-type=\"item\">42,000\u00a0 \u00a0\\$<\/span><span data-type=\"item\">54,000\u00a0 \u00a0\\$<\/span><span data-type=\"item\">120,000\u00a0 \u00a0\\$<\/span><span data-type=\"item\">40,500<\/span><\/span><\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<div id=\"element-17\" class=\"textbox textbox--examples\" data-type=\"example\">\r\n<div data-type=\"exercise\">\r\n<div id=\"id45587381\" data-type=\"problem\">\r\n<p id=\"element-880\">For the two data sets in the <a href=\"#element-583\">test scores example<\/a>, find the following:<\/p>\r\n\r\n<ol type=\"a\" data-mark-suffix=\".\">\r\n \t<li>The interquartile range. Compare the two interquartile ranges.<\/li>\r\n \t<li>Any outliers in either set.<\/li>\r\n<\/ol>\r\n<\/div>\r\n<div id=\"fs-idm13740032\" data-type=\"solution\">\r\n<p id=\"fs-idp37987952\">The five number summary for the day and night classes is<\/p>\r\n\r\n<table id=\"fs-idp36487328\" summary=\"\">\r\n<thead>\r\n<tr>\r\n<th><\/th>\r\n<th>Minimum<\/th>\r\n<th><em data-effect=\"italics\">Q<\/em><sub>1<\/sub><\/th>\r\n<th>Median<\/th>\r\n<th><em data-effect=\"italics\">Q<\/em><sub>3<\/sub><\/th>\r\n<th>Maximum<\/th>\r\n<\/tr>\r\n<\/thead>\r\n<tbody>\r\n<tr>\r\n<td><strong data-effect=\"bold\">Day<\/strong><\/td>\r\n<td>32<\/td>\r\n<td>56<\/td>\r\n<td>74.5<\/td>\r\n<td>82.5<\/td>\r\n<td>99<\/td>\r\n<\/tr>\r\n<tr>\r\n<td><strong data-effect=\"bold\">Night<\/strong><\/td>\r\n<td>25.5<\/td>\r\n<td>78<\/td>\r\n<td>81<\/td>\r\n<td>89<\/td>\r\n<td>98<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<ol id=\"fs-idm23962720\" type=\"a\">\r\n \t<li>The IQR for the day group is <em data-effect=\"italics\">Q<\/em><sub>3<\/sub> \u2013 <em data-effect=\"italics\">Q<\/em><sub>1<\/sub> = 82.5 \u2013 56 = 26.5\r\n<p id=\"fs-idm7044352\">The IQR for the night group is <em data-effect=\"italics\">Q<\/em><sub>3<\/sub> \u2013 <em data-effect=\"italics\">Q<\/em><sub>1<\/sub> = 89 \u2013 78 = 11<\/p>\r\n<p id=\"fs-idp42547504\">The interquartile range (the spread or variability) for the day class is larger than the night class <em data-effect=\"italics\">IQR<\/em>. This suggests more variation will be found in the day class\u2019s class test scores.<\/p>\r\n<\/li>\r\n \t<li>Day class outliers are found using the IQR times 1.5 rule. So,\r\n<ul id=\"fs-idm52257968\" data-labeled-item=\"true\">\r\n \t<li><em data-effect=\"italics\">Q<\/em><sub>1<\/sub> - <em data-effect=\"italics\">IQR<\/em>(1.5) = 56 \u2013 26.5(1.5) = 16.25<\/li>\r\n \t<li><em data-effect=\"italics\">Q<\/em><sub>3<\/sub> + <em data-effect=\"italics\">IQR<\/em>(1.5) = 82.5 + 26.5(1.5) = 122.25<\/li>\r\n<\/ul>\r\n<p id=\"fs-idp38341744\">Since the minimum and maximum values for the day class are greater than 16.25 and less than 122.25, there are no outliers.<\/p>\r\n<p id=\"fs-idm23940160\">Night class outliers are calculated as:<\/p>\r\n\r\n<ul id=\"fs-idp29569184\" data-labeled-item=\"true\">\r\n \t<li><em data-effect=\"italics\">Q<\/em><sub>1<\/sub> \u2013 <em data-effect=\"italics\">IQR<\/em> (1.5) = 78 \u2013 11(1.5) = 61.5<\/li>\r\n \t<li><em data-effect=\"italics\">Q<\/em><sub>3<\/sub> + IQR(1.5) = 89 + 11(1.5) = 105.5<\/li>\r\n<\/ul>\r\n<p id=\"fs-idp5005056\">For this class, any test score less than 61.5 is an outlier. Therefore, the scores of 45 and 25.5 are outliers. Since no test score is greater than 105.5, there is no upper end outlier.<\/p>\r\n<\/li>\r\n<\/ol>\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idp58037360\" class=\"statistics try\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\r\n<div data-type=\"title\">Try It<\/div>\r\n<div id=\"fs-idp23368176\" data-type=\"exercise\">\r\n<div id=\"fs-idp23368304\" data-type=\"problem\">\r\n<p id=\"fs-idp5269648\">Find the interquartile range for the following two data sets and compare them.<\/p>\r\n<p id=\"fs-idp4060048\">Test Scores for Class <em data-effect=\"italics\">A<\/em> <span data-type=\"newline\">\r\n<\/span>69;\u00a0 96;\u00a0 81;\u00a0 79;\u00a0 65;\u00a0 76;\u00a0 83;\u00a0 99;\u00a0 89;\u00a0 67;\u00a0 90;\u00a0 77;\u00a0 85;\u00a0 98;\u00a0 66;\u00a0 91;\u00a0 77;\u00a0 69;\u00a0 80;\u00a0 94 <span data-type=\"newline\">\r\n<\/span>Test Scores for Class <em data-effect=\"italics\">B<\/em> <span data-type=\"newline\">\r\n<\/span>90;\u00a0 72;\u00a0 80;\u00a0 92;\u00a0 90;\u00a0 97;\u00a0 92;\u00a0 75;\u00a0 79;\u00a0 68;\u00a0 70;\u00a0 80;\u00a0 99;\u00a0 95;\u00a0 78;\u00a0 73;\u00a0 71;\u00a0 68;\u00a0 95;\u00a0 100<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<div id=\"element-84\" class=\"textbox textbox--examples\" data-type=\"example\">\r\n<p id=\"element-913\">Fifty statistics students were asked how much sleep they get per school night (rounded to the nearest hour). The results were:<\/p>\r\n\r\n<table id=\"id4431204\" summary=\"This table presents the amount of sleep per school night in hours in the first column, from 4-10 hours, frequency in the second column, relative frequency in the third column, and cumulative relative frequency in the fourth column.\">\r\n<thead>\r\n<tr>\r\n<th>AMOUNT OF SLEEP PER SCHOOL NIGHT (HOURS)<\/th>\r\n<th>FREQUENCY<\/th>\r\n<th>RELATIVE FREQUENCY<\/th>\r\n<th>CUMULATIVE RELATIVE FREQUENCY<\/th>\r\n<\/tr>\r\n<\/thead>\r\n<tbody>\r\n<tr>\r\n<td>4<\/td>\r\n<td>2<\/td>\r\n<td>0.04<\/td>\r\n<td>0.04<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>5<\/td>\r\n<td>5<\/td>\r\n<td>0.10<\/td>\r\n<td>0.14<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>6<\/td>\r\n<td>7<\/td>\r\n<td>0.14<\/td>\r\n<td>0.28<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>7<\/td>\r\n<td>12<\/td>\r\n<td>0.24<\/td>\r\n<td>0.52<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>8<\/td>\r\n<td>14<\/td>\r\n<td>0.28<\/td>\r\n<td>0.80<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>9<\/td>\r\n<td>7<\/td>\r\n<td>0.14<\/td>\r\n<td>0.94<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>10<\/td>\r\n<td>3<\/td>\r\n<td>0.06<\/td>\r\n<td>1.00<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<p id=\"element-688\"><strong>Find the 28<sup>th<\/sup> percentile<\/strong>. Notice the 0.28 in the \"cumulative relative frequency\" column. Twenty-eight percent of 50 data values is 14 values. There are 14 values less than the 28<sup>th<\/sup> percentile. They include the two 4s, the five 5s, and the seven 6s. The 28<sup>th<\/sup> percentile is between the last six and the first seven. <strong>The 28<sup>th<\/sup> percentile is 6.5.<\/strong><\/p>\r\n<p id=\"element-488\"><strong>Find the median<\/strong>. Look again at the \"cumulative relative frequency\" column and find 0.52. The median is the 50<sup>th<\/sup> percentile or the second quartile. 50% of 50 is 25. There are 25 values less than the median. They include the two 4s, the five 5s, the seven 6s, and eleven of the 7s. The median or 50<sup>th<\/sup> percentile is between the 25<sup>th<\/sup>, or seven, and 26<sup>th<\/sup>, or seven, values. <strong>The median is seven.<\/strong><\/p>\r\n<p id=\"element-539\"><strong>Find the third quartile<\/strong>. The third quartile is the same as the 75<sup>th<\/sup> percentile. You can \"eyeball\" this answer. If you look at the \"cumulative relative frequency\" column, you find 0.52 and 0.80. When you have all the fours, fives, sixes and sevens, you have 52% of the data. When you include all the 8s, you have 80% of the data. <strong>The 75<sup>th<\/sup> percentile, then, must be an eight<\/strong>. Another way to look at the problem is to find 75% of 50, which is 37.5, and round up to 38. The third quartile, <em data-effect=\"italics\">Q<\/em><sub>3<\/sub>, is the 38<sup>th<\/sup> value, which is an eight. You can check this answer by counting the values. (There are 37 values below the third quartile and 12 values above.)<\/p>\r\n\r\n<\/div>\r\n<div id=\"fs-idm52647472\" class=\"statistics try\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\r\n<div data-type=\"title\">Try it<\/div>\r\n<div id=\"fs-idm18606176\" data-type=\"exercise\">\r\n<div id=\"fs-idm21314496\" data-type=\"problem\">\r\n<p id=\"fs-idm44305856\">Forty bus drivers were asked how many hours they spend each day running their routes (rounded to the nearest hour). Find the 65<sup>th<\/sup> percentile.<\/p>\r\n\r\n<table id=\"fs-idm24649760\" summary=\"\">\r\n<thead>\r\n<tr>\r\n<th>Amount of time spent on route (hours)<\/th>\r\n<th>Frequency<\/th>\r\n<th>Relative Frequency<\/th>\r\n<th>Cumulative Relative Frequency<\/th>\r\n<\/tr>\r\n<\/thead>\r\n<tbody>\r\n<tr>\r\n<td>2<\/td>\r\n<td>12<\/td>\r\n<td>0.30<\/td>\r\n<td>0.30<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>3<\/td>\r\n<td>14<\/td>\r\n<td>0.35<\/td>\r\n<td>0.65<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>4<\/td>\r\n<td>10<\/td>\r\n<td>0.25<\/td>\r\n<td>0.90<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>5<\/td>\r\n<td>4<\/td>\r\n<td>0.10<\/td>\r\n<td>1.00<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<div id=\"element-572\" class=\"textbox textbox--examples\" data-type=\"example\">\r\n<div id=\"element-2353\" data-type=\"exercise\">\r\n<div id=\"id45288379\" data-type=\"problem\">\r\n<p id=\"element-23532\">Using <a class=\"autogenerated-content\" href=\"#id4431204\">(Figure)<\/a>:<\/p>\r\n\r\n<ol type=\"a\">\r\n \t<li>Find the 80<sup>th<\/sup> percentile.<\/li>\r\n \t<li>Find the 90<sup>th<\/sup> percentile.<\/li>\r\n \t<li>Find the first quartile. What is another name for the first quartile?<\/li>\r\n<\/ol>\r\n<\/div>\r\n<div id=\"fs-idp60869984\" data-type=\"solution\">\r\n<p id=\"fs-idp15042704\">Using the data from the frequency table, we have:<\/p>\r\n\r\n<ol id=\"fs-idm54301152\" type=\"a\">\r\n \t<li>The 80<sup>th<\/sup> percentile is between the last eight and the first nine in the table (between the 40<sup>th<\/sup> and 41<sup>st<\/sup> values). Therefore, we need to take the mean of the 40<sup>th<\/sup> an 41<sup>st<\/sup> values. The 80<sup>th<\/sup> percentile \\(=\\frac{8+9}{2}=8.5\\)<\/li>\r\n \t<li>The 90<sup>th<\/sup> percentile will be the 45<sup>th<\/sup> data value (location is 0.90(50) = 45) and the 45<sup>th<\/sup> data value is nine.<\/li>\r\n \t<li><em data-effect=\"italics\">Q<\/em><sub>1<\/sub> is also the 25<sup>th<\/sup> percentile. The 25<sup>th<\/sup> percentile location calculation: <em data-effect=\"italics\">P<\/em><sub>25<\/sub> = 0.25(50) = 12.5 \u2248 13 the 13<sup>th<\/sup> data value. Thus, the 25th percentile is six.<\/li>\r\n<\/ol>\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm56651440\" class=\"statistics try\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\r\n<div data-type=\"title\">Try It<\/div>\r\n<div id=\"fs-idp38065168\" data-type=\"exercise\">\r\n<div id=\"fs-idm27880528\" data-type=\"problem\">\r\n<p id=\"fs-idp54653312\">Refer to the <a class=\"autogenerated-content\" href=\"#fs-idm24649760\">(Figure)<\/a>. Find the third quartile. What is another name for the third quartile?<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm13393536\" class=\"statistics collab\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\r\n<div data-type=\"title\">Collaborative Statistics<\/div>\r\n<p id=\"element-758\">Your instructor or a member of the class will ask everyone in class how many sweaters they own. Answer the following questions:<\/p>\r\n\r\n<ol id=\"exlist\">\r\n \t<li>How many students were surveyed?<\/li>\r\n \t<li>What kind of sampling did you do?<\/li>\r\n \t<li>Construct two different histograms. For each, starting value = _____ ending value = ____.<\/li>\r\n \t<li>Find the median, first quartile, and third quartile.<\/li>\r\n \t<li>Construct a table of the data to find the following:\r\n<ol id=\"exlist2\" type=\"a\">\r\n \t<li>the 10<sup>th<\/sup> percentile<\/li>\r\n \t<li>the 70<sup>th<\/sup> percentile<\/li>\r\n \t<li>the percent of students who own less than four sweaters<\/li>\r\n<\/ol>\r\n<\/li>\r\n<\/ol>\r\n<\/div>\r\n<div id=\"fs-idm21580416\" class=\"bc-section section\" data-depth=\"1\">\r\n<h3 data-type=\"title\">A Formula for Finding the <em data-effect=\"italics\">k<\/em>th Percentile<\/h3>\r\n<p id=\"fs-idp1786064\">If you were to do a little research, you would find several formulas for calculating the <em data-effect=\"italics\">k<\/em><sup>th<\/sup> percentile. Here is one of them.<\/p>\r\n<p id=\"fs-idp3096416\"><em data-effect=\"italics\">k<\/em> = the <em data-effect=\"italics\">k<sup>th<\/sup><\/em> percentile. It may or may not be part of the data.<\/p>\r\n<p id=\"fs-idp1947472\"><em data-effect=\"italics\">i<\/em> = the index (ranking or position of a data value)<\/p>\r\n<p id=\"fs-idm946480\"><em data-effect=\"italics\">n<\/em> = the total number of data<\/p>\r\n\r\n<ul id=\"fs-idm9831088\">\r\n \t<li>Order the data from smallest to largest.<\/li>\r\n \t<li>Calculate \\(i=\\frac{k}{100}\\left(n+1\\right)\\)<\/li>\r\n \t<li>If <em data-effect=\"italics\">i<\/em> is an integer, then the <em data-effect=\"italics\">k<sup>th<\/sup><\/em> percentile is the data value in the <em data-effect=\"italics\">i<sup>th<\/sup><\/em> position in the ordered set of data.<\/li>\r\n \t<li>If <em data-effect=\"italics\">i<\/em> is not an integer, then round <em data-effect=\"italics\">i<\/em> up and round <em data-effect=\"italics\">i<\/em> down to the nearest integers. Average the two data values in these two positions in the ordered data set. This is easier to understand in an example.<\/li>\r\n<\/ul>\r\n<div id=\"fs-idm4569232\" class=\"textbox textbox--examples\" data-type=\"example\">\r\n<div id=\"fs-idm105708208\" data-type=\"exercise\">\r\n<div id=\"fs-idm3783968\" data-type=\"problem\">\r\n<p id=\"fs-idp1509664\">Listed are 29 ages for Academy Award winning best actors <em data-effect=\"italics\">in order from smallest to largest.<\/em> <span data-type=\"newline\">\r\n<\/span>18;\u00a0 21;\u00a0 22;\u00a0 25;\u00a0 26;\u00a0 27;\u00a0 29;\u00a0 30;\u00a0 31;\u00a0 33;\u00a0 36;\u00a0 37;\u00a0 41;\u00a0 42;\u00a0 47;\u00a0 52;\u00a0 55;\u00a0 57;\u00a0 58;\u00a0 62;\u00a0 64;\u00a0 67;\u00a0 69;\u00a0 71;\u00a0 72;\u00a0 73;\u00a0 74;\u00a0 76;\u00a0 77<\/p>\r\n\r\n<ol id=\"fs-idm1901040\" type=\"a\">\r\n \t<li>Find the 70<sup>th<\/sup> percentile.<\/li>\r\n \t<li>Find the 83<sup>rd<\/sup> percentile.<\/li>\r\n<\/ol>\r\n<\/div>\r\n<div id=\"fs-idp40713040\" data-type=\"solution\">\r\n<ol id=\"fs-idm62647008\" type=\"a\">\r\n \t<li>\r\n<ul id=\"fs-idp14170864\" data-labeled-item=\"true\">\r\n \t<li><em data-effect=\"italics\">k<\/em> = 70<\/li>\r\n \t<li><em data-effect=\"italics\">i<\/em> = the index<\/li>\r\n \t<li><em data-effect=\"italics\">n<\/em> = 29<\/li>\r\n<\/ul>\r\n<em data-effect=\"italics\">i<\/em> = \\(\\frac{k}{100}\\) (<em data-effect=\"italics\">n<\/em> + 1) = (\\(\\frac{70}{100}\\))(29 + 1) = 21. Twenty-one is an integer, and the data value in the 21<sup>st<\/sup> position in the ordered data set is 64. The 70<sup>th<\/sup> percentile is 64 years.<\/li>\r\n \t<li>\r\n<ul id=\"fs-idm21563168\" data-labeled-item=\"true\">\r\n \t<li><em data-effect=\"italics\">k<\/em> = 83<sup>rd<\/sup> percentile<\/li>\r\n \t<li><em data-effect=\"italics\">i<\/em> = the index<\/li>\r\n \t<li><em data-effect=\"italics\">n<\/em> = 29<\/li>\r\n<\/ul>\r\n<em data-effect=\"italics\">i<\/em> \u00a0= \\(\\frac{k}{100}\\) (<em data-effect=\"italics\">n<\/em> + 1) = (\\(\\frac{83}{100}\\))(29 + 1) = 24.9, which is NOT an integer.\r\n\r\nRound it down to 24 and up to 25. The age in the 24<sup>th<\/sup> position is 71 and the age in the 25<sup>th<\/sup> position is 72. Average 71 and 72. The 83<sup>rd<\/sup> percentile is 71.5 years.<\/li>\r\n<\/ol>\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm16529696\" class=\"statistics try\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\r\n<div data-type=\"title\">Try It<\/div>\r\n<div id=\"fs-idm3894192\" data-type=\"exercise\">\r\n<div id=\"fs-idp25866864\" data-type=\"problem\">\r\n<p id=\"fs-idp25866992\">Listed are 29 ages for Academy Award winning best actors <em data-effect=\"italics\">in order from smallest to largest.<\/em><\/p>\r\n<p id=\"fs-idm19734064\">18;\u00a0 21;\u00a0 22;\u00a0 25;\u00a0 26;\u00a0 27;\u00a0 29;\u00a0 30;\u00a0 31;\u00a0 33;\u00a0 36;\u00a0 37;\u00a0 41;\u00a0 42;\u00a0 47;\u00a0 52;\u00a0 55;\u00a0 57;\u00a0 58;\u00a0 62;\u00a0 64;\u00a0 67;\u00a0 69;\u00a0 71;\u00a0 72;\u00a0 73;\u00a0 74;\u00a0 76;\u00a0 77 <span data-type=\"newline\">\r\n<\/span>Calculate the 20<sup>th<\/sup> percentile and the 55<sup>th<\/sup> percentile.<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<div id=\"eip-404\" class=\"finger\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\r\n<div data-type=\"title\">NOTE<\/div>\r\n<p id=\"fs-idp26669920\">You can calculate percentiles using calculators and computers. There are a variety of online calculators.<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idp2972176\" class=\"bc-section section\" data-depth=\"1\">\r\n<h3 data-type=\"title\">A Formula for Finding the Percentile of a Value in a Data Set<\/h3>\r\n<ul id=\"fs-idm17756640\">\r\n \t<li>Order the data from smallest to largest.<\/li>\r\n \t<li><em data-effect=\"italics\">x<\/em> = the number of data values counting from the bottom of the data list up to but not including the data value for which you want to find the percentile.<\/li>\r\n \t<li><em data-effect=\"italics\">y<\/em> = the number of data values equal to the data value for which you want to find the percentile.<\/li>\r\n \t<li><em data-effect=\"italics\">n<\/em> = the total number of data.<\/li>\r\n \t<li>Calculate \\(\\frac{x+0.5y}{n}\\)(100). Then round to the nearest integer.<\/li>\r\n<\/ul>\r\n<div id=\"fs-idm3849664\" class=\"textbox textbox--examples\" data-type=\"example\">\r\n<div id=\"fs-idp28609648\" data-type=\"exercise\">\r\n<div id=\"fs-idp28609904\" data-type=\"problem\">\r\n<p id=\"fs-idp38890112\">Listed are 29 ages for Academy Award winning best actors <em data-effect=\"italics\">in order from smallest to largest.<\/em> <span data-type=\"newline\">\r\n<\/span>18;\u00a0 21;\u00a0 22;\u00a0 25;\u00a0 26;\u00a0 27;\u00a0 29;\u00a0 30;\u00a0 31;\u00a0 33;\u00a0 36;\u00a0 37;\u00a0 41;\u00a0 42;\u00a0 47;\u00a0 52;\u00a0 55;\u00a0 57;\u00a0 58;\u00a0 62;\u00a0 64;\u00a0 67;\u00a0 69;\u00a0 71;\u00a0 72;\u00a0 73;\u00a0 74;\u00a0 76;\u00a0 77<\/p>\r\n\r\n<ol id=\"fs-idm21168272\" type=\"a\">\r\n \t<li>Find the percentile for 58.<\/li>\r\n \t<li>Find the percentile for 25.<\/li>\r\n<\/ol>\r\n<\/div>\r\n<div id=\"fs-idm170490752\" data-type=\"solution\">\r\n<ol id=\"fs-idm170490496\" type=\"a\">\r\n \t<li>Counting from the bottom of the list, there are 18 data values less than 58. There is one value of 58.\r\n<p id=\"fs-idm3871584\"><em data-effect=\"italics\">x<\/em> = 18 and <em data-effect=\"italics\">y<\/em> = 1. \\(\\frac{x+0.5y}{n}\\)(100) = \\(\\frac{18+0.5\\left(1\\right)}{29}\\)(100) = 63.80. 58 is the 64<sup>th<\/sup> percentile.<\/p>\r\n<\/li>\r\n \t<li>Counting from the bottom of the list, there are three data values less than 25. There is one value of 25.\r\n<p id=\"fs-idm21523472\"><em data-effect=\"italics\">x<\/em> = 3 and <em data-effect=\"italics\">y<\/em> = 1. \\(\\frac{x+0.5y}{n}\\)(100) = \\(\\frac{3+0.5\\left(1\\right)}{29}\\)(100) = 12.07. Twenty-five is the 12<sup>th<\/sup> percentile.<\/p>\r\n<\/li>\r\n<\/ol>\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm170943360\" class=\"statistics try\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\r\n<div data-type=\"title\">Try It<\/div>\r\n<div id=\"fs-idm170942864\" data-type=\"exercise\">\r\n<div id=\"fs-idp35294448\" data-type=\"problem\">\r\n<p id=\"fs-idp35294576\">Listed are 30 ages for Academy Award winning best actors <u data-effect=\"underline\">in order from smallest to largest.<\/u><\/p>\r\n<p id=\"fs-idp13252768\">18;\u00a0 21;\u00a0 22;\u00a0 25;\u00a0 26;\u00a0 27;\u00a0 29;\u00a0 30;\u00a0 31;\u00a0 31;\u00a0 33;\u00a0 36;\u00a0 37;\u00a0 41;\u00a0 42;\u00a0 47;\u00a0 52;\u00a0 55;\u00a0 57;\u00a0 58;\u00a0 62;\u00a0 64;\u00a0 67;\u00a0 69;\u00a0 71;\u00a0 72;\u00a0 73;\u00a0 74;\u00a0 76;\u00a0 77 <span data-type=\"newline\">\r\n<\/span>Find the percentiles for 47 and 31.<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idp45793312\" class=\"bc-section section\" data-depth=\"1\">\r\n<h3 data-type=\"title\">Interpreting Percentiles, Quartiles, and Median<\/h3>\r\n<p id=\"eip-400\">A percentile indicates the relative standing of a data value when data are sorted into numerical order from smallest to largest. Percentages of data values are less than or equal to the pth percentile. For example, 15% of data values are less than or equal to the 15<sup>th<\/sup> percentile.<\/p>\r\n\r\n<ul id=\"eip-id1164310609380\" data-bullet-style=\"bullet\">\r\n \t<li>Low percentiles always correspond to lower data values.<\/li>\r\n \t<li>High percentiles always correspond to higher data values.<\/li>\r\n<\/ul>\r\n<p id=\"fs-idp44902944\">A percentile may or may not correspond to a value judgment about whether it is \"good\" or \"bad.\" The interpretation of whether a certain percentile is \"good\" or \"bad\" depends on the context of the situation to which the data applies. In some situations, a low percentile would be considered \"good;\" in other contexts a high percentile might be considered \"good\". In many situations, there is no value judgment that applies.<\/p>\r\n<p id=\"fs-idm23920480\">Understanding how to interpret percentiles properly is important not only when describing data, but also when calculating probabilities in later chapters of this text.<\/p>\r\n\r\n<\/div>\r\n<div id=\"fs-idm106923680\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\r\n<div data-type=\"title\">NOTE<\/div>\r\n<p id=\"fs-idm20251376\">When writing the interpretation of a percentile in the context of the given data, the sentence should contain the following information.<\/p>\r\n\r\n<ul id=\"eip-id1168197264788\">\r\n \t<li>information about the context of the situation being considered<\/li>\r\n \t<li>the data value (value of the variable) that represents the percentile<\/li>\r\n \t<li>the percent of individuals or items with data values below the percentile<\/li>\r\n \t<li>the percent of individuals or items with data values above the percentile.<\/li>\r\n<\/ul>\r\n<\/div>\r\n<div id=\"eip-id1170215995305\" class=\"textbox textbox--examples\" data-type=\"example\">\r\n<div id=\"fs-idm91768592\" data-type=\"exercise\">\r\n<div id=\"fs-idm91768464\" data-type=\"problem\">\r\n<p id=\"eip-id1170184310084\">On a timed math test, the first quartile for time it took to finish the exam was 35 minutes. Interpret the first quartile in the context of this situation.<\/p>\r\n\r\n<\/div>\r\n<div id=\"fs-idm53128368\" data-type=\"solution\">\r\n<ul id=\"eip-id1170179452695\">\r\n \t<li>Twenty-five percent of students finished the exam in 35 minutes or less.<\/li>\r\n \t<li>Seventy-five percent of students finished the exam in 35 minutes or more.<\/li>\r\n \t<li>A low percentile could be considered good, as finishing more quickly on a timed exam is desirable. (If you take too long, you might not be able to finish.)<\/li>\r\n<\/ul>\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idp16945648\" class=\"statistics try\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\r\n<div data-type=\"title\">Try It<\/div>\r\n<div id=\"fs-idp33388848\" data-type=\"exercise\">\r\n<div id=\"fs-idm41955616\" data-type=\"problem\">\r\n<p id=\"fs-idp20699248\">For the 100-meter dash, the third quartile for times for finishing the race was 11.5 seconds. Interpret the third quartile in the context of the situation.<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<div id=\"eip-id1170441826663\" class=\"textbox textbox--examples\" data-type=\"example\">\r\n<div id=\"fs-idm148596320\" data-type=\"exercise\">\r\n<div id=\"fs-idm170402432\" data-type=\"problem\">\r\n<p id=\"eip-id1170436117670\">On a 20 question math test, the 70<sup>th<\/sup> percentile for number of correct answers was 16. Interpret the 70<sup>th<\/sup> percentile in the context of this situation.<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idp77029680\" class=\"statistics try\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\r\n<div data-type=\"title\">Try It<\/div>\r\n<div id=\"fs-idp34034288\" data-type=\"exercise\">\r\n<div id=\"fs-idp48692144\" data-type=\"problem\">\r\n<p id=\"fs-idp55037680\">On a 60 point written assignment, the 80<sup>th<\/sup> percentile for the number of points earned was 49. Interpret the 80<sup>th<\/sup> percentile in the context of this situation.<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<div id=\"eip-id7060500\" class=\"textbox textbox--examples\" data-type=\"example\">\r\n<div id=\"fs-idm205091056\" data-type=\"exercise\">\r\n<div id=\"fs-idm15124096\" data-type=\"problem\">\r\n<p id=\"eip-id1170610063171\">At a community college, it was found that the 30<sup>th<\/sup> percentile of credit units that students are enrolled for is seven units. Interpret the 30<sup>th<\/sup> percentile in the context of this situation.<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idp80590208\" class=\"statistics try\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\r\n<div data-type=\"title\">Try It<\/div>\r\n<div id=\"fs-idp73731328\" data-type=\"exercise\">\r\n<div id=\"fs-idp42792528\" data-type=\"problem\">\r\n<p id=\"fs-idm23433888\">During a season, the 40<sup>th<\/sup> percentile for points scored per player in a game is eight. Interpret the 40<sup>th<\/sup> percentile in the context of this situation.<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idp9603904\" class=\"textbox textbox--examples\" data-type=\"example\">\r\n<p id=\"fs-idp45664304\">Sharpe Middle School is applying for a grant that will be used to add fitness equipment to the gym. The principal surveyed 15 anonymous students to determine how many minutes a day the students spend exercising. The results from the 15 anonymous students are shown.<\/p>\r\n<p id=\"fs-idp39768656\">0 minutes; 40 minutes; 60 minutes; 30 minutes; 60 minutes<\/p>\r\n<p id=\"fs-idm13969776\">10 minutes; 45 minutes; 30 minutes; 300 minutes; 90 minutes;<\/p>\r\n<p id=\"fs-idp22597008\">30 minutes; 120 minutes; 60 minutes; 0 minutes; 20 minutes<\/p>\r\n<p id=\"fs-idp53167440\">Determine the following five values.<\/p>\r\n\r\n<ul id=\"fs-idp70490496\" data-labeled-item=\"true\">\r\n \t<li>Min = 0<\/li>\r\n \t<li><em data-effect=\"italics\">Q<\/em><sub>1<\/sub> = 20<\/li>\r\n \t<li>Med = 40<\/li>\r\n \t<li><em data-effect=\"italics\">Q<\/em><sub>3<\/sub> = 60<\/li>\r\n \t<li>Max = 300<\/li>\r\n<\/ul>\r\n<p id=\"fs-idp83565376\">If you were the principal, would you be justified in purchasing new fitness equipment? Since 75% of the students exercise for 60 minutes or less daily, and since the <em data-effect=\"italics\">IQR<\/em> is 40 minutes (60 \u2013 20 = 40), we know that half of the students surveyed exercise between 20 minutes and 60 minutes daily. This seems a reasonable amount of time spent exercising, so the principal would be justified in purchasing the new equipment.<\/p>\r\n<p id=\"fs-idm77236544\">However, the principal needs to be careful. The value 300 appears to be a potential outlier.<\/p>\r\n<p id=\"fs-idm9669376\"><em data-effect=\"italics\">Q<\/em><sub>3<\/sub> + 1.5(<em data-effect=\"italics\">IQR<\/em>) = 60 + (1.5)(40) = 120.<\/p>\r\n<p id=\"fs-idp13270336\">The value 300 is greater than 120 so it is a potential outlier. If we delete it and calculate the five values, we get the following values:<\/p>\r\n\r\n<ul id=\"fs-idm2894688\" data-labeled-item=\"true\">\r\n \t<li>Min = 0<\/li>\r\n \t<li><em data-effect=\"italics\">Q<\/em><sub>1<\/sub> = 20<\/li>\r\n \t<li><em data-effect=\"italics\">Q<\/em><sub>3<\/sub> = 60<\/li>\r\n \t<li>Max = 120<\/li>\r\n<\/ul>\r\n<p id=\"fs-idm6660656\">We still have 75% of the students exercising for 60 minutes or less daily and half of the students exercising between 20 and 60 minutes a day. However, 15 students is a small sample and the principal should survey more students to be sure of his survey results.<\/p>\r\n\r\n<\/div>\r\n<div id=\"fs-idm63224784\" class=\"footnotes\" data-depth=\"1\">\r\n<h3 data-type=\"title\">References<\/h3>\r\n<p id=\"fs-idm63224288\">Cauchon, Dennis, Paul Overberg. \u201cCensus data shows minorities now a majority of U.S. births.\u201d USA Today, 2012. Available online at http:\/\/usatoday30.usatoday.com\/news\/nation\/story\/2012-05-17\/minority-birthscensus\/55029100\/1 (accessed April 3, 2013).<\/p>\r\n<p id=\"fs-idm76887104\">Data from the United States Department of Commerce: United States Census Bureau. Available online at http:\/\/www.census.gov\/ (accessed April 3, 2013).<\/p>\r\n<p id=\"fs-idm76886560\">\u201c1990 Census.\u201d United States Department of Commerce: United States Census Bureau. Available online at http:\/\/www.census.gov\/main\/www\/cen1990.html (accessed April 3, 2013).<\/p>\r\n<p id=\"fs-idm76885984\">Data from <em data-effect=\"italics\">San Jose Mercury News<\/em>.<\/p>\r\n<p id=\"fs-idm76885600\">Data from <em data-effect=\"italics\">Time Magazine<\/em>; survey by Yankelovich Partners, Inc.<\/p>\r\n\r\n<\/div>\r\n<div id=\"fs-idm13790128\" class=\"summary\" data-depth=\"1\">\r\n<h3 data-type=\"title\">Chapter Review<\/h3>\r\n<p id=\"fs-idp1397504\">The values that divide a rank-ordered set of data into 100 equal parts are called percentiles. Percentiles are used to compare and interpret data. For example, an observation at the 50<sup>th<\/sup> percentile would be greater than 50 percent of the other obeservations in the set. Quartiles divide data into quarters. The first quartile (<em data-effect=\"italics\">Q<\/em><sub>1<\/sub>) is the 25<sup>th<\/sup> percentile,the second quartile (<em data-effect=\"italics\">Q<\/em><sub>2<\/sub> or median) is 50<sup>th<\/sup> percentile, and the third quartile (<em data-effect=\"italics\">Q<\/em><sub>3<\/sub>) is the the 75<sup>th<\/sup> percentile. The interquartile range, or <em data-effect=\"italics\">IQR<\/em>, is the range of the middle 50 percent of the data values. The <em data-effect=\"italics\">IQR<\/em> is found by subtracting <em data-effect=\"italics\">Q<\/em><sub>1<\/sub> from <em data-effect=\"italics\">Q<\/em><sub>3<\/sub>, and can help determine outliers by using the following two expressions.<\/p>\r\n\r\n<ul id=\"fs-idp12766560\">\r\n \t<li><em data-effect=\"italics\">Q<\/em><sub>3<\/sub> + <em data-effect=\"italics\">IQR<\/em>(1.5)<\/li>\r\n \t<li><em data-effect=\"italics\">Q<\/em><sub>1<\/sub> \u2013 <em data-effect=\"italics\">IQR<\/em>(1.5)<\/li>\r\n<\/ul>\r\n<\/div>\r\n<div id=\"fs-idm202752\" class=\"formula-review\" data-depth=\"1\">\r\n<h3 data-type=\"title\">Formula Review<\/h3>\r\n<p id=\"fs-idp5126816\">\\(i=\\left(\\frac{k}{100}\\right)\\left(n+1\\right)\\)<\/p>\r\n<p id=\"fs-idp706784\">where <em data-effect=\"italics\">i<\/em> = the ranking or position of a data value,<\/p>\r\n<p id=\"fs-idm55046864\"><em data-effect=\"italics\">k<\/em> = the kth percentile,<\/p>\r\n<p id=\"fs-idm6916704\"><em data-effect=\"italics\">n<\/em> = total number of data.<\/p>\r\n<p id=\"fs-idp294352\">Expression for finding the percentile of a data value: \\(\\left(\\frac{x\\text{ + }0.5y}{n}\\right)\\)(100)<\/p>\r\n<p id=\"fs-idp17176000\">where <em data-effect=\"italics\">x<\/em> = the number of values counting from the bottom of the data list up to but not including the data value for which you want to find the percentile,<\/p>\r\n<p id=\"fs-idm2884704\"><em data-effect=\"italics\">y<\/em> = the number of data values equal to the data value for which you want to find the percentile,<\/p>\r\n<p id=\"fs-idm5691392\"><em data-effect=\"italics\">n<\/em> = total number of data<\/p>\r\n\r\n<\/div>\r\n<div id=\"fs-idp40431760\" class=\"practice\" data-depth=\"1\">\r\n<div id=\"fs-idm1110784\" data-type=\"exercise\">\r\n<div id=\"fs-idm38839376\" data-type=\"problem\">\r\n<p id=\"fs-idm38839120\">Listed are 29 ages for Academy Award winning best actors <em data-effect=\"italics\">in order from smallest to largest.<\/em><\/p>\r\n<p id=\"fs-idm6939584\">18;\u00a0 21;\u00a0 22;\u00a0 25;\u00a0 26;\u00a0 27;\u00a0 29;\u00a0 30;\u00a0 31;\u00a0 33;\u00a0 36;\u00a0 37;\u00a0 41;\u00a0 42;\u00a0 47;\u00a0 52;\u00a0 55;\u00a0 57;\u00a0 58;\u00a0 62;\u00a0 64;\u00a0 67;\u00a0 69;\u00a0 71;\u00a0 72;\u00a0 73;\u00a0 74;\u00a0 76;\u00a0 77<\/p>\r\n\r\n<ol id=\"fs-idp12728784\" type=\"a\">\r\n \t<li>Find the 40<sup>th<\/sup> percentile.<\/li>\r\n \t<li>Find the 78<sup>th<\/sup> percentile.<\/li>\r\n<\/ol>\r\n<\/div>\r\n<div id=\"fs-idm60642032\" data-type=\"solution\">\r\n<ol id=\"fs-idp34749472\" type=\"a\">\r\n \t<li>The 40<sup>th<\/sup> percentile is 37 years.<\/li>\r\n \t<li>The 78<sup>th<\/sup> percentile is 70 years.<\/li>\r\n<\/ol>\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm4719584\" data-type=\"exercise\">\r\n<div id=\"fs-idm4719328\" data-type=\"problem\">\r\n<p id=\"fs-idp14289472\">Listed are 32 ages for Academy Award winning best actors <em data-effect=\"italics\">in order from smallest to largest.<\/em><\/p>\r\n<p id=\"fs-idm62636912\">18;\u00a0 18;\u00a0 21;\u00a0 22;\u00a0 25;\u00a0 26;\u00a0 27;\u00a0 29;\u00a0 30;\u00a0 31;\u00a0 31;\u00a0 33;\u00a0 36;\u00a0 37;\u00a0 37;\u00a0 41;\u00a0 42;\u00a0 47;\u00a0 52;\u00a0 55;\u00a0 57;\u00a0 58;\u00a0 62;\u00a0 64;\u00a0 67;\u00a0 69;\u00a0 71;\u00a0 72;\u00a0 73;\u00a0 74;\u00a0 76;\u00a0 77<\/p>\r\n\r\n<ol id=\"fs-idm82651968\" type=\"a\">\r\n \t<li>Find the percentile of 37.<\/li>\r\n \t<li>Find the percentile of 72.<\/li>\r\n<\/ol>\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idp30887728\" data-type=\"exercise\">\r\n<div id=\"fs-idp30887984\" data-type=\"problem\">\r\n<p id=\"fs-idp30888240\">Jesse was ranked 37<sup>th<\/sup> in his graduating class of 180 students. At what percentile is Jesse\u2019s ranking?<\/p>\r\n\r\n<\/div>\r\n<div id=\"fs-idm44160976\" data-type=\"solution\">\r\n<p id=\"fs-idm44160720\">Jesse graduated 37<sup>th<\/sup> out of a class of 180 students. There are 180 \u2013 37 = 143 students ranked below Jesse. There is one rank of 37.<\/p>\r\n<p id=\"fs-idm80018544\"><em data-effect=\"italics\">x<\/em> = 143 and <em data-effect=\"italics\">y<\/em> = 1. \\(\\frac{x+0.5y}{n}\\)(100) = \\(\\frac{143+0.5\\left(1\\right)}{180}\\)(100) = 79.72. Jesse\u2019s rank of 37 puts him at the 80<sup>th<\/sup> percentile.<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"eip-id1168182229232\" data-type=\"exercise\">\r\n<div id=\"eip-id1168183555687\" data-type=\"problem\">\r\n<ol id=\"eip-id1168185190290\" type=\"a\" data-mark-suffix=\".\">\r\n \t<li>For runners in a race, a low time means a faster run. The winners in a race have the shortest running times. Is it more desirable to have a finish time with a high or a low percentile when running a race?<\/li>\r\n \t<li>The 20<sup>th<\/sup> percentile of run times in a particular race is 5.2 minutes. Write a sentence interpreting the 20<sup>th<\/sup> percentile in the context of the situation.<\/li>\r\n \t<li>A bicyclist in the 90<sup>th<\/sup> percentile of a bicycle race completed the race in 1 hour and 12 minutes. Is he among the fastest or slowest cyclists in the race? Write a sentence interpreting the 90<sup>th<\/sup> percentile in the context of the situation.<\/li>\r\n<\/ol>\r\n<\/div>\r\n<\/div>\r\n<div id=\"eip-id1168182273864\" data-type=\"exercise\">\r\n<div id=\"eip-id1168191796049\" data-type=\"problem\">\r\n<ol id=\"eip-id5724192\" type=\"a\" data-mark-suffix=\".\">\r\n \t<li>For runners in a race, a higher speed means a faster run. Is it more desirable to have a speed with a high or a low percentile when running a race?<\/li>\r\n \t<li>The 40<sup>th<\/sup> percentile of speeds in a particular race is 7.5 miles per hour. Write a sentence interpreting the 40<sup>th<\/sup> percentile in the context of the situation.<\/li>\r\n<\/ol>\r\n<\/div>\r\n<div id=\"eip-id1168199883378\" data-type=\"solution\">\r\n<ol id=\"eip-id1168196369910\" type=\"a\" data-mark-suffix=\".\">\r\n \t<li>For runners in a race it is more desirable to have a high percentile for speed. A high percentile means a higher speed which is faster.<\/li>\r\n \t<li>40% of runners ran at speeds of 7.5 miles per hour or less (slower). 60% of runners ran at speeds of 7.5 miles per hour or more (faster).<\/li>\r\n<\/ol>\r\n<\/div>\r\n<\/div>\r\n<div id=\"eip-id1168217995987\" data-type=\"exercise\">\r\n<div id=\"eip-id1168183864592\" data-type=\"problem\">\r\n<p id=\"eip-id1168226425380\">On an exam, would it be more desirable to earn a grade with a high or low percentile? Explain.<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"eip-id1168183173702\" data-type=\"exercise\">\r\n<div id=\"eip-id1168227404691\" data-type=\"problem\">\r\n<p id=\"eip-id1168230025239\">Mina is waiting in line at the Department of Motor Vehicles (DMV). Her wait time of 32 minutes is the 85<sup>th<\/sup> percentile of wait times. Is that good or bad? Write a sentence interpreting the 85<sup>th<\/sup> percentile in the context of this situation.<\/p>\r\n\r\n<\/div>\r\n<div id=\"eip-id7704128\" data-type=\"solution\">\r\n<p id=\"eip-id1168214950316\">When waiting in line at the DMV, the 85<sup>th<\/sup> percentile would be a long wait time compared to the other people waiting. 85% of people had shorter wait times than Mina. In this context, Mina would prefer a wait time corresponding to a lower percentile. 85% of people at the DMV waited 32 minutes or less. 15% of people at the DMV waited 32 minutes or longer.<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"eip-id1168213546999\" data-type=\"exercise\">\r\n<div id=\"eip-id1168188876815\" data-type=\"problem\">\r\n<p id=\"eip-id7349223\">In a survey collecting data about the salaries earned by recent college graduates, Li found that her salary was in the 78<sup>th<\/sup> percentile. Should Li be pleased or upset by this result? Explain.<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"eip-id1168214876383\" data-type=\"exercise\">\r\n<div id=\"eip-id7327842\" data-type=\"problem\">\r\n<p id=\"eip-id1168230657040\">In a study collecting data about the repair costs of damage to automobiles in a certain type of crash tests, a certain model of car had \\$1,700 in damage and was in the 90<sup>th<\/sup> percentile. Should the manufacturer and the consumer be pleased or upset by this result? Explain and write a sentence that interprets the 90<sup>th<\/sup> percentile in the context of this problem.<\/p>\r\n\r\n<\/div>\r\n<div id=\"eip-id1168214946038\" data-type=\"solution\">\r\n<p id=\"eip-id1168234799988\">The manufacturer and the consumer would be upset. This is a large repair cost for the damages, compared to the other cars in the sample. INTERPRETATION: 90% of the crash tested cars had damage repair costs of \\$1700 or less; only 10% had damage repair costs of \\$1700 or more.<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"eip-id1168195852900\" data-type=\"exercise\">\r\n<div id=\"eip-id1168225195383\" data-type=\"problem\">\r\n<p id=\"eip-idm9549040\">The University of California has two criteria used to set admission standards for freshman to be admitted to a college in the UC system:<\/p>\r\n\r\n<ol id=\"eip-id1168211096380\" type=\"a\" data-mark-suffix=\"\">\r\n \t<li>Students' GPAs and scores on standardized tests (SATs and ACTs) are entered into a formula that calculates an \"admissions index\" score. The admissions index score is used to set eligibility standards intended to meet the goal of admitting the top 12% of high school students in the state. In this context, what percentile does the top 12% represent?<\/li>\r\n \t<li>Students whose GPAs are at or above the 96<sup>th<\/sup> percentile of all students at their high school are eligible (called eligible in the local context), even if they are not in the top 12% of all students in the state. What percentage of students from each high school are \"eligible in the local context\"?<\/li>\r\n<\/ol>\r\n<\/div>\r\n<\/div>\r\n<div id=\"eip-id1168223160542\" data-type=\"exercise\">\r\n<div id=\"eip-id7507305\" data-type=\"problem\">\r\n<p id=\"eip-id1168211272126\">Suppose that you are buying a house. You and your realtor have determined that the most expensive house you can afford is the 34<sup>th<\/sup> percentile. The 34<sup>th<\/sup> percentile of housing prices is \\$240,000 in the town you want to move to. In this town, can you afford 34% of the houses or 66% of the houses?<\/p>\r\n\r\n<\/div>\r\n<div id=\"eip-id1168213876148\" data-type=\"solution\">\r\n<p id=\"eip-id1168225765198\">You can afford 34% of houses. 66% of the houses are too expensive for your budget. INTERPRETATION: 34% of houses cost \\$240,000 or less. 66% of houses cost \\$240,000 or more.<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<p id=\"element-726\">Use the following information to answer the next six exercises. Sixty-five randomly selected car salespersons were asked the number of cars they generally sell in one week. Fourteen people answered that they generally sell three cars; nineteen generally sell four cars; twelve generally sell five cars; nine generally sell six cars; eleven generally sell seven cars.<\/p>\r\n\r\n<div id=\"exercisenine\" data-type=\"exercise\">\r\n<div id=\"id21439538\" data-type=\"problem\">\r\n\r\nFirst quartile = _______\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"exerciseten\" data-type=\"exercise\">\r\n<div id=\"id4433542\" data-type=\"problem\">\r\n\r\nSecond quartile = median = 50<sup>th<\/sup> percentile = _______\r\n\r\n<\/div>\r\n<div id=\"id12404344\" data-type=\"solution\">\r\n<p id=\"element-23635\">4<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"exerciseeleven\" data-type=\"exercise\">\r\n<div id=\"id21413333\" data-type=\"problem\">\r\n\r\nThird quartile = _______\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"exercisetwelve\" data-type=\"exercise\">\r\n<div id=\"id13392439\" data-type=\"problem\">\r\n\r\nInterquartile range (<em data-effect=\"italics\">IQR<\/em>) = _____ \u2013 _____ = _____\r\n\r\n<\/div>\r\n<div id=\"id10710871\" data-type=\"solution\">\r\n<p id=\"element-23646\">6 \u2013 4 = 2<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"exercisethirteen\" data-type=\"exercise\">\r\n<div id=\"id14610610\" data-type=\"problem\">\r\n<p id=\"prob_13\">10<sup>th<\/sup> percentile = _______<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"exercisefourteen\" data-type=\"exercise\">\r\n<div id=\"id21409553\" data-type=\"problem\">\r\n<p id=\"prob_14\">70<sup>th<\/sup> percentile = _______<\/p>\r\n\r\n<\/div>\r\n<div id=\"id23430727\" data-type=\"solution\">\r\n<p id=\"element-234636\">6<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm1839472\" class=\"free-response\" data-depth=\"1\">\r\n<h3 data-type=\"title\">Homework<\/h3>\r\n<div id=\"element-927\" data-type=\"exercise\">\r\n<div id=\"id3483376\" data-type=\"problem\">\r\n<p id=\"element-746\">1)\u00a0 Six hundred adult Americans were asked by telephone poll, \"What do you think constitutes a middle-class income?\" The results are in <a class=\"autogenerated-content\" href=\"#element-588\">(Figure)<\/a>. Also, include left endpoint, but not the right endpoint.<\/p>\r\n\r\n<table id=\"element-588\" summary=\"This table presents the results from a poll on what Americans thought constituted middle class. The first column lists the salary and the second column lists the relative frequency. There are 8 rows.\">\r\n<thead>\r\n<tr>\r\n<th>Salary (\\$)<\/th>\r\n<th>Relative Frequency<\/th>\r\n<\/tr>\r\n<\/thead>\r\n<tbody>\r\n<tr>\r\n<td>&lt; 20,000<\/td>\r\n<td>0.02<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>20,000\u201325,000<\/td>\r\n<td>0.09<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>25,000\u201330,000<\/td>\r\n<td>0.19<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>30,000\u201340,000<\/td>\r\n<td>0.26<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>40,000\u201350,000<\/td>\r\n<td>0.18<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>50,000\u201375,000<\/td>\r\n<td>0.17<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>75,000\u201399,999<\/td>\r\n<td>0.02<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>100,000+<\/td>\r\n<td>0.01<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<ol id=\"element-295\" type=\"a\">\r\n \t<li>What percentage of the survey answered \"not sure\"?<\/li>\r\n \t<li>What percentage think that middle-class is from \\$25,000 to \\$50,000?<\/li>\r\n \t<li>Construct a histogram of the data.\r\n<ol id=\"nestlist3\" type=\"i\" data-mark-suffix=\".\">\r\n \t<li>Should all bars have the same width, based on the data? Why or why not?<\/li>\r\n \t<li>How should the &lt;20,000 and the 100,000+ intervals be handled? Why?<\/li>\r\n<\/ol>\r\n<\/li>\r\n \t<li>Find the 40<sup>th<\/sup> and 80<sup>th<\/sup> percentiles<\/li>\r\n \t<li>Construct a bar graph of the data<\/li>\r\n<\/ol>\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n&nbsp;\r\n\r\n&nbsp;\r\n<div id=\"fs-idm1839472\" class=\"free-response\" data-depth=\"1\">\r\n<div id=\"fs-idp35930608\" data-type=\"exercise\">\r\n<div id=\"id3500598\" data-type=\"problem\">\r\n\r\n2) Given the following box plot:\r\n<div id=\"fs-idm476896\" class=\"bc-figure figure\"><span id=\"id4775287\" data-type=\"media\" data-alt=\"This is a horizontal boxplot graphed over a number line from 0 to 13. The first whisker extends from the smallest value, 0, to the first quartile, 2. The box begins at the first quartile and extends to third quartile, 12. A vertical, dashed line is drawn at median, 10. The second whisker extends from the third quartile to largest value, 13.\"><img src=\"https:\/\/pressbooks.ccconline.org\/acccomposition1\/wp-content\/uploads\/sites\/83\/2022\/05\/fig-ch02_13_02-1.jpg\" alt=\"This is a horizontal boxplot graphed over a number line from 0 to 13. The first whisker extends from the smallest value, 0, to the first quartile, 2. The box begins at the first quartile and extends to third quartile, 12. A vertical, dashed line is drawn at median, 10. The second whisker extends from the third quartile to largest value, 13.\" width=\"400\" data-media-type=\"image\/jpg\" \/><\/span><\/div>\r\n<ol id=\"element-328\" type=\"a\">\r\n \t<li>which quarter has the smallest spread of data? What is that spread?<\/li>\r\n \t<li>which quarter has the largest spread of data? What is that spread?<\/li>\r\n \t<li>find the interquartile range (<em data-effect=\"italics\">IQR<\/em>).<\/li>\r\n \t<li>are there more data in the interval 5\u201310 or in the interval 10\u201313? How do you know this?<\/li>\r\n \t<li>which interval has the fewest data in it? How do you know this?\r\n<ol id=\"nestlist7\" type=\"i\" data-mark-suffix=\".\">\r\n \t<li>0\u20132<\/li>\r\n \t<li>2\u20134<\/li>\r\n \t<li>10\u201312<\/li>\r\n \t<li>12\u201313<\/li>\r\n \t<li>need more information<\/li>\r\n<\/ol>\r\n<\/li>\r\n<\/ol>\r\n<\/div>\r\n<\/div>\r\n<div id=\"element-284\" data-type=\"exercise\">\r\n<div id=\"id3912087\" data-type=\"problem\">\r\n\r\n&nbsp;\r\n<p id=\"element-874\">3) The following box plot shows the U.S. population for 1990, the latest available year.<\/p>\r\n\r\n<div id=\"fs-idm132205520\" class=\"bc-figure figure\"><span id=\"id7587202\" data-type=\"media\" data-alt=\"A box plot with values from 0 to 105, with Q1 at 17, M at 33, and Q3 at 50.\"><img src=\"https:\/\/pressbooks.ccconline.org\/acccomposition1\/wp-content\/uploads\/sites\/83\/2022\/08\/fig-ch02_13_08-1.jpg\" alt=\"A box plot with values from 0 to 105, with Q1 at 17, M at 33, and Q3 at 50.\" width=\"400\" data-media-type=\"image\/jpg\" \/><\/span><\/div>\r\n<ol type=\"a\">\r\n \t<li>Are there fewer or more children (age 17 and under) than senior citizens (age 65 and over)? How do you know?<\/li>\r\n \t<li>12.6% are age 65 and over. Approximately what percentage of the population are working age adults (above age 17 to age 65)?<\/li>\r\n<\/ol>\r\n&nbsp;\r\n\r\n<\/div>\r\n<div id=\"id7597969\" data-type=\"solution\">\r\n<p id=\"fs-idm5042080\">The median age for U.S. blacks currently is 30.9 years; for U.S. whites it is 42.3 years.<\/p>\r\n\r\n<ol id=\"fs-idm33581296\" type=\"a\">\r\n \t<li>Based upon this information, give two reasons why the black median age could be lower than the white median age.<\/li>\r\n \t<li>Does the lower median age for blacks necessarily mean that blacks die younger than whites? Why or why not?<\/li>\r\n \t<li>How might it be possible for blacks and whites to die at approximately the same age, but for the median age for whites to be higher?<\/li>\r\n<\/ol>\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\nAnswers to odd questions\r\n\r\n1)\r\n<ol id=\"element-295a\" type=\"a\">\r\n \t<li>1 \u2013 (0.02+0.09+0.19+0.26+0.18+0.17+0.02+0.01) = 0.06<\/li>\r\n \t<li>0.19+0.26+0.18 = 0.63<\/li>\r\n \t<li>Check student\u2019s solution.<\/li>\r\n \t<li>\r\n<p id=\"eip-idp139654864\">40<sup>th<\/sup> percentile will fall between 30,000 and 40,000<\/p>\r\n<p id=\"eip-idp139655632\">80<sup>th<\/sup> percentile will fall between 50,000 and 75,000<\/p>\r\n<\/li>\r\n \t<li>Check student\u2019s solution.<\/li>\r\n<\/ol>\r\n3)\r\n<ol type=\"a\" data-mark-suffix=\".\">\r\n \t<li>more children; the left whisker shows that 25% of the population are children 17 and younger. The right whisker shows that 25% of the population are adults 50 and older, so adults 65 and over represent less than 25%.<\/li>\r\n \t<li>62.4%<\/li>\r\n<\/ol>\r\n<div class=\"textbox shaded\" data-type=\"glossary\">\r\n<h3 data-type=\"glossary-title\">Glossary<\/h3>\r\n<dl id=\"iqr\">\r\n \t<dt>Interquartile Range<\/dt>\r\n \t<dd id=\"id15896860\">or <em data-effect=\"italics\">IQR<\/em>, is the range of the middle 50 percent of the data values; the <em data-effect=\"italics\">IQR<\/em> is found by subtracting the first quartile from the third quartile.<\/dd>\r\n<\/dl>\r\n<dl id=\"outlier\">\r\n \t<dt>Outlier<\/dt>\r\n \t<dd id=\"id1171166689919\">an observation that does not fit the rest of the data<\/dd>\r\n<\/dl>\r\n<dl id=\"percentile\">\r\n \t<dt>Percentile<\/dt>\r\n \t<dd id=\"id19436015\">a number that divides ordered data into hundredths; percentiles may or may not be part of the data. The median of the data is the second quartile and the 50<sup>th<\/sup> percentile. The first and third quartiles are the 25<sup>th<\/sup> and the 75<sup>th<\/sup> percentiles, respectively.<\/dd>\r\n<\/dl>\r\n<dl id=\"quartiles\">\r\n \t<dt>Quartiles<\/dt>\r\n \t<dd id=\"id1164416504778\">the numbers that separate the data into quarters; quartiles may or may not be part of the data. The second quartile is the median of the data.<\/dd>\r\n<\/dl>\r\n<\/div>","rendered":"<p>&nbsp;<\/p>\n<p id=\"element-280\">The common measures of position or location are <span data-type=\"term\">quartiles<\/span> and <span data-type=\"term\">percentiles<\/span><\/p>\n<p id=\"fs-idp16986528\">Quartiles are special percentiles. The first quartile, <em data-effect=\"italics\">Q<\/em><sub>1<\/sub>, is the same as the 25<sup>th<\/sup> percentile, and the third quartile, <em data-effect=\"italics\">Q<\/em><sub>3<\/sub>, is the same as the 75<sup>th<\/sup> percentile. The median, <em data-effect=\"italics\">M<\/em>, is called both the second quartile and the 50<sup>th<\/sup> percentile.<\/p>\n<p id=\"element-105\">To calculate quartiles and percentiles, the data must be ordered from smallest to largest. Quartiles divide ordered data into quarters. Percentiles divide ordered data into hundredths. To score in the 90<sup>th<\/sup> percentile of an exam does not mean, necessarily, that you received 90% on a test. It means that 90% of test scores are the same or less than your score and 10% of the test scores are the same or greater than your test score.<\/p>\n<p id=\"fs-idm12500320\">Percentiles are useful for comparing values. For this reason, universities and colleges use percentiles extensively. One instance in which colleges and universities use percentiles is when SAT results are used to determine a minimum testing score that will be used as an acceptance factor. For example, suppose Duke accepts SAT scores at or above the 75<sup>th<\/sup> percentile. That translates into a score of at least 1220.<\/p>\n<p id=\"fs-idp48110304\">Percentiles are mostly used with very large populations. Therefore, if you were to say that 90% of the test scores are less (and not the same or less) than your score, it would be acceptable because removing one particular data value is not significant.<\/p>\n<p id=\"element-681\">The <span data-type=\"term\">median<\/span> is a number that measures the &#8220;center&#8221; of the data. You can think of the median as the &#8220;middle value,&#8221; but it does not actually have to be one of the observed values. It is a number that separates ordered data into halves. Half the values are the same number or smaller than the median, and half the values are the same number or larger. For example, consider the following data. <span data-type=\"newline\"><br \/>\n<\/span>1;\u00a0 11.5;\u00a0 6;\u00a0 7.2;\u00a0 4;\u00a0 8;\u00a0 9;\u00a0 10;\u00a0 6.8;\u00a0 8.3;\u00a0 2;\u00a0 2;\u00a0 10;\u00a0 1 <span data-type=\"newline\"><br \/>\n<\/span>Ordered from smallest to largest: <span data-type=\"newline\"><br \/>\n<\/span>1;\u00a0 1;\u00a0 2;\u00a0 2;\u00a0 4;\u00a0 6;\u00a0 6. 8;\u00a0 7.2;\u00a0 8;\u00a0 8.3;\u00a0 9;\u00a0 10;\u00a0 10;\u00a0 11.5<\/p>\n<p id=\"element-546\">Since there are 14 observations, the median is between the seventh value, 6.8, and the eighth value, 7.2. To find the median, add the two values together and divide by two.<\/p>\n<div data-type=\"equation\">\\(\\frac{6.8+7.2}{2}=7\\)<\/div>\n<p id=\"element-995\">The median is seven. Half of the values are smaller than seven and half of the values are larger than seven.<\/p>\n<p id=\"element-308\"><span data-type=\"term\">Quartiles<\/span> are numbers that separate the data into quarters. Quartiles may or may not be part of the data. To find the quartiles, first find the median or second quartile. The first quartile, <em data-effect=\"italics\">Q<\/em><sub>1<\/sub>, is the middle value of the lower half of the data, and the third quartile, <em data-effect=\"italics\">Q<\/em><sub>3<\/sub>, is the middle value, or median, of the upper half of the data. To get the idea, consider the same data set: <span data-type=\"newline\"><br \/>\n<\/span>1;\u00a0 1;\u00a0 2;\u00a0 2;\u00a0 4;\u00a0 6;\u00a0 6.8;\u00a0 7.2;\u00a0 8;\u00a0 8.3;\u00a0 9;\u00a0 10;\u00a0 10;\u00a0 11.5<\/p>\n<p id=\"element-805\">The median or <strong>second quartile<\/strong> is seven. The lower half of the data are 1,\u00a0 1,\u00a0 2,\u00a0 2,\u00a0 4,\u00a0 6,\u00a0 6.8. The middle value of the lower half is two. <span data-type=\"newline\"><br \/>\n<\/span>1;\u00a0 1;\u00a0 2;\u00a0 2;\u00a0 4;\u00a0 6;\u00a0 6.8<\/p>\n<p id=\"element-227\">The number two, which is part of the data, is the <span data-type=\"term\">first quartile<\/span>. One-fourth of the entire sets of values are the same as or less than two and three-fourths of the values are more than two.<\/p>\n<p>The upper half of the data is 7.2,\u00a0 8,\u00a0 8.3,\u00a0 9,\u00a0 10,\u00a0 10,\u00a0 11.5. The middle value of the upper half is nine.<\/p>\n<p id=\"element-386\">The <span data-type=\"term\">third quartile<\/span>, <em data-effect=\"italics\">Q<\/em>3, is nine. Three-fourths (75%) of the ordered data set are less than nine. One-fourth (25%) of the ordered data set are greater than nine. The third quartile is part of the data set in this example.<\/p>\n<p id=\"element-716\">The <span data-type=\"term\">interquartile range<\/span> is a number that indicates the spread of the middle half or the middle 50% of the data. It is the difference between the third quartile (<em data-effect=\"italics\">Q<\/em><sub>3<\/sub>) and the first quartile (<em data-effect=\"italics\">Q<\/em><sub>1<\/sub>).<\/p>\n<p id=\"delete_me\"><em data-effect=\"italics\">IQR<\/em> = <em data-effect=\"italics\">Q<\/em><sub>3<\/sub> \u2013 <em data-effect=\"italics\">Q<\/em><sub>1<\/sub><\/p>\n<p>The <em data-effect=\"italics\">IQR<\/em> can help to determine potential <strong>outliers<\/strong>. <strong>A value is suspected to be a potential outlier if it is less than (1.5)(<em data-effect=\"italics\">IQR<\/em>) below the first quartile or more than (1.5)(<em data-effect=\"italics\">IQR<\/em>) above the third quartile<\/strong>. Potential outliers always require further investigation.<\/p>\n<div id=\"fs-idm10803744\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\n<div data-type=\"title\">NOTE<\/div>\n<p id=\"fs-idp4345696\">A potential outlier is a data point that is significantly different from the other data points. These special data points may be errors or some kind of abnormality or they may be a key to understanding the data.<\/p>\n<\/div>\n<div id=\"element-826\" class=\"textbox textbox--examples\" data-type=\"example\">\n<div id=\"exer5\" data-type=\"exercise\">\n<div id=\"id45036025\" data-type=\"problem\">\n<p id=\"element-720\">For the following 13 real estate prices, calculate the <em data-effect=\"italics\">IQR<\/em> and determine if any prices are potential outliers. Prices are in dollars. <span data-type=\"newline\"><br \/>\n<\/span>389,950;\u00a0 230,500;\u00a0 158,000;\u00a0 479,000;\u00a0 639,000;\u00a0 114,950;\u00a0 5, 500,000;\u00a0 387,000;\u00a0 659,000;\u00a0 529,000;\u00a0 575,000;\u00a0 488,800;\u00a0 1,095,000<\/p>\n<\/div>\n<div id=\"id45746296\" data-type=\"solution\">\n<p id=\"element-939\">Order the data from smallest to largest. <span data-type=\"newline\"><br \/>\n<\/span>114,950;\u00a0 158,000;\u00a0 230,500;\u00a0 387,000;\u00a0 389,950;\u00a0 479,000;\u00a0 488,800;\u00a0 529,000;\u00a0 575,000; 639,000; 659,000; 1,095,000; 5,500,000<\/p>\n<p id=\"element-170\"><em data-effect=\"italics\">M<\/em> = 488, 800<\/p>\n<p><em data-effect=\"italics\">Q<\/em><sub>1<\/sub> = \\(\\frac{\\text{230,500 + 387,000}}{2}\\) = 308,750<\/p>\n<p><em data-effect=\"italics\">Q<\/em><sub>3<\/sub> = \\(\\frac{\\text{639,000 + 659,000}}{2}\\) = 649,000<\/p>\n<p id=\"element-290\"><em data-effect=\"italics\">IQR<\/em> = 649,000 \u2013 308,750 = 340,250<\/p>\n<p id=\"element-166\">(1.5)(<em data-effect=\"italics\">IQR<\/em>) = (1.5)(340,250) = 510,375<\/p>\n<p id=\"element-348\"><em data-effect=\"italics\">Q<\/em><sub>1<\/sub> \u2013 (1.5)(<em data-effect=\"italics\">IQR<\/em>) = 308,750 \u2013 510,375 = \u2013201,625<\/p>\n<p id=\"element-211\"><em data-effect=\"italics\">Q<\/em><sub>3<\/sub> + (1.5)(<em data-effect=\"italics\">IQR<\/em>) = 649,000 + 510,375 = 1,159,375<\/p>\n<p id=\"element-109\">No house price is less than \u2013201,625. However, 5,500,000 is more than 1,159,375. Therefore, 5,500,000 is a potential <span data-type=\"term\">outlier<\/span>.<\/p>\n<\/div>\n<\/div>\n<\/div>\n<div id=\"fs-idp16250528\" class=\"statistics try\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\n<div data-type=\"title\">Try It<\/div>\n<div id=\"fs-idp63302352\" data-type=\"exercise\">\n<div id=\"fs-idm22548992\" data-type=\"problem\">\n<p id=\"fs-idp42507600\">For the following 11 salaries, calculate the <em data-effect=\"italics\">IQR<\/em> and determine if any salaries are outliers. The salaries are in dollars.<\/p>\n<p id=\"fs-idm25187088\"><span data-type=\"list\" data-list-type=\"labeled-item\" data-display=\"inline\"><span data-type=\"item\">\\$33,000\u00a0 \u00a0\\$<\/span><span data-type=\"item\">64,500\u00a0 \u00a0\\$<\/span><span data-type=\"item\">28,000\u00a0 \u00a0\\$<\/span><span data-type=\"item\">54,000\u00a0 \u00a0\\$<\/span><span data-type=\"item\">72,000\u00a0 \u00a0\\$<\/span><span data-type=\"item\">68,500\u00a0 \u00a0\\$<\/span><span data-type=\"item\">69,000\u00a0 \u00a0\\$<\/span><span data-type=\"item\">42,000\u00a0 \u00a0\\$<\/span><span data-type=\"item\">54,000\u00a0 \u00a0\\$<\/span><span data-type=\"item\">120,000\u00a0 \u00a0\\$<\/span><span data-type=\"item\">40,500<\/span><\/span><\/p>\n<\/div>\n<\/div>\n<\/div>\n<div id=\"element-17\" class=\"textbox textbox--examples\" data-type=\"example\">\n<div data-type=\"exercise\">\n<div id=\"id45587381\" data-type=\"problem\">\n<p id=\"element-880\">For the two data sets in the <a href=\"#element-583\">test scores example<\/a>, find the following:<\/p>\n<ol type=\"a\" data-mark-suffix=\".\">\n<li>The interquartile range. Compare the two interquartile ranges.<\/li>\n<li>Any outliers in either set.<\/li>\n<\/ol>\n<\/div>\n<div id=\"fs-idm13740032\" data-type=\"solution\">\n<p id=\"fs-idp37987952\">The five number summary for the day and night classes is<\/p>\n<table id=\"fs-idp36487328\" summary=\"\">\n<thead>\n<tr>\n<th><\/th>\n<th>Minimum<\/th>\n<th><em data-effect=\"italics\">Q<\/em><sub>1<\/sub><\/th>\n<th>Median<\/th>\n<th><em data-effect=\"italics\">Q<\/em><sub>3<\/sub><\/th>\n<th>Maximum<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td><strong data-effect=\"bold\">Day<\/strong><\/td>\n<td>32<\/td>\n<td>56<\/td>\n<td>74.5<\/td>\n<td>82.5<\/td>\n<td>99<\/td>\n<\/tr>\n<tr>\n<td><strong data-effect=\"bold\">Night<\/strong><\/td>\n<td>25.5<\/td>\n<td>78<\/td>\n<td>81<\/td>\n<td>89<\/td>\n<td>98<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<ol id=\"fs-idm23962720\" type=\"a\">\n<li>The IQR for the day group is <em data-effect=\"italics\">Q<\/em><sub>3<\/sub> \u2013 <em data-effect=\"italics\">Q<\/em><sub>1<\/sub> = 82.5 \u2013 56 = 26.5\n<p id=\"fs-idm7044352\">The IQR for the night group is <em data-effect=\"italics\">Q<\/em><sub>3<\/sub> \u2013 <em data-effect=\"italics\">Q<\/em><sub>1<\/sub> = 89 \u2013 78 = 11<\/p>\n<p id=\"fs-idp42547504\">The interquartile range (the spread or variability) for the day class is larger than the night class <em data-effect=\"italics\">IQR<\/em>. This suggests more variation will be found in the day class\u2019s class test scores.<\/p>\n<\/li>\n<li>Day class outliers are found using the IQR times 1.5 rule. So,\n<ul id=\"fs-idm52257968\" data-labeled-item=\"true\">\n<li><em data-effect=\"italics\">Q<\/em><sub>1<\/sub> &#8211; <em data-effect=\"italics\">IQR<\/em>(1.5) = 56 \u2013 26.5(1.5) = 16.25<\/li>\n<li><em data-effect=\"italics\">Q<\/em><sub>3<\/sub> + <em data-effect=\"italics\">IQR<\/em>(1.5) = 82.5 + 26.5(1.5) = 122.25<\/li>\n<\/ul>\n<p id=\"fs-idp38341744\">Since the minimum and maximum values for the day class are greater than 16.25 and less than 122.25, there are no outliers.<\/p>\n<p id=\"fs-idm23940160\">Night class outliers are calculated as:<\/p>\n<ul id=\"fs-idp29569184\" data-labeled-item=\"true\">\n<li><em data-effect=\"italics\">Q<\/em><sub>1<\/sub> \u2013 <em data-effect=\"italics\">IQR<\/em> (1.5) = 78 \u2013 11(1.5) = 61.5<\/li>\n<li><em data-effect=\"italics\">Q<\/em><sub>3<\/sub> + IQR(1.5) = 89 + 11(1.5) = 105.5<\/li>\n<\/ul>\n<p id=\"fs-idp5005056\">For this class, any test score less than 61.5 is an outlier. Therefore, the scores of 45 and 25.5 are outliers. Since no test score is greater than 105.5, there is no upper end outlier.<\/p>\n<\/li>\n<\/ol>\n<\/div>\n<\/div>\n<\/div>\n<div id=\"fs-idp58037360\" class=\"statistics try\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\n<div data-type=\"title\">Try It<\/div>\n<div id=\"fs-idp23368176\" data-type=\"exercise\">\n<div id=\"fs-idp23368304\" data-type=\"problem\">\n<p id=\"fs-idp5269648\">Find the interquartile range for the following two data sets and compare them.<\/p>\n<p id=\"fs-idp4060048\">Test Scores for Class <em data-effect=\"italics\">A<\/em> <span data-type=\"newline\"><br \/>\n<\/span>69;\u00a0 96;\u00a0 81;\u00a0 79;\u00a0 65;\u00a0 76;\u00a0 83;\u00a0 99;\u00a0 89;\u00a0 67;\u00a0 90;\u00a0 77;\u00a0 85;\u00a0 98;\u00a0 66;\u00a0 91;\u00a0 77;\u00a0 69;\u00a0 80;\u00a0 94 <span data-type=\"newline\"><br \/>\n<\/span>Test Scores for Class <em data-effect=\"italics\">B<\/em> <span data-type=\"newline\"><br \/>\n<\/span>90;\u00a0 72;\u00a0 80;\u00a0 92;\u00a0 90;\u00a0 97;\u00a0 92;\u00a0 75;\u00a0 79;\u00a0 68;\u00a0 70;\u00a0 80;\u00a0 99;\u00a0 95;\u00a0 78;\u00a0 73;\u00a0 71;\u00a0 68;\u00a0 95;\u00a0 100<\/p>\n<\/div>\n<\/div>\n<\/div>\n<div id=\"element-84\" class=\"textbox textbox--examples\" data-type=\"example\">\n<p id=\"element-913\">Fifty statistics students were asked how much sleep they get per school night (rounded to the nearest hour). The results were:<\/p>\n<table id=\"id4431204\" summary=\"This table presents the amount of sleep per school night in hours in the first column, from 4-10 hours, frequency in the second column, relative frequency in the third column, and cumulative relative frequency in the fourth column.\">\n<thead>\n<tr>\n<th>AMOUNT OF SLEEP PER SCHOOL NIGHT (HOURS)<\/th>\n<th>FREQUENCY<\/th>\n<th>RELATIVE FREQUENCY<\/th>\n<th>CUMULATIVE RELATIVE FREQUENCY<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>4<\/td>\n<td>2<\/td>\n<td>0.04<\/td>\n<td>0.04<\/td>\n<\/tr>\n<tr>\n<td>5<\/td>\n<td>5<\/td>\n<td>0.10<\/td>\n<td>0.14<\/td>\n<\/tr>\n<tr>\n<td>6<\/td>\n<td>7<\/td>\n<td>0.14<\/td>\n<td>0.28<\/td>\n<\/tr>\n<tr>\n<td>7<\/td>\n<td>12<\/td>\n<td>0.24<\/td>\n<td>0.52<\/td>\n<\/tr>\n<tr>\n<td>8<\/td>\n<td>14<\/td>\n<td>0.28<\/td>\n<td>0.80<\/td>\n<\/tr>\n<tr>\n<td>9<\/td>\n<td>7<\/td>\n<td>0.14<\/td>\n<td>0.94<\/td>\n<\/tr>\n<tr>\n<td>10<\/td>\n<td>3<\/td>\n<td>0.06<\/td>\n<td>1.00<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p id=\"element-688\"><strong>Find the 28<sup>th<\/sup> percentile<\/strong>. Notice the 0.28 in the &#8220;cumulative relative frequency&#8221; column. Twenty-eight percent of 50 data values is 14 values. There are 14 values less than the 28<sup>th<\/sup> percentile. They include the two 4s, the five 5s, and the seven 6s. The 28<sup>th<\/sup> percentile is between the last six and the first seven. <strong>The 28<sup>th<\/sup> percentile is 6.5.<\/strong><\/p>\n<p id=\"element-488\"><strong>Find the median<\/strong>. Look again at the &#8220;cumulative relative frequency&#8221; column and find 0.52. The median is the 50<sup>th<\/sup> percentile or the second quartile. 50% of 50 is 25. There are 25 values less than the median. They include the two 4s, the five 5s, the seven 6s, and eleven of the 7s. The median or 50<sup>th<\/sup> percentile is between the 25<sup>th<\/sup>, or seven, and 26<sup>th<\/sup>, or seven, values. <strong>The median is seven.<\/strong><\/p>\n<p id=\"element-539\"><strong>Find the third quartile<\/strong>. The third quartile is the same as the 75<sup>th<\/sup> percentile. You can &#8220;eyeball&#8221; this answer. If you look at the &#8220;cumulative relative frequency&#8221; column, you find 0.52 and 0.80. When you have all the fours, fives, sixes and sevens, you have 52% of the data. When you include all the 8s, you have 80% of the data. <strong>The 75<sup>th<\/sup> percentile, then, must be an eight<\/strong>. Another way to look at the problem is to find 75% of 50, which is 37.5, and round up to 38. The third quartile, <em data-effect=\"italics\">Q<\/em><sub>3<\/sub>, is the 38<sup>th<\/sup> value, which is an eight. You can check this answer by counting the values. (There are 37 values below the third quartile and 12 values above.)<\/p>\n<\/div>\n<div id=\"fs-idm52647472\" class=\"statistics try\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\n<div data-type=\"title\">Try it<\/div>\n<div id=\"fs-idm18606176\" data-type=\"exercise\">\n<div id=\"fs-idm21314496\" data-type=\"problem\">\n<p id=\"fs-idm44305856\">Forty bus drivers were asked how many hours they spend each day running their routes (rounded to the nearest hour). Find the 65<sup>th<\/sup> percentile.<\/p>\n<table id=\"fs-idm24649760\" summary=\"\">\n<thead>\n<tr>\n<th>Amount of time spent on route (hours)<\/th>\n<th>Frequency<\/th>\n<th>Relative Frequency<\/th>\n<th>Cumulative Relative Frequency<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>2<\/td>\n<td>12<\/td>\n<td>0.30<\/td>\n<td>0.30<\/td>\n<\/tr>\n<tr>\n<td>3<\/td>\n<td>14<\/td>\n<td>0.35<\/td>\n<td>0.65<\/td>\n<\/tr>\n<tr>\n<td>4<\/td>\n<td>10<\/td>\n<td>0.25<\/td>\n<td>0.90<\/td>\n<\/tr>\n<tr>\n<td>5<\/td>\n<td>4<\/td>\n<td>0.10<\/td>\n<td>1.00<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<\/div>\n<\/div>\n<div id=\"element-572\" class=\"textbox textbox--examples\" data-type=\"example\">\n<div id=\"element-2353\" data-type=\"exercise\">\n<div id=\"id45288379\" data-type=\"problem\">\n<p id=\"element-23532\">Using <a class=\"autogenerated-content\" href=\"#id4431204\">(Figure)<\/a>:<\/p>\n<ol type=\"a\">\n<li>Find the 80<sup>th<\/sup> percentile.<\/li>\n<li>Find the 90<sup>th<\/sup> percentile.<\/li>\n<li>Find the first quartile. What is another name for the first quartile?<\/li>\n<\/ol>\n<\/div>\n<div id=\"fs-idp60869984\" data-type=\"solution\">\n<p id=\"fs-idp15042704\">Using the data from the frequency table, we have:<\/p>\n<ol id=\"fs-idm54301152\" type=\"a\">\n<li>The 80<sup>th<\/sup> percentile is between the last eight and the first nine in the table (between the 40<sup>th<\/sup> and 41<sup>st<\/sup> values). Therefore, we need to take the mean of the 40<sup>th<\/sup> an 41<sup>st<\/sup> values. The 80<sup>th<\/sup> percentile \\(=\\frac{8+9}{2}=8.5\\)<\/li>\n<li>The 90<sup>th<\/sup> percentile will be the 45<sup>th<\/sup> data value (location is 0.90(50) = 45) and the 45<sup>th<\/sup> data value is nine.<\/li>\n<li><em data-effect=\"italics\">Q<\/em><sub>1<\/sub> is also the 25<sup>th<\/sup> percentile. The 25<sup>th<\/sup> percentile location calculation: <em data-effect=\"italics\">P<\/em><sub>25<\/sub> = 0.25(50) = 12.5 \u2248 13 the 13<sup>th<\/sup> data value. Thus, the 25th percentile is six.<\/li>\n<\/ol>\n<\/div>\n<\/div>\n<\/div>\n<div id=\"fs-idm56651440\" class=\"statistics try\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\n<div data-type=\"title\">Try It<\/div>\n<div id=\"fs-idp38065168\" data-type=\"exercise\">\n<div id=\"fs-idm27880528\" data-type=\"problem\">\n<p id=\"fs-idp54653312\">Refer to the <a class=\"autogenerated-content\" href=\"#fs-idm24649760\">(Figure)<\/a>. Find the third quartile. What is another name for the third quartile?<\/p>\n<\/div>\n<\/div>\n<\/div>\n<div id=\"fs-idm13393536\" class=\"statistics collab\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\n<div data-type=\"title\">Collaborative Statistics<\/div>\n<p id=\"element-758\">Your instructor or a member of the class will ask everyone in class how many sweaters they own. Answer the following questions:<\/p>\n<ol id=\"exlist\">\n<li>How many students were surveyed?<\/li>\n<li>What kind of sampling did you do?<\/li>\n<li>Construct two different histograms. For each, starting value = _____ ending value = ____.<\/li>\n<li>Find the median, first quartile, and third quartile.<\/li>\n<li>Construct a table of the data to find the following:\n<ol id=\"exlist2\" type=\"a\">\n<li>the 10<sup>th<\/sup> percentile<\/li>\n<li>the 70<sup>th<\/sup> percentile<\/li>\n<li>the percent of students who own less than four sweaters<\/li>\n<\/ol>\n<\/li>\n<\/ol>\n<\/div>\n<div id=\"fs-idm21580416\" class=\"bc-section section\" data-depth=\"1\">\n<h3 data-type=\"title\">A Formula for Finding the <em data-effect=\"italics\">k<\/em>th Percentile<\/h3>\n<p id=\"fs-idp1786064\">If you were to do a little research, you would find several formulas for calculating the <em data-effect=\"italics\">k<\/em><sup>th<\/sup> percentile. Here is one of them.<\/p>\n<p id=\"fs-idp3096416\"><em data-effect=\"italics\">k<\/em> = the <em data-effect=\"italics\">k<sup>th<\/sup><\/em> percentile. It may or may not be part of the data.<\/p>\n<p id=\"fs-idp1947472\"><em data-effect=\"italics\">i<\/em> = the index (ranking or position of a data value)<\/p>\n<p id=\"fs-idm946480\"><em data-effect=\"italics\">n<\/em> = the total number of data<\/p>\n<ul id=\"fs-idm9831088\">\n<li>Order the data from smallest to largest.<\/li>\n<li>Calculate \\(i=\\frac{k}{100}\\left(n+1\\right)\\)<\/li>\n<li>If <em data-effect=\"italics\">i<\/em> is an integer, then the <em data-effect=\"italics\">k<sup>th<\/sup><\/em> percentile is the data value in the <em data-effect=\"italics\">i<sup>th<\/sup><\/em> position in the ordered set of data.<\/li>\n<li>If <em data-effect=\"italics\">i<\/em> is not an integer, then round <em data-effect=\"italics\">i<\/em> up and round <em data-effect=\"italics\">i<\/em> down to the nearest integers. Average the two data values in these two positions in the ordered data set. This is easier to understand in an example.<\/li>\n<\/ul>\n<div id=\"fs-idm4569232\" class=\"textbox textbox--examples\" data-type=\"example\">\n<div id=\"fs-idm105708208\" data-type=\"exercise\">\n<div id=\"fs-idm3783968\" data-type=\"problem\">\n<p id=\"fs-idp1509664\">Listed are 29 ages for Academy Award winning best actors <em data-effect=\"italics\">in order from smallest to largest.<\/em> <span data-type=\"newline\"><br \/>\n<\/span>18;\u00a0 21;\u00a0 22;\u00a0 25;\u00a0 26;\u00a0 27;\u00a0 29;\u00a0 30;\u00a0 31;\u00a0 33;\u00a0 36;\u00a0 37;\u00a0 41;\u00a0 42;\u00a0 47;\u00a0 52;\u00a0 55;\u00a0 57;\u00a0 58;\u00a0 62;\u00a0 64;\u00a0 67;\u00a0 69;\u00a0 71;\u00a0 72;\u00a0 73;\u00a0 74;\u00a0 76;\u00a0 77<\/p>\n<ol id=\"fs-idm1901040\" type=\"a\">\n<li>Find the 70<sup>th<\/sup> percentile.<\/li>\n<li>Find the 83<sup>rd<\/sup> percentile.<\/li>\n<\/ol>\n<\/div>\n<div id=\"fs-idp40713040\" data-type=\"solution\">\n<ol id=\"fs-idm62647008\" type=\"a\">\n<li>\n<ul id=\"fs-idp14170864\" data-labeled-item=\"true\">\n<li><em data-effect=\"italics\">k<\/em> = 70<\/li>\n<li><em data-effect=\"italics\">i<\/em> = the index<\/li>\n<li><em data-effect=\"italics\">n<\/em> = 29<\/li>\n<\/ul>\n<p><em data-effect=\"italics\">i<\/em> = \\(\\frac{k}{100}\\) (<em data-effect=\"italics\">n<\/em> + 1) = (\\(\\frac{70}{100}\\))(29 + 1) = 21. Twenty-one is an integer, and the data value in the 21<sup>st<\/sup> position in the ordered data set is 64. The 70<sup>th<\/sup> percentile is 64 years.<\/li>\n<li>\n<ul id=\"fs-idm21563168\" data-labeled-item=\"true\">\n<li><em data-effect=\"italics\">k<\/em> = 83<sup>rd<\/sup> percentile<\/li>\n<li><em data-effect=\"italics\">i<\/em> = the index<\/li>\n<li><em data-effect=\"italics\">n<\/em> = 29<\/li>\n<\/ul>\n<p><em data-effect=\"italics\">i<\/em> \u00a0= \\(\\frac{k}{100}\\) (<em data-effect=\"italics\">n<\/em> + 1) = (\\(\\frac{83}{100}\\))(29 + 1) = 24.9, which is NOT an integer.<\/p>\n<p>Round it down to 24 and up to 25. The age in the 24<sup>th<\/sup> position is 71 and the age in the 25<sup>th<\/sup> position is 72. Average 71 and 72. The 83<sup>rd<\/sup> percentile is 71.5 years.<\/li>\n<\/ol>\n<\/div>\n<\/div>\n<\/div>\n<div id=\"fs-idm16529696\" class=\"statistics try\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\n<div data-type=\"title\">Try It<\/div>\n<div id=\"fs-idm3894192\" data-type=\"exercise\">\n<div id=\"fs-idp25866864\" data-type=\"problem\">\n<p id=\"fs-idp25866992\">Listed are 29 ages for Academy Award winning best actors <em data-effect=\"italics\">in order from smallest to largest.<\/em><\/p>\n<p id=\"fs-idm19734064\">18;\u00a0 21;\u00a0 22;\u00a0 25;\u00a0 26;\u00a0 27;\u00a0 29;\u00a0 30;\u00a0 31;\u00a0 33;\u00a0 36;\u00a0 37;\u00a0 41;\u00a0 42;\u00a0 47;\u00a0 52;\u00a0 55;\u00a0 57;\u00a0 58;\u00a0 62;\u00a0 64;\u00a0 67;\u00a0 69;\u00a0 71;\u00a0 72;\u00a0 73;\u00a0 74;\u00a0 76;\u00a0 77 <span data-type=\"newline\"><br \/>\n<\/span>Calculate the 20<sup>th<\/sup> percentile and the 55<sup>th<\/sup> percentile.<\/p>\n<\/div>\n<\/div>\n<\/div>\n<div id=\"eip-404\" class=\"finger\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\n<div data-type=\"title\">NOTE<\/div>\n<p id=\"fs-idp26669920\">You can calculate percentiles using calculators and computers. There are a variety of online calculators.<\/p>\n<\/div>\n<\/div>\n<div id=\"fs-idp2972176\" class=\"bc-section section\" data-depth=\"1\">\n<h3 data-type=\"title\">A Formula for Finding the Percentile of a Value in a Data Set<\/h3>\n<ul id=\"fs-idm17756640\">\n<li>Order the data from smallest to largest.<\/li>\n<li><em data-effect=\"italics\">x<\/em> = the number of data values counting from the bottom of the data list up to but not including the data value for which you want to find the percentile.<\/li>\n<li><em data-effect=\"italics\">y<\/em> = the number of data values equal to the data value for which you want to find the percentile.<\/li>\n<li><em data-effect=\"italics\">n<\/em> = the total number of data.<\/li>\n<li>Calculate \\(\\frac{x+0.5y}{n}\\)(100). Then round to the nearest integer.<\/li>\n<\/ul>\n<div id=\"fs-idm3849664\" class=\"textbox textbox--examples\" data-type=\"example\">\n<div id=\"fs-idp28609648\" data-type=\"exercise\">\n<div id=\"fs-idp28609904\" data-type=\"problem\">\n<p id=\"fs-idp38890112\">Listed are 29 ages for Academy Award winning best actors <em data-effect=\"italics\">in order from smallest to largest.<\/em> <span data-type=\"newline\"><br \/>\n<\/span>18;\u00a0 21;\u00a0 22;\u00a0 25;\u00a0 26;\u00a0 27;\u00a0 29;\u00a0 30;\u00a0 31;\u00a0 33;\u00a0 36;\u00a0 37;\u00a0 41;\u00a0 42;\u00a0 47;\u00a0 52;\u00a0 55;\u00a0 57;\u00a0 58;\u00a0 62;\u00a0 64;\u00a0 67;\u00a0 69;\u00a0 71;\u00a0 72;\u00a0 73;\u00a0 74;\u00a0 76;\u00a0 77<\/p>\n<ol id=\"fs-idm21168272\" type=\"a\">\n<li>Find the percentile for 58.<\/li>\n<li>Find the percentile for 25.<\/li>\n<\/ol>\n<\/div>\n<div id=\"fs-idm170490752\" data-type=\"solution\">\n<ol id=\"fs-idm170490496\" type=\"a\">\n<li>Counting from the bottom of the list, there are 18 data values less than 58. There is one value of 58.\n<p id=\"fs-idm3871584\"><em data-effect=\"italics\">x<\/em> = 18 and <em data-effect=\"italics\">y<\/em> = 1. \\(\\frac{x+0.5y}{n}\\)(100) = \\(\\frac{18+0.5\\left(1\\right)}{29}\\)(100) = 63.80. 58 is the 64<sup>th<\/sup> percentile.<\/p>\n<\/li>\n<li>Counting from the bottom of the list, there are three data values less than 25. There is one value of 25.\n<p id=\"fs-idm21523472\"><em data-effect=\"italics\">x<\/em> = 3 and <em data-effect=\"italics\">y<\/em> = 1. \\(\\frac{x+0.5y}{n}\\)(100) = \\(\\frac{3+0.5\\left(1\\right)}{29}\\)(100) = 12.07. Twenty-five is the 12<sup>th<\/sup> percentile.<\/p>\n<\/li>\n<\/ol>\n<\/div>\n<\/div>\n<\/div>\n<div id=\"fs-idm170943360\" class=\"statistics try\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\n<div data-type=\"title\">Try It<\/div>\n<div id=\"fs-idm170942864\" data-type=\"exercise\">\n<div id=\"fs-idp35294448\" data-type=\"problem\">\n<p id=\"fs-idp35294576\">Listed are 30 ages for Academy Award winning best actors <u data-effect=\"underline\">in order from smallest to largest.<\/u><\/p>\n<p id=\"fs-idp13252768\">18;\u00a0 21;\u00a0 22;\u00a0 25;\u00a0 26;\u00a0 27;\u00a0 29;\u00a0 30;\u00a0 31;\u00a0 31;\u00a0 33;\u00a0 36;\u00a0 37;\u00a0 41;\u00a0 42;\u00a0 47;\u00a0 52;\u00a0 55;\u00a0 57;\u00a0 58;\u00a0 62;\u00a0 64;\u00a0 67;\u00a0 69;\u00a0 71;\u00a0 72;\u00a0 73;\u00a0 74;\u00a0 76;\u00a0 77 <span data-type=\"newline\"><br \/>\n<\/span>Find the percentiles for 47 and 31.<\/p>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<div id=\"fs-idp45793312\" class=\"bc-section section\" data-depth=\"1\">\n<h3 data-type=\"title\">Interpreting Percentiles, Quartiles, and Median<\/h3>\n<p id=\"eip-400\">A percentile indicates the relative standing of a data value when data are sorted into numerical order from smallest to largest. Percentages of data values are less than or equal to the pth percentile. For example, 15% of data values are less than or equal to the 15<sup>th<\/sup> percentile.<\/p>\n<ul id=\"eip-id1164310609380\" data-bullet-style=\"bullet\">\n<li>Low percentiles always correspond to lower data values.<\/li>\n<li>High percentiles always correspond to higher data values.<\/li>\n<\/ul>\n<p id=\"fs-idp44902944\">A percentile may or may not correspond to a value judgment about whether it is &#8220;good&#8221; or &#8220;bad.&#8221; The interpretation of whether a certain percentile is &#8220;good&#8221; or &#8220;bad&#8221; depends on the context of the situation to which the data applies. In some situations, a low percentile would be considered &#8220;good;&#8221; in other contexts a high percentile might be considered &#8220;good&#8221;. In many situations, there is no value judgment that applies.<\/p>\n<p id=\"fs-idm23920480\">Understanding how to interpret percentiles properly is important not only when describing data, but also when calculating probabilities in later chapters of this text.<\/p>\n<\/div>\n<div id=\"fs-idm106923680\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\n<div data-type=\"title\">NOTE<\/div>\n<p id=\"fs-idm20251376\">When writing the interpretation of a percentile in the context of the given data, the sentence should contain the following information.<\/p>\n<ul id=\"eip-id1168197264788\">\n<li>information about the context of the situation being considered<\/li>\n<li>the data value (value of the variable) that represents the percentile<\/li>\n<li>the percent of individuals or items with data values below the percentile<\/li>\n<li>the percent of individuals or items with data values above the percentile.<\/li>\n<\/ul>\n<\/div>\n<div id=\"eip-id1170215995305\" class=\"textbox textbox--examples\" data-type=\"example\">\n<div id=\"fs-idm91768592\" data-type=\"exercise\">\n<div id=\"fs-idm91768464\" data-type=\"problem\">\n<p id=\"eip-id1170184310084\">On a timed math test, the first quartile for time it took to finish the exam was 35 minutes. Interpret the first quartile in the context of this situation.<\/p>\n<\/div>\n<div id=\"fs-idm53128368\" data-type=\"solution\">\n<ul id=\"eip-id1170179452695\">\n<li>Twenty-five percent of students finished the exam in 35 minutes or less.<\/li>\n<li>Seventy-five percent of students finished the exam in 35 minutes or more.<\/li>\n<li>A low percentile could be considered good, as finishing more quickly on a timed exam is desirable. (If you take too long, you might not be able to finish.)<\/li>\n<\/ul>\n<\/div>\n<\/div>\n<\/div>\n<div id=\"fs-idp16945648\" class=\"statistics try\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\n<div data-type=\"title\">Try It<\/div>\n<div id=\"fs-idp33388848\" data-type=\"exercise\">\n<div id=\"fs-idm41955616\" data-type=\"problem\">\n<p id=\"fs-idp20699248\">For the 100-meter dash, the third quartile for times for finishing the race was 11.5 seconds. Interpret the third quartile in the context of the situation.<\/p>\n<\/div>\n<\/div>\n<\/div>\n<div id=\"eip-id1170441826663\" class=\"textbox textbox--examples\" data-type=\"example\">\n<div id=\"fs-idm148596320\" data-type=\"exercise\">\n<div id=\"fs-idm170402432\" data-type=\"problem\">\n<p id=\"eip-id1170436117670\">On a 20 question math test, the 70<sup>th<\/sup> percentile for number of correct answers was 16. Interpret the 70<sup>th<\/sup> percentile in the context of this situation.<\/p>\n<\/div>\n<\/div>\n<\/div>\n<div id=\"fs-idp77029680\" class=\"statistics try\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\n<div data-type=\"title\">Try It<\/div>\n<div id=\"fs-idp34034288\" data-type=\"exercise\">\n<div id=\"fs-idp48692144\" data-type=\"problem\">\n<p id=\"fs-idp55037680\">On a 60 point written assignment, the 80<sup>th<\/sup> percentile for the number of points earned was 49. Interpret the 80<sup>th<\/sup> percentile in the context of this situation.<\/p>\n<\/div>\n<\/div>\n<\/div>\n<div id=\"eip-id7060500\" class=\"textbox textbox--examples\" data-type=\"example\">\n<div id=\"fs-idm205091056\" data-type=\"exercise\">\n<div id=\"fs-idm15124096\" data-type=\"problem\">\n<p id=\"eip-id1170610063171\">At a community college, it was found that the 30<sup>th<\/sup> percentile of credit units that students are enrolled for is seven units. Interpret the 30<sup>th<\/sup> percentile in the context of this situation.<\/p>\n<\/div>\n<\/div>\n<\/div>\n<div id=\"fs-idp80590208\" class=\"statistics try\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\n<div data-type=\"title\">Try It<\/div>\n<div id=\"fs-idp73731328\" data-type=\"exercise\">\n<div id=\"fs-idp42792528\" data-type=\"problem\">\n<p id=\"fs-idm23433888\">During a season, the 40<sup>th<\/sup> percentile for points scored per player in a game is eight. Interpret the 40<sup>th<\/sup> percentile in the context of this situation.<\/p>\n<\/div>\n<\/div>\n<\/div>\n<div id=\"fs-idp9603904\" class=\"textbox textbox--examples\" data-type=\"example\">\n<p id=\"fs-idp45664304\">Sharpe Middle School is applying for a grant that will be used to add fitness equipment to the gym. The principal surveyed 15 anonymous students to determine how many minutes a day the students spend exercising. The results from the 15 anonymous students are shown.<\/p>\n<p id=\"fs-idp39768656\">0 minutes; 40 minutes; 60 minutes; 30 minutes; 60 minutes<\/p>\n<p id=\"fs-idm13969776\">10 minutes; 45 minutes; 30 minutes; 300 minutes; 90 minutes;<\/p>\n<p id=\"fs-idp22597008\">30 minutes; 120 minutes; 60 minutes; 0 minutes; 20 minutes<\/p>\n<p id=\"fs-idp53167440\">Determine the following five values.<\/p>\n<ul id=\"fs-idp70490496\" data-labeled-item=\"true\">\n<li>Min = 0<\/li>\n<li><em data-effect=\"italics\">Q<\/em><sub>1<\/sub> = 20<\/li>\n<li>Med = 40<\/li>\n<li><em data-effect=\"italics\">Q<\/em><sub>3<\/sub> = 60<\/li>\n<li>Max = 300<\/li>\n<\/ul>\n<p id=\"fs-idp83565376\">If you were the principal, would you be justified in purchasing new fitness equipment? Since 75% of the students exercise for 60 minutes or less daily, and since the <em data-effect=\"italics\">IQR<\/em> is 40 minutes (60 \u2013 20 = 40), we know that half of the students surveyed exercise between 20 minutes and 60 minutes daily. This seems a reasonable amount of time spent exercising, so the principal would be justified in purchasing the new equipment.<\/p>\n<p id=\"fs-idm77236544\">However, the principal needs to be careful. The value 300 appears to be a potential outlier.<\/p>\n<p id=\"fs-idm9669376\"><em data-effect=\"italics\">Q<\/em><sub>3<\/sub> + 1.5(<em data-effect=\"italics\">IQR<\/em>) = 60 + (1.5)(40) = 120.<\/p>\n<p id=\"fs-idp13270336\">The value 300 is greater than 120 so it is a potential outlier. If we delete it and calculate the five values, we get the following values:<\/p>\n<ul id=\"fs-idm2894688\" data-labeled-item=\"true\">\n<li>Min = 0<\/li>\n<li><em data-effect=\"italics\">Q<\/em><sub>1<\/sub> = 20<\/li>\n<li><em data-effect=\"italics\">Q<\/em><sub>3<\/sub> = 60<\/li>\n<li>Max = 120<\/li>\n<\/ul>\n<p id=\"fs-idm6660656\">We still have 75% of the students exercising for 60 minutes or less daily and half of the students exercising between 20 and 60 minutes a day. However, 15 students is a small sample and the principal should survey more students to be sure of his survey results.<\/p>\n<\/div>\n<div id=\"fs-idm63224784\" class=\"footnotes\" data-depth=\"1\">\n<h3 data-type=\"title\">References<\/h3>\n<p id=\"fs-idm63224288\">Cauchon, Dennis, Paul Overberg. \u201cCensus data shows minorities now a majority of U.S. births.\u201d USA Today, 2012. Available online at http:\/\/usatoday30.usatoday.com\/news\/nation\/story\/2012-05-17\/minority-birthscensus\/55029100\/1 (accessed April 3, 2013).<\/p>\n<p id=\"fs-idm76887104\">Data from the United States Department of Commerce: United States Census Bureau. Available online at http:\/\/www.census.gov\/ (accessed April 3, 2013).<\/p>\n<p id=\"fs-idm76886560\">\u201c1990 Census.\u201d United States Department of Commerce: United States Census Bureau. Available online at http:\/\/www.census.gov\/main\/www\/cen1990.html (accessed April 3, 2013).<\/p>\n<p id=\"fs-idm76885984\">Data from <em data-effect=\"italics\">San Jose Mercury News<\/em>.<\/p>\n<p id=\"fs-idm76885600\">Data from <em data-effect=\"italics\">Time Magazine<\/em>; survey by Yankelovich Partners, Inc.<\/p>\n<\/div>\n<div id=\"fs-idm13790128\" class=\"summary\" data-depth=\"1\">\n<h3 data-type=\"title\">Chapter Review<\/h3>\n<p id=\"fs-idp1397504\">The values that divide a rank-ordered set of data into 100 equal parts are called percentiles. Percentiles are used to compare and interpret data. For example, an observation at the 50<sup>th<\/sup> percentile would be greater than 50 percent of the other obeservations in the set. Quartiles divide data into quarters. The first quartile (<em data-effect=\"italics\">Q<\/em><sub>1<\/sub>) is the 25<sup>th<\/sup> percentile,the second quartile (<em data-effect=\"italics\">Q<\/em><sub>2<\/sub> or median) is 50<sup>th<\/sup> percentile, and the third quartile (<em data-effect=\"italics\">Q<\/em><sub>3<\/sub>) is the the 75<sup>th<\/sup> percentile. The interquartile range, or <em data-effect=\"italics\">IQR<\/em>, is the range of the middle 50 percent of the data values. The <em data-effect=\"italics\">IQR<\/em> is found by subtracting <em data-effect=\"italics\">Q<\/em><sub>1<\/sub> from <em data-effect=\"italics\">Q<\/em><sub>3<\/sub>, and can help determine outliers by using the following two expressions.<\/p>\n<ul id=\"fs-idp12766560\">\n<li><em data-effect=\"italics\">Q<\/em><sub>3<\/sub> + <em data-effect=\"italics\">IQR<\/em>(1.5)<\/li>\n<li><em data-effect=\"italics\">Q<\/em><sub>1<\/sub> \u2013 <em data-effect=\"italics\">IQR<\/em>(1.5)<\/li>\n<\/ul>\n<\/div>\n<div id=\"fs-idm202752\" class=\"formula-review\" data-depth=\"1\">\n<h3 data-type=\"title\">Formula Review<\/h3>\n<p id=\"fs-idp5126816\">\\(i=\\left(\\frac{k}{100}\\right)\\left(n+1\\right)\\)<\/p>\n<p id=\"fs-idp706784\">where <em data-effect=\"italics\">i<\/em> = the ranking or position of a data value,<\/p>\n<p id=\"fs-idm55046864\"><em data-effect=\"italics\">k<\/em> = the kth percentile,<\/p>\n<p id=\"fs-idm6916704\"><em data-effect=\"italics\">n<\/em> = total number of data.<\/p>\n<p id=\"fs-idp294352\">Expression for finding the percentile of a data value: \\(\\left(\\frac{x\\text{ + }0.5y}{n}\\right)\\)(100)<\/p>\n<p id=\"fs-idp17176000\">where <em data-effect=\"italics\">x<\/em> = the number of values counting from the bottom of the data list up to but not including the data value for which you want to find the percentile,<\/p>\n<p id=\"fs-idm2884704\"><em data-effect=\"italics\">y<\/em> = the number of data values equal to the data value for which you want to find the percentile,<\/p>\n<p id=\"fs-idm5691392\"><em data-effect=\"italics\">n<\/em> = total number of data<\/p>\n<\/div>\n<div id=\"fs-idp40431760\" class=\"practice\" data-depth=\"1\">\n<div id=\"fs-idm1110784\" data-type=\"exercise\">\n<div id=\"fs-idm38839376\" data-type=\"problem\">\n<p id=\"fs-idm38839120\">Listed are 29 ages for Academy Award winning best actors <em data-effect=\"italics\">in order from smallest to largest.<\/em><\/p>\n<p id=\"fs-idm6939584\">18;\u00a0 21;\u00a0 22;\u00a0 25;\u00a0 26;\u00a0 27;\u00a0 29;\u00a0 30;\u00a0 31;\u00a0 33;\u00a0 36;\u00a0 37;\u00a0 41;\u00a0 42;\u00a0 47;\u00a0 52;\u00a0 55;\u00a0 57;\u00a0 58;\u00a0 62;\u00a0 64;\u00a0 67;\u00a0 69;\u00a0 71;\u00a0 72;\u00a0 73;\u00a0 74;\u00a0 76;\u00a0 77<\/p>\n<ol id=\"fs-idp12728784\" type=\"a\">\n<li>Find the 40<sup>th<\/sup> percentile.<\/li>\n<li>Find the 78<sup>th<\/sup> percentile.<\/li>\n<\/ol>\n<\/div>\n<div id=\"fs-idm60642032\" data-type=\"solution\">\n<ol id=\"fs-idp34749472\" type=\"a\">\n<li>The 40<sup>th<\/sup> percentile is 37 years.<\/li>\n<li>The 78<sup>th<\/sup> percentile is 70 years.<\/li>\n<\/ol>\n<\/div>\n<\/div>\n<div id=\"fs-idm4719584\" data-type=\"exercise\">\n<div id=\"fs-idm4719328\" data-type=\"problem\">\n<p id=\"fs-idp14289472\">Listed are 32 ages for Academy Award winning best actors <em data-effect=\"italics\">in order from smallest to largest.<\/em><\/p>\n<p id=\"fs-idm62636912\">18;\u00a0 18;\u00a0 21;\u00a0 22;\u00a0 25;\u00a0 26;\u00a0 27;\u00a0 29;\u00a0 30;\u00a0 31;\u00a0 31;\u00a0 33;\u00a0 36;\u00a0 37;\u00a0 37;\u00a0 41;\u00a0 42;\u00a0 47;\u00a0 52;\u00a0 55;\u00a0 57;\u00a0 58;\u00a0 62;\u00a0 64;\u00a0 67;\u00a0 69;\u00a0 71;\u00a0 72;\u00a0 73;\u00a0 74;\u00a0 76;\u00a0 77<\/p>\n<ol id=\"fs-idm82651968\" type=\"a\">\n<li>Find the percentile of 37.<\/li>\n<li>Find the percentile of 72.<\/li>\n<\/ol>\n<\/div>\n<\/div>\n<div id=\"fs-idp30887728\" data-type=\"exercise\">\n<div id=\"fs-idp30887984\" data-type=\"problem\">\n<p id=\"fs-idp30888240\">Jesse was ranked 37<sup>th<\/sup> in his graduating class of 180 students. At what percentile is Jesse\u2019s ranking?<\/p>\n<\/div>\n<div id=\"fs-idm44160976\" data-type=\"solution\">\n<p id=\"fs-idm44160720\">Jesse graduated 37<sup>th<\/sup> out of a class of 180 students. There are 180 \u2013 37 = 143 students ranked below Jesse. There is one rank of 37.<\/p>\n<p id=\"fs-idm80018544\"><em data-effect=\"italics\">x<\/em> = 143 and <em data-effect=\"italics\">y<\/em> = 1. \\(\\frac{x+0.5y}{n}\\)(100) = \\(\\frac{143+0.5\\left(1\\right)}{180}\\)(100) = 79.72. Jesse\u2019s rank of 37 puts him at the 80<sup>th<\/sup> percentile.<\/p>\n<\/div>\n<\/div>\n<div id=\"eip-id1168182229232\" data-type=\"exercise\">\n<div id=\"eip-id1168183555687\" data-type=\"problem\">\n<ol id=\"eip-id1168185190290\" type=\"a\" data-mark-suffix=\".\">\n<li>For runners in a race, a low time means a faster run. The winners in a race have the shortest running times. Is it more desirable to have a finish time with a high or a low percentile when running a race?<\/li>\n<li>The 20<sup>th<\/sup> percentile of run times in a particular race is 5.2 minutes. Write a sentence interpreting the 20<sup>th<\/sup> percentile in the context of the situation.<\/li>\n<li>A bicyclist in the 90<sup>th<\/sup> percentile of a bicycle race completed the race in 1 hour and 12 minutes. Is he among the fastest or slowest cyclists in the race? Write a sentence interpreting the 90<sup>th<\/sup> percentile in the context of the situation.<\/li>\n<\/ol>\n<\/div>\n<\/div>\n<div id=\"eip-id1168182273864\" data-type=\"exercise\">\n<div id=\"eip-id1168191796049\" data-type=\"problem\">\n<ol id=\"eip-id5724192\" type=\"a\" data-mark-suffix=\".\">\n<li>For runners in a race, a higher speed means a faster run. Is it more desirable to have a speed with a high or a low percentile when running a race?<\/li>\n<li>The 40<sup>th<\/sup> percentile of speeds in a particular race is 7.5 miles per hour. Write a sentence interpreting the 40<sup>th<\/sup> percentile in the context of the situation.<\/li>\n<\/ol>\n<\/div>\n<div id=\"eip-id1168199883378\" data-type=\"solution\">\n<ol id=\"eip-id1168196369910\" type=\"a\" data-mark-suffix=\".\">\n<li>For runners in a race it is more desirable to have a high percentile for speed. A high percentile means a higher speed which is faster.<\/li>\n<li>40% of runners ran at speeds of 7.5 miles per hour or less (slower). 60% of runners ran at speeds of 7.5 miles per hour or more (faster).<\/li>\n<\/ol>\n<\/div>\n<\/div>\n<div id=\"eip-id1168217995987\" data-type=\"exercise\">\n<div id=\"eip-id1168183864592\" data-type=\"problem\">\n<p id=\"eip-id1168226425380\">On an exam, would it be more desirable to earn a grade with a high or low percentile? Explain.<\/p>\n<\/div>\n<\/div>\n<div id=\"eip-id1168183173702\" data-type=\"exercise\">\n<div id=\"eip-id1168227404691\" data-type=\"problem\">\n<p id=\"eip-id1168230025239\">Mina is waiting in line at the Department of Motor Vehicles (DMV). Her wait time of 32 minutes is the 85<sup>th<\/sup> percentile of wait times. Is that good or bad? Write a sentence interpreting the 85<sup>th<\/sup> percentile in the context of this situation.<\/p>\n<\/div>\n<div id=\"eip-id7704128\" data-type=\"solution\">\n<p id=\"eip-id1168214950316\">When waiting in line at the DMV, the 85<sup>th<\/sup> percentile would be a long wait time compared to the other people waiting. 85% of people had shorter wait times than Mina. In this context, Mina would prefer a wait time corresponding to a lower percentile. 85% of people at the DMV waited 32 minutes or less. 15% of people at the DMV waited 32 minutes or longer.<\/p>\n<\/div>\n<\/div>\n<div id=\"eip-id1168213546999\" data-type=\"exercise\">\n<div id=\"eip-id1168188876815\" data-type=\"problem\">\n<p id=\"eip-id7349223\">In a survey collecting data about the salaries earned by recent college graduates, Li found that her salary was in the 78<sup>th<\/sup> percentile. Should Li be pleased or upset by this result? Explain.<\/p>\n<\/div>\n<\/div>\n<div id=\"eip-id1168214876383\" data-type=\"exercise\">\n<div id=\"eip-id7327842\" data-type=\"problem\">\n<p id=\"eip-id1168230657040\">In a study collecting data about the repair costs of damage to automobiles in a certain type of crash tests, a certain model of car had \\$1,700 in damage and was in the 90<sup>th<\/sup> percentile. Should the manufacturer and the consumer be pleased or upset by this result? Explain and write a sentence that interprets the 90<sup>th<\/sup> percentile in the context of this problem.<\/p>\n<\/div>\n<div id=\"eip-id1168214946038\" data-type=\"solution\">\n<p id=\"eip-id1168234799988\">The manufacturer and the consumer would be upset. This is a large repair cost for the damages, compared to the other cars in the sample. INTERPRETATION: 90% of the crash tested cars had damage repair costs of \\$1700 or less; only 10% had damage repair costs of \\$1700 or more.<\/p>\n<\/div>\n<\/div>\n<div id=\"eip-id1168195852900\" data-type=\"exercise\">\n<div id=\"eip-id1168225195383\" data-type=\"problem\">\n<p id=\"eip-idm9549040\">The University of California has two criteria used to set admission standards for freshman to be admitted to a college in the UC system:<\/p>\n<ol id=\"eip-id1168211096380\" type=\"a\" data-mark-suffix=\"\">\n<li>Students&#8217; GPAs and scores on standardized tests (SATs and ACTs) are entered into a formula that calculates an &#8220;admissions index&#8221; score. The admissions index score is used to set eligibility standards intended to meet the goal of admitting the top 12% of high school students in the state. In this context, what percentile does the top 12% represent?<\/li>\n<li>Students whose GPAs are at or above the 96<sup>th<\/sup> percentile of all students at their high school are eligible (called eligible in the local context), even if they are not in the top 12% of all students in the state. What percentage of students from each high school are &#8220;eligible in the local context&#8221;?<\/li>\n<\/ol>\n<\/div>\n<\/div>\n<div id=\"eip-id1168223160542\" data-type=\"exercise\">\n<div id=\"eip-id7507305\" data-type=\"problem\">\n<p id=\"eip-id1168211272126\">Suppose that you are buying a house. You and your realtor have determined that the most expensive house you can afford is the 34<sup>th<\/sup> percentile. The 34<sup>th<\/sup> percentile of housing prices is \\$240,000 in the town you want to move to. In this town, can you afford 34% of the houses or 66% of the houses?<\/p>\n<\/div>\n<div id=\"eip-id1168213876148\" data-type=\"solution\">\n<p id=\"eip-id1168225765198\">You can afford 34% of houses. 66% of the houses are too expensive for your budget. INTERPRETATION: 34% of houses cost \\$240,000 or less. 66% of houses cost \\$240,000 or more.<\/p>\n<\/div>\n<\/div>\n<p id=\"element-726\">Use the following information to answer the next six exercises. Sixty-five randomly selected car salespersons were asked the number of cars they generally sell in one week. Fourteen people answered that they generally sell three cars; nineteen generally sell four cars; twelve generally sell five cars; nine generally sell six cars; eleven generally sell seven cars.<\/p>\n<div id=\"exercisenine\" data-type=\"exercise\">\n<div id=\"id21439538\" data-type=\"problem\">\n<p>First quartile = _______<\/p>\n<\/div>\n<\/div>\n<div id=\"exerciseten\" data-type=\"exercise\">\n<div id=\"id4433542\" data-type=\"problem\">\n<p>Second quartile = median = 50<sup>th<\/sup> percentile = _______<\/p>\n<\/div>\n<div id=\"id12404344\" data-type=\"solution\">\n<p id=\"element-23635\">4<\/p>\n<\/div>\n<\/div>\n<div id=\"exerciseeleven\" data-type=\"exercise\">\n<div id=\"id21413333\" data-type=\"problem\">\n<p>Third quartile = _______<\/p>\n<\/div>\n<\/div>\n<div id=\"exercisetwelve\" data-type=\"exercise\">\n<div id=\"id13392439\" data-type=\"problem\">\n<p>Interquartile range (<em data-effect=\"italics\">IQR<\/em>) = _____ \u2013 _____ = _____<\/p>\n<\/div>\n<div id=\"id10710871\" data-type=\"solution\">\n<p id=\"element-23646\">6 \u2013 4 = 2<\/p>\n<\/div>\n<\/div>\n<div id=\"exercisethirteen\" data-type=\"exercise\">\n<div id=\"id14610610\" data-type=\"problem\">\n<p id=\"prob_13\">10<sup>th<\/sup> percentile = _______<\/p>\n<\/div>\n<\/div>\n<div id=\"exercisefourteen\" data-type=\"exercise\">\n<div id=\"id21409553\" data-type=\"problem\">\n<p id=\"prob_14\">70<sup>th<\/sup> percentile = _______<\/p>\n<\/div>\n<div id=\"id23430727\" data-type=\"solution\">\n<p id=\"element-234636\">6<\/p>\n<\/div>\n<\/div>\n<\/div>\n<div id=\"fs-idm1839472\" class=\"free-response\" data-depth=\"1\">\n<h3 data-type=\"title\">Homework<\/h3>\n<div id=\"element-927\" data-type=\"exercise\">\n<div id=\"id3483376\" data-type=\"problem\">\n<p id=\"element-746\">1)\u00a0 Six hundred adult Americans were asked by telephone poll, &#8220;What do you think constitutes a middle-class income?&#8221; The results are in <a class=\"autogenerated-content\" href=\"#element-588\">(Figure)<\/a>. Also, include left endpoint, but not the right endpoint.<\/p>\n<table id=\"element-588\" summary=\"This table presents the results from a poll on what Americans thought constituted middle class. The first column lists the salary and the second column lists the relative frequency. There are 8 rows.\">\n<thead>\n<tr>\n<th>Salary (\\$)<\/th>\n<th>Relative Frequency<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>&lt; 20,000<\/td>\n<td>0.02<\/td>\n<\/tr>\n<tr>\n<td>20,000\u201325,000<\/td>\n<td>0.09<\/td>\n<\/tr>\n<tr>\n<td>25,000\u201330,000<\/td>\n<td>0.19<\/td>\n<\/tr>\n<tr>\n<td>30,000\u201340,000<\/td>\n<td>0.26<\/td>\n<\/tr>\n<tr>\n<td>40,000\u201350,000<\/td>\n<td>0.18<\/td>\n<\/tr>\n<tr>\n<td>50,000\u201375,000<\/td>\n<td>0.17<\/td>\n<\/tr>\n<tr>\n<td>75,000\u201399,999<\/td>\n<td>0.02<\/td>\n<\/tr>\n<tr>\n<td>100,000+<\/td>\n<td>0.01<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<ol id=\"element-295\" type=\"a\">\n<li>What percentage of the survey answered &#8220;not sure&#8221;?<\/li>\n<li>What percentage think that middle-class is from \\$25,000 to \\$50,000?<\/li>\n<li>Construct a histogram of the data.\n<ol id=\"nestlist3\" type=\"i\" data-mark-suffix=\".\">\n<li>Should all bars have the same width, based on the data? Why or why not?<\/li>\n<li>How should the &lt;20,000 and the 100,000+ intervals be handled? Why?<\/li>\n<\/ol>\n<\/li>\n<li>Find the 40<sup>th<\/sup> and 80<sup>th<\/sup> percentiles<\/li>\n<li>Construct a bar graph of the data<\/li>\n<\/ol>\n<\/div>\n<\/div>\n<\/div>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<div class=\"free-response\" data-depth=\"1\">\n<div id=\"fs-idp35930608\" data-type=\"exercise\">\n<div id=\"id3500598\" data-type=\"problem\">\n<p>2) Given the following box plot:<\/p>\n<div id=\"fs-idm476896\" class=\"bc-figure figure\"><span id=\"id4775287\" data-type=\"media\" data-alt=\"This is a horizontal boxplot graphed over a number line from 0 to 13. The first whisker extends from the smallest value, 0, to the first quartile, 2. The box begins at the first quartile and extends to third quartile, 12. A vertical, dashed line is drawn at median, 10. The second whisker extends from the third quartile to largest value, 13.\"><img decoding=\"async\" src=\"https:\/\/pressbooks.ccconline.org\/acccomposition1\/wp-content\/uploads\/sites\/83\/2022\/05\/fig-ch02_13_02-1.jpg\" alt=\"This is a horizontal boxplot graphed over a number line from 0 to 13. The first whisker extends from the smallest value, 0, to the first quartile, 2. The box begins at the first quartile and extends to third quartile, 12. A vertical, dashed line is drawn at median, 10. The second whisker extends from the third quartile to largest value, 13.\" width=\"400\" data-media-type=\"image\/jpg\" \/><\/span><\/div>\n<ol id=\"element-328\" type=\"a\">\n<li>which quarter has the smallest spread of data? What is that spread?<\/li>\n<li>which quarter has the largest spread of data? What is that spread?<\/li>\n<li>find the interquartile range (<em data-effect=\"italics\">IQR<\/em>).<\/li>\n<li>are there more data in the interval 5\u201310 or in the interval 10\u201313? How do you know this?<\/li>\n<li>which interval has the fewest data in it? How do you know this?\n<ol id=\"nestlist7\" type=\"i\" data-mark-suffix=\".\">\n<li>0\u20132<\/li>\n<li>2\u20134<\/li>\n<li>10\u201312<\/li>\n<li>12\u201313<\/li>\n<li>need more information<\/li>\n<\/ol>\n<\/li>\n<\/ol>\n<\/div>\n<\/div>\n<div id=\"element-284\" data-type=\"exercise\">\n<div id=\"id3912087\" data-type=\"problem\">\n<p>&nbsp;<\/p>\n<p id=\"element-874\">3) The following box plot shows the U.S. population for 1990, the latest available year.<\/p>\n<div id=\"fs-idm132205520\" class=\"bc-figure figure\"><span id=\"id7587202\" data-type=\"media\" data-alt=\"A box plot with values from 0 to 105, with Q1 at 17, M at 33, and Q3 at 50.\"><img decoding=\"async\" src=\"https:\/\/pressbooks.ccconline.org\/acccomposition1\/wp-content\/uploads\/sites\/83\/2022\/08\/fig-ch02_13_08-1.jpg\" alt=\"A box plot with values from 0 to 105, with Q1 at 17, M at 33, and Q3 at 50.\" width=\"400\" data-media-type=\"image\/jpg\" \/><\/span><\/div>\n<ol type=\"a\">\n<li>Are there fewer or more children (age 17 and under) than senior citizens (age 65 and over)? How do you know?<\/li>\n<li>12.6% are age 65 and over. Approximately what percentage of the population are working age adults (above age 17 to age 65)?<\/li>\n<\/ol>\n<p>&nbsp;<\/p>\n<\/div>\n<div id=\"id7597969\" data-type=\"solution\">\n<p id=\"fs-idm5042080\">The median age for U.S. blacks currently is 30.9 years; for U.S. whites it is 42.3 years.<\/p>\n<ol id=\"fs-idm33581296\" type=\"a\">\n<li>Based upon this information, give two reasons why the black median age could be lower than the white median age.<\/li>\n<li>Does the lower median age for blacks necessarily mean that blacks die younger than whites? Why or why not?<\/li>\n<li>How might it be possible for blacks and whites to die at approximately the same age, but for the median age for whites to be higher?<\/li>\n<\/ol>\n<\/div>\n<\/div>\n<\/div>\n<p>Answers to odd questions<\/p>\n<p>1)<\/p>\n<ol id=\"element-295a\" type=\"a\">\n<li>1 \u2013 (0.02+0.09+0.19+0.26+0.18+0.17+0.02+0.01) = 0.06<\/li>\n<li>0.19+0.26+0.18 = 0.63<\/li>\n<li>Check student\u2019s solution.<\/li>\n<li>\n<p id=\"eip-idp139654864\">40<sup>th<\/sup> percentile will fall between 30,000 and 40,000<\/p>\n<p id=\"eip-idp139655632\">80<sup>th<\/sup> percentile will fall between 50,000 and 75,000<\/p>\n<\/li>\n<li>Check student\u2019s solution.<\/li>\n<\/ol>\n<p>3)<\/p>\n<ol type=\"a\" data-mark-suffix=\".\">\n<li>more children; the left whisker shows that 25% of the population are children 17 and younger. The right whisker shows that 25% of the population are adults 50 and older, so adults 65 and over represent less than 25%.<\/li>\n<li>62.4%<\/li>\n<\/ol>\n<div class=\"textbox shaded\" data-type=\"glossary\">\n<h3 data-type=\"glossary-title\">Glossary<\/h3>\n<dl id=\"iqr\">\n<dt>Interquartile Range<\/dt>\n<dd id=\"id15896860\">or <em data-effect=\"italics\">IQR<\/em>, is the range of the middle 50 percent of the data values; the <em data-effect=\"italics\">IQR<\/em> is found by subtracting the first quartile from the third quartile.<\/dd>\n<\/dl>\n<dl id=\"outlier\">\n<dt>Outlier<\/dt>\n<dd id=\"id1171166689919\">an observation that does not fit the rest of the data<\/dd>\n<\/dl>\n<dl id=\"percentile\">\n<dt>Percentile<\/dt>\n<dd id=\"id19436015\">a number that divides ordered data into hundredths; percentiles may or may not be part of the data. The median of the data is the second quartile and the 50<sup>th<\/sup> percentile. The first and third quartiles are the 25<sup>th<\/sup> and the 75<sup>th<\/sup> percentiles, respectively.<\/dd>\n<\/dl>\n<dl id=\"quartiles\">\n<dt>Quartiles<\/dt>\n<dd id=\"id1164416504778\">the numbers that separate the data into quarters; quartiles may or may not be part of the data. The second quartile is the median of the data.<\/dd>\n<\/dl>\n<\/div>\n","protected":false},"author":32,"menu_order":7,"template":"","meta":{"pb_show_title":"","pb_short_title":"","pb_subtitle":"","pb_authors":[],"pb_section_license":""},"chapter-type":[],"contributor":[],"license":[],"class_list":["post-81","chapter","type-chapter","status-publish","hentry"],"part":51,"_links":{"self":[{"href":"https:\/\/pressbooks.ccconline.org\/accintrostats\/wp-json\/pressbooks\/v2\/chapters\/81","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/pressbooks.ccconline.org\/accintrostats\/wp-json\/pressbooks\/v2\/chapters"}],"about":[{"href":"https:\/\/pressbooks.ccconline.org\/accintrostats\/wp-json\/wp\/v2\/types\/chapter"}],"author":[{"embeddable":true,"href":"https:\/\/pressbooks.ccconline.org\/accintrostats\/wp-json\/wp\/v2\/users\/32"}],"version-history":[{"count":4,"href":"https:\/\/pressbooks.ccconline.org\/accintrostats\/wp-json\/pressbooks\/v2\/chapters\/81\/revisions"}],"predecessor-version":[{"id":705,"href":"https:\/\/pressbooks.ccconline.org\/accintrostats\/wp-json\/pressbooks\/v2\/chapters\/81\/revisions\/705"}],"part":[{"href":"https:\/\/pressbooks.ccconline.org\/accintrostats\/wp-json\/pressbooks\/v2\/parts\/51"}],"metadata":[{"href":"https:\/\/pressbooks.ccconline.org\/accintrostats\/wp-json\/pressbooks\/v2\/chapters\/81\/metadata\/"}],"wp:attachment":[{"href":"https:\/\/pressbooks.ccconline.org\/accintrostats\/wp-json\/wp\/v2\/media?parent=81"}],"wp:term":[{"taxonomy":"chapter-type","embeddable":true,"href":"https:\/\/pressbooks.ccconline.org\/accintrostats\/wp-json\/pressbooks\/v2\/chapter-type?post=81"},{"taxonomy":"contributor","embeddable":true,"href":"https:\/\/pressbooks.ccconline.org\/accintrostats\/wp-json\/wp\/v2\/contributor?post=81"},{"taxonomy":"license","embeddable":true,"href":"https:\/\/pressbooks.ccconline.org\/accintrostats\/wp-json\/wp\/v2\/license?post=81"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}