2.3 Cases: cups of Mocha Frappuccino. Variables: size and price (both quantitative)
2.7 Answers will vary.
2.9 The size in ounces is the explanatory variable, which should explain or cause changes in the cost. The scatterplot shows that there is a relationship: as ounces increase, so does the cost.
2.11
2.15
When a country’s MI is $35, its predicted
2.17 There are 7 points above the line and 7 points below the line.
2.19
The values of
r |
|
|
|
|
0 | 0.2 | 0.4 | 0.8 |
|
1 | 0.64 | 0.16 | 0.04 | 0 | 0.04 | 0.16 | 0.64 |
2.25 There were 974 children aged 11 to 13. There were 1278 total children who met the requirement.
2.27
Each entry is the row total divided by the table total. For No,
2.29
Yes:
2.3 Answers will vary. Some possible variables are price, type of textbook, major, third or fourth year course, etc.
2.5 Answers will vary. Some possible variables are university, size, etc., in addition to the average number of tickets sold and the percentage of games won. Cases would be individual teams. Here, we would likely be interested in whether there is a relationship between the average number of tickets sold and the percentage of games won.
2.7 Answers will vary. This could include enrollment, graduation rate, job placement rate, in-state tuition, out-of-state tuition, public/private institution, etc.
2.17 Answers will vary. The explanatory variable is the major. The response variable is graduating in four years. Both variables are categorical, therefore the methods described in this section cannot be used.
2.21 The relationships between calories and alcohol content are quite similar for both domestic and imported beers. Also, the outlier for the imported beers no longer is an outlier because there are several other domestic beers that have a similar alcohol content.
2.33
For fuel type D:
2.37 There is little linear association between research and teaching; for example, knowing that a professor is a good researcher gives little information about whether she or he is a good or bad teacher.
2.43 Both correlations for the imported and domestic beers are quite similar, especially when the outlier O’Doul’s is removed. The relationships between calories and percent alcohol for both types of beers are linear and very strong and quite similar in pattern.
2.55
Predicted bone strength is 22.854
(a) – (c)
|
||||
---|---|---|---|---|
Time | Count | Predicted (a) | Difference (b) | Squared difference (c) |
1 | 578 | 528.1 | 49.9 | 2490.01 |
3 | 317 | 378.7 |
|
3806.89 |
5 | 203 | 229.3 |
|
691.69 |
7 | 118 | 79.9 | 38.1 | 1451.61 |
|
||
---|---|---|
Predicted | Difference | Squared Difference |
400 | 178 | 31,684 |
200 | 117 | 13,689 |
0 | 203 | 41,209 |
|
318 | 101,124 |
The first line is a better description of the relationship.
2.73
2.75
2.77
The residuals are 10.58,
2.79 10.0, extrapolation; 13.0, 16.0, 19.0, 30.0, prediction.
(a) – (b)
Time | LogCount | Predicted | Residual |
---|---|---|---|
1 | 6.35957 | 6.33244 | 0.02713 |
3 | 5.75890 | 5.81121 |
|
5 | 5.31321 | 5.28997 | 0.02324 |
7 | 4.77068 | 4.76874 | 0.00195 |
2.87 Internet use does not cause people to have fewer babies. Possible lurking variables are economic status of the country, levels of education, etc.
2.89 For example, a reasonable explanation is that the cause-and-effect relationship goes in the other direction: doing well makes students or workers feel good about themselves rather than vice versa.
2.91 The explanatory and response variables were “consumption of herbal tea” and “cheerfulness/health.” The most important lurking variable is social interaction; many of the nursing-home residents may have been lonely before the students started visiting.
2.95 Each group has a positive association, but when combined, the regression slope is negative.
Under 20 | 20 to 25 | 25 to 30 | 30 to 35 | 35 to 40 | Over 40 | |
---|---|---|---|---|---|---|
Yes | 0.0002 | 0.0019 | 0.0033 | 0.0053 | 0.0086 | 0.0114 |
No | 0.1761 | 0.2333 | 0.1663 | 0.1316 | 0.1423 | 0.1196 |
Marginal distribution of rejected | |
---|---|
Yes | No |
0.03081 | 0.96919 |
Marginal distribution of age | |||||
---|---|---|---|---|---|
Under 20 | 20 to 25 | 25 to 30 | 30 to 35 | 35 to 40 | Over 40 |
0.1763 | 0.2352 | 0.1696 | 0.1369 | 0.1509 | 0.131 |
The conditional distribution of Rejected given Age, because we have said Age is the explanatory variable.
In the table, note that all columns sum to 1. We can clearly see the proportion of rejected recruits increasing with increasing age.
Under 20 | 20 to 25 | 25 to 30 | 30 to 35 | 35 to 40 | Over 40 | |
---|---|---|---|---|---|---|
Yes | 0.0012 | 0.0082 | 0.0196 | 0.0389 | 0.0572 | 0.0868 |
No | 0.9988 | 0.9918 | 0.9804 | 0.9611 | 0.9428 | 0.9132 |
2.101 Sex is the explanatory variable, and Lied is the response variable. For the males, about 55% admitted that they had lied, whereas for the females, 51% admitted that they had lied. Males may be slightly more willing to admit that they lied than females.
2.105 3.0% of Hospital A’s patients died, compared with 2.0% at B.
2.107 In general, choose a to be any number from 0 to 300, and then all the other entries can be determined.
2.109 For example, causation might be a negative association between the setting on a stove and the time required to boil a pot of water (higher setting, less time). Common response might be a positive association between SAT score and grade point average. Both of these will have a positive relationship with a person’s IQ. An example of confounding might be a negative association between hours of TV watching and grade point average. Once again, people who are naturally smart could finish required work faster and have more time for TV; those who aren’t as smart could become frustrated and watch TV instead of doing homework.
2.111 This is a case of confounding: the association between dietary iron and anemia is difficult to detect because malaria and helminths also affect iron levels in the body.
2.113 Responses will vary. For example, students who choose the online course might have more self-motivation or better computer skills. The generic “Student characteristics” might be replaced with something more specific.
2.115 No; self-confidence and improving fitness could be common responses to some other personality trait, or high self-confidence could make a person more likely to join the exercise program.
2.117 Patients suffering from more serious illnesses are more likely to go to larger hospitals (which may have more or better facilities) for treatment. They are also likely to require more time to recuperate afterward.
2.119 People who are overweight are more likely to be on diets and so choose artificial sweeteners over sugar.
2.121 This is an observational study: students choose their “treatment” (to take or not take the refresher sessions).
2.131 A school that accepts weaker students but graduates a higher-than-expected number of them would have a positive residual, whereas a school with a stronger incoming class but a lower-than-expected graduation rate would have a negative residual. It seems reasonable to measure school quality by how much benefit students receive from attending the school.
2.139 Number of firefighters and amount of damage are common responses to the seriousness of the fire.
2.141
There is a strong linear positive association.
75% of men apply to business school, where admission is easier. More women apply to law school, which is more selective.
Sex | Admit | Deny | Total |
---|---|---|---|
Male | 490 | 310 | 800 |
Female | 400 | 300 | 700 |
Total | 890 | 610 | 1500 |
2.145 If we ignore “Year,” Department A teaches 61.54% small classes, and Department B teaches 39.62% small classes. However, in upper-level classes, A has 77.5% and B has 83.33% small classes. Additionally, 76.92% of A’s classes are upper-level courses, compared to 33.96% of B’s classes.
2.149 Answers will vary.
The tables are shown here.
Female Titanic passengers | ||||
---|---|---|---|---|
Class | Total | |||
1 | 2 | 3 | ||
Survived | 139 | 94 | 106 | 339 |
Died | 5 | 12 | 110 | 127 |
Total | 144 | 106 | 216 | 466 |
Male Titanic passengers | ||||
---|---|---|---|---|
Class | Total | |||
1 | 2 | 3 | ||
Survived | 61 | 25 | 75 | 161 |
Died | 118 | 146 | 418 | 682 |
Total | 179 | 171 | 493 | 843 |
If we look at the conditional distribution of survival given class for females, 96.53% of first-class females survived, 88.68% survival among second-class females, and 49.07% survival among third-class females. Survival depended on class.
For males, 34.08% survival among first class, 14.62% survival among second class, and 15.21% survival among third class. Once again, survival depended on class.
Females overall had much higher survival rates than males.
2.153 Answers will vary.