## Homework 1

Due date: Sept. 12. Homework assignments should be submitted to me by email. Please do not send a Word document. Instead, save the Word file as a PDF and send the PDF file. Put stat3355 hw1 in the subject line.

1. Use the HairEyeColor data set to obtain the joint frequency table for eye color and sex. Find the following percentages and express each as a probability statement.

[a] The percentage of the group that are male.
[b] The percentage of the group with green eyes.
[c] The percentage of males with green eyes.
[d] The percentage of those with green eyes who are male.
[e] The percentage of those with brown eyes who are female.
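
As a reminder of how marginal, joint, and conditional percentages differ when read off a joint frequency table, here is a minimal sketch in Python (the course itself uses R). The counts and the labels `trait` and `group` are invented for illustration and are not taken from HairEyeColor.

```python
# Hypothetical joint frequency table keyed by (trait, group).
# These counts are invented for illustration only.
counts = {
    ("yes", "A"): 30, ("yes", "B"): 10,
    ("no", "A"): 20, ("no", "B"): 40,
}
total = sum(counts.values())

# Marginal: P(group = A), the share of the whole table in group A.
p_A = sum(v for (t, g), v in counts.items() if g == "A") / total

# Marginal: P(trait = yes).
p_yes = sum(v for (t, g), v in counts.items() if t == "yes") / total

# Joint: P(trait = yes and group = A), one cell divided by the grand total.
p_yes_and_A = counts[("yes", "A")] / total

# Conditional: P(trait = yes | group = A), the cell divided by its group total.
p_yes_given_A = p_yes_and_A / p_A

# Conditional in the other direction: P(group = A | trait = yes).
p_A_given_yes = p_yes_and_A / p_yes

print(f"P(A) = {p_A:.2f}")
print(f"P(yes and A) = {p_yes_and_A:.2f}")
print(f"P(yes | A) = {p_yes_given_A:.2f}")
print(f"P(A | yes) = {p_A_given_yes:.2f}")
```

Note that the two conditional percentages use the same cell count but different denominators, so they generally differ.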

2. HairEyeColor continued.
[a] Obtain the expected frequencies under the assumption of independence for each combination of eye color and sex, the distance from independence for each combination, and the total distance of this frequency table from independence.
[b] Construct a barplot that shows the relative proportions of eye color within sex categories.
[c] Construct an informative barplot that shows the relative proportions of males and females within eye colors.
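
The expected-frequency calculation in part [a] can be sketched as follows, in Python on an invented 2×2 table. This sketch assumes that the distance from independence for a cell is (observed − expected)² / expected, so that the total distance is the sum over all cells. If the definition given in class differs, use that one instead.

```python
# Hypothetical observed counts (rows x columns), invented for illustration only.
observed = [[30, 10],
            [20, 40]]

row_totals = [sum(row) for row in observed]
col_totals = [sum(col) for col in zip(*observed)]
n = sum(row_totals)

# Expected count under independence: (row total * column total) / grand total.
expected = [[r * c / n for c in col_totals] for r in row_totals]

# ASSUMPTION: per-cell distance is (observed - expected)^2 / expected.
distance = [[(o - e) ** 2 / e for o, e in zip(obs_row, exp_row)]
            for obs_row, exp_row in zip(observed, expected)]

# Total distance of the table from independence: sum over all cells.
total_distance = sum(sum(row) for row in distance)

print("expected:", expected)
print("distance:", distance)
print("total distance:", round(total_distance, 4))
```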

3. Use the data contained in the file
http://www.utdallas.edu/~ammann/stat3355scripts/Smoking.txt

[a] Find the means and standard deviations for each variable.
[b] Which states are more than 2 standard deviations above the mean for cigarette consumption? For bladder cancer? For lung cancer?
[c] Which states are in the top 10% of cigarette consumption? Of bladder cancer? Of lung cancer? (See the documentation for the R function quantile().)
[d] Plot cigarette consumption versus lung cancer and add an informative title. Be sure to think about which variable should be plotted on the Y-axis. Do the same for cigarette consumption versus bladder cancer.
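
The two selection rules in parts [b] and [c] can be illustrated with a minimal sketch in Python on invented values (the labels `a` through `k` are hypothetical and are not from Smoking.txt). It assumes the sample standard deviation (divisor n − 1) and the interpolation rule that, to my knowledge, matches the default of R's quantile().

```python
import statistics

# Hypothetical values keyed by label; invented for illustration only.
values = {"a": 10.0, "b": 12.0, "c": 11.0, "d": 13.0, "e": 9.0, "f": 12.0,
          "g": 10.0, "h": 11.0, "i": 25.0, "j": 11.0, "k": 12.0}
data = list(values.values())

mean = statistics.mean(data)
sd = statistics.stdev(data)  # sample standard deviation (divisor n - 1)

# Labels whose value is more than 2 standard deviations above the mean.
above_2sd = [k for k, v in values.items() if v > mean + 2 * sd]

# 90th percentile: the last of the nine decile cut points.
# ASSUMPTION: method="inclusive" interpolates the same way as R's default.
p90 = statistics.quantiles(data, n=10, method="inclusive")[-1]

# Labels in the top 10%: values strictly above the 90th percentile.
top_10pct = [k for k, v in values.items() if v > p90]

print("more than 2 sd above the mean:", above_2sd)
print("90th percentile:", p90)
print("top 10%:", top_10pct)
```

The two rules need not select the same cases in general: the first depends on the mean and standard deviation, the second only on rank.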

ammann
2017-11-16