Question:
A Secretary of State wants to survey the primary owners of motorcycles registered in the state to estimate the proportion who want the license plates redesigned. (Primary owner means that if more than one owner is listed, the person listed first is considered the primary owner.) There are 6,000,000 motorcycles registered. He asks an employee to pick the sample, who then picks a SRS with replacement of 1000 license plate numbers corresponding to the motorcycles. All 1000 license plate numbers are distinct – no duplicates. The employee sends a questionnaire to all of the primary owners for those motorcycles. The response rate is 100%, and 400 of the 1000 respondents report that they favor redesigning the license plates, and the remaining 600 report that they do not favor redesigning the plates. After receiving this data, the employee realizes that some people own multiple motorcycles and could have been included multiple times in the survey. However in fact no owner was selected more than one time.
(a) Is the sample proportion (40%, or 400/1000) an unbiased estimator of the proportion of primary motorcycle owners who want the license plates redesigned? Why or why not?
(b) The employee finds out how many motorcycles each survey respondent has registered in the state, and prepares the following tabulation.
\begin{array}{} No.of motorcycles Owned & No. of Respondents & No.Favoring Redesign& Proportion Favoring \\ \hline 1 & 640 & 240 & 0.375\\ 2 & 200 & 90 & 0.450\\ 3 & 100 & 40 & 0.400\\ 4 & 50 & 25 & 0.500\\ 5 & 10 & 5 & 0.500\\ 6.or.more & 0 \\ Total & 1000 & 400 & 0.400 \end{array}
Given all of this information, estimate the total number of primary motorcycle owners who want the license plates redesigned.
Presumably, the opinion $R$ of someone doesn't change when their asked multiple times, but it may depend on the number of motorcycles owned $O$. If so, then the expected value of your sample proportion $p$ is $\sum P(O=i)E(X|O=i)$. Therefore, your estimator is biased if there is substantial stratification of opinions based on number of motorcycles owned. In your case, I'd say it looks like you'd have a biased sample, since even if you only have one response from each primary owner, the owners who own more motorcycles are more likely to even be included in the survey. Also, the data indicate that the number of motorcycles owned is a predictor of opinion.
As for (b): You'll need to divide the number of responses by the number of motorcycles owned, then do a weighted average of the observed proportions then multiply by 6 Million.