Here's the statement of the problem.
At a certain stage of a criminal investigation, the inspector in charge is 60 percent convinced of the guilt of a certain suspect. Suppose, however, that a new piece of evidence which shows that the criminal has a certain characteristic (such as left-handedness, baldness, or brown hair) is uncovered. If 20 percent of the population possesses this characteristic, how certain of the guilt of the suspect should the inspector now be if it turns out that the suspect has the characteristic?
I read the solution and had the same question as the OP of this post. This answer clarifies the situation somewhat, however I have two other questions:
- Why is $P(C | G^c) = 0.2$? It makes sense that $P (C | G) = 1$, as the problem statement explicitly says so. But I don't see why it makes sense to apply an information on the population to the suspect, if he's not guilty.
- In the answer above, the author says that "$P(C)$ is the measure of the inspector's belief that the suspect would possess the characteristic prior to examination." How can we tell that just from reading the problem statement (especially the "prior to examination" bit)? The book's author defines $G$ as the event that the suspect is guilty and $C$ the event that he possesses the characteristic of the criminal, so I read $P(C)$ as "the probability that the suspect possesses the characteristic of the criminal", and thought that $P(C) = 1$.
(Perhaps this is a language barrier thing, but sometimes I have trouble interpreting the problem statement, or telling which detail is given, and which one is not. An example is the Monty Hall problem. When computing the probability that the host would point to door 2, it is necessary to eliminate the cases where he points to door 1, as the contestant has already picked it. However, when I considered the probability $P(\text{host points to door 2})$, I was thinking that I had to consider all possibilities, including the cases where the host points to door 1. The same sort of confusion arised when I had to consider whether the event "the suspect has the characteristic" is given (i.e., $P(C) = 1$) or not.)
Why is $P(C|G^C)=0.2$? It makes sense that $P(C|G)=1$, as the problem statement explicitly says so. But I don't see why it makes sense to apply an information on the population to the suspect, if he's not guilty.
Think of the event $G$ as a singleton set consisting of one and only one person, that is, the person who committed the crime, among all people in the population. Then, $G^C$ is an event consisting of everyone in the population except for that one person who committed the crime. Hence, we interpret $P(C|G^C)$ as the probability our suspect has the characteristic in question given he is a member of $G^C$. In other words, $P(C|G^C)$ is the probability that our suspect has the characteristic if he is just one of many people in the population (minus the one guilty person). Since $20$% of the population has the characteristic and our suspect is presumed to be one the many, we have $P(C|G^C) = 0.2$.
In the answer above, the author says that "$P(C)$ is the measure of the inspector's belief that the suspect would possess the characteristic prior to examination." How can we tell that just from reading the problem statement (especially the "prior to examination" bit)? The book's author defines G as the event that the suspect is guilty and $C$ the event that he possesses the characteristic of the criminal, so I read $P(C)$ as "the probability that the suspect possesses the characteristic of the criminal", and thought that $P(C)=1$.
I think $P(C)$ is best interpreted as the probability the suspect has the characteristic in question. Nowhere in the problem description does it say the suspect actually possesses the characteristic. Thus, $P(C) \neq 1$ because we simply do not know for certain that the suspect has the characteristic. The likelihood with which he has the characteristic depends on whether he is guilty, and the law of total proability shows that
$$ P(C) = P(G)P(C|G) + P(G^C)P(C|G^C) $$
is actually a weighted average of conditional probabilities $P(C|G), P(C|G^C)$ where the weights $P(G), P(G^C)$ are the levels of belief regarding the suspect's guilt.