I am trying to learn the maths I need for data science, but I left formal maths education so long ago that my maths intuition is incredibly weak. I am confronted with the following explanation, and I just don't see it as obvious in the way that the author expects me to:
"If A and B are not independent, the equation (1.6) generalizes to
P(A and B) = P(A) P(B|A)
This should make sense to you. Suppose 30% of all UC Davis students are in engineering, and 20% of all engineering majors are female. That would imply that 0.30 x 0.20 = 0.06, i.e., 6% of all UCD students are female engineers."
Why are we multiplying P(A) and P(B|A)? Of course, I know that
(∣)=(∩)/()
and so rearranging we get the above formula, but that doesn't help explain why (for me). I've worked through his explanation of the female engineering students assuming that the overall number of students is 100, but I'm not seeing how this connects to the formula.
Clarification: equation (1.6) is:
P(A and B) = P(A) · P(B)
when A and B are independent.
We get to multiply the probabilities of individual events in a compound event when the outcome of the second doesn't depend on the first. So if I want to choose between the numbers 1-3 on the first number and 1-5 on the second and repeats are allowed, I get to just multiply $\frac 1 3$ by $\frac 1 5$.
We can view $A \cap B$ sort of the same, except for that pesky "they aren't independent". However, we can force independence! For my first probability, I just need $A$ to be true. Now, we want to shrink our universe of possibilities to only those in which $A$ is true. From that pool, we then can look at the probability of $B$ happening "Next" as independent. But that probability is exactly $P(B|A)$.
Rather than the formula viewpoint of $P(B|A)$, I prefer the "restricted universe" viewpoint. $P(B)$ is the probability of $B$ happening in the big universe. $P(B|A)$ looks at what happens if we restrict our viewpoint to only the parts in which $A$ happened, then rerun the problem of the chances of $B$ happening in our sub-universe.