The explanation with which I'm familiar goes like this. Define the conditional probability P(A|B) as the probability of their intersections P(A and B) divided by the probability P(B). To figure out the probability of the intersection P(A and B) then, simply rearrange the terms of equation algebraically to obtain P(A and B) = P(A|B) X P(B).
The problem: What I'm struggling to understand is how to apply this equation in practice. It seems like whenever we want to figure out P(A and B), we need to figure out the values of P(A|B) and P(B).But to figure out the value of P(A|B), we already must figure out the value of P(A and B). So how is the equation P(A and B) = P(A|B) X P(B) helpful for figuring out the value of P(A and B)? In other words, to solve for it, we must figure out the value of P(A|B), which we can't do without knowing the value of P(A and B).
In practice, a lot of time you can calculate $P(A|B)$ by assuming $B$ happened and see what is the probability of $A$. For example, let's say you draw 2 cards face down, then randomly choose 1 to flip up, and let $B$ is the event that exactly 1 red card and 1 blue card was drawn, and $A$ is the event that the flipped up card is blue. Then you know $P(A|B)$ without knowing $P(B)$ nor $P(A\bigcap B)$, because assuming you have 1 red card and 1 blue card faced down, randomly picking 1 of them to flip up give you blue card with probability $\frac{1}{2}$.