The conditional probability of $A$ given the occurrence of $B$ is how likely $A$ is to have occurred given that $B$ has occurred. The formal definition of the conditional probability of $A$ given $B$ is $$P(A|B)=\frac{P(A\cap B)}{P(B)}$$
I get the numerator term: when we reduce the sample space to just occurrences of $B$, then the occurrences of $A$ will be all and only those occurrences of $A$ which are also occurrences of $B$ - i.e. the intersection of $A$ and $B$.
I don't get the denominator term, though. If we know that $A$ has occurred, then $B$ becomes the sample space. The probability of an event is defined to be the long-run relative frequency of favourable outcomes to all possible outcomes (i.e. the sample space). That is: $$P(A) = \frac{A}{\Omega}$$
But if $B$ becomes the sample space, then why are we assigning a probability to $B$, as opposed to dividing by the number of occurrences of $B$? In other words, should we not have: $$P(A|B)=\frac{A\cap B}{B}$$
The probability of $B$ here is just $1$, is it not? What have I missed?
In the approach you are thinking of where you are "counting the number of favourable outcomes", it should really be the cardinality or size of the set: i.e. you $\mathbb{P}(A) = \frac{|A|}{|\Omega|}$ (where these all make sense). Now you suggest the conditional probability should be of the form $\mathbb{P}(A \mid B) = \frac{|A \cap B|}{|B|}$; compare this to the more general definition $\mathbb{P}(A \mid B) = \frac{\mathbb{P}(A \cap B)}{\mathbb{P}(B)}$. These are essentially the same thing, when our probabilities are defined by counting, i.e. $\mathbb{P}(A \cap B) = \frac{|A \cap B|}{|\Omega|}$ and $\mathbb{P}(B) = \frac{|B|}{|\Omega|}$ (sub them in!)