What is the probability/percentage of $X$ people that $Y$? [bayes/law of total probability]

49 Views Asked by At

Attempting the following problem, and it's going nowhere.

Suppose that $75$% of all people with credit records, improve their credit records within three years. Suppose that $18$% of the population at large have poor credit records, and of these only $30$% will improve their credit records within three years. What percentage of the people who will improve their credit records, are the ones who currently have good credit records?

My attempt has been (to find $P(G|I)$):

$R$: has a credit record, $B$: has poor credit record, $G$: has good credit record, $I$: will improve credit record.

$$P(G|I) = \frac{P(I|G)P(G)}{P(I)}$$

I end up proceeding with some manipulations but then being stuck with an expression for $P(G|I)$ in terms of $P(I)$, and then can't find a proper law of total probability expression that gets me a number for $P(I)$.

That is, the eventual point I get to is $$P(G|I|) = 1-\frac{P(I|B)P(B)}{P(I)} = 1-\frac{0.054}{P(I)}$$ which is where everything further leads in circles.. any help would be much appreciated.

2

There are 2 best solutions below

5
On BEST ANSWER

The answer is not uniquely determined.

I assume $R = G \cup B$ and $P[I \cap R^c]=0$ (if we do not have a record, then we cannot improve). Fix $x$ as any number that satisfies: $$ \underbrace{\frac{81}{250}}_{0.324}\leq x \leq \underbrace{\frac{87}{125}}_{0.696}$$ We can partition the sample space into 6 disjoint events and find probabilities for each of these events. The following table is consistent with all given information:

$$\begin{array}{|c|c|c|} \hline &G & B & R^c\\ \hline I & x & \frac{27}{500} & 0\\ \hline I^c & \frac{x}{3} - \frac{27}{250} & \frac{63}{500} & \frac{116}{125}-\frac{4x}{3} \\ \hline \end{array}$$

$$ P[G|I] = \frac{x}{x + \frac{27}{500}}$$


To verify: (which shows the table values are sufficient)

  • All probabilities in the table are nonnegative and sum to 1.

  • $P[B] = \frac{27}{500} + \frac{63}{500} = 0.18$

  • $P[I|B] = \frac{27}{27+63} = 0.3$.

  • $P[I|R] = \frac{x+ \frac{27}{500}}{x+\frac{27}{500} + \frac{x}{3} - \frac{27}{250} + \frac{63}{500}} = 0.75$.


Caveat: It is unclear how to treat cases of "improving" or "not improving" for people who do not have a record. I started by assuming $P[I \cap R^c]=0$. However the table requires $P[I^c \cap R^c]$ to have some potentially positive value. If we also require $P[I^c \cap R^c]=0$ then $x= 87/125$ and the (unique) answer is $$ P[G|I] = \frac{87/125}{87/125 + 27/500} = 0.928$$


How I derived the table: (which shows the table values are necessary)

I started with:

$$\begin{array}{|c|c|c|} \hline &G & B & R^c\\ \hline I & \underbrace{P[I \cap G]}_{x} & P[I \cap B] & 0\\ \hline I^c & \underbrace{P[I^c \cap G]}_{y} & P[I^c \cap B] & P[I^c \cap R^c] \\ \hline \end{array}$$

  • From given info $P[B]=0.18$ and $P[I|B]=0.3$ I got the second column:

$$ 0.18 = P[I \cap B] + P[I^c \cap B] \quad , \quad 0.3 = \frac{P[I \cap B]}{0.18} $$ $$ \implies \boxed{P[I \cap B] = \frac{27}{500}}, \boxed{P[I^c \cap B] = \frac{63}{500}}$$

  • I defined $P[I \cap G] = x$ and $P[I^c \cap G] = y$. From given info $P[I|R] = 0.75$ I got the first column:

$$ 0.75 = \frac{P[I \cap G] + P[I \cap B]}{P[I\cap G] + P[I\cap B] + P[I^c\cap G] + P[I^c \cap B]} = \frac{x + \frac{27}{500}}{x + \frac{27}{500} + y + \frac{63}{500}}$$ $$ \implies \boxed{y = \frac{x}{3} - \frac{27}{250}}$$

  • I computed $P[I^c \cap R^c]$ as 1 minus all 5 other probabilities: $$ P[I^c \cap R^c] = 1 - x - \left(\frac{x}{3}-\frac{27}{250}\right) - \frac{27}{500} - \frac{63}{500}$$

$$ \implies \boxed{P[I^c \cap R^c] = \frac{116}{125} - \frac{4x}{3}}$$

  • I needed all 6 probabilities in the table to be nonnegative:

\begin{align} \frac{x}{3}-\frac{27}{250} \geq 0 \quad &\implies \boxed{x \geq \frac{81}{250}}\\ \frac{116}{125} - \frac{4x}{3} \geq 0 \quad &\implies \boxed{x \leq \frac{87}{125}} \end{align}

1
On

If you make a Venn diagram of the current situation, you can visualise exactly what you need to find.

enter image description here

Let ABCD represent your total population - within this, the quadilateral EFGH depicts people with credit records. Now within these people, the quadliateral EIJF represents those who will improve in 3 years, and the circle represents those with poor credit scores

Now, to translate your question to this diagram, you need to find the proportion of people sitting in (EIJF - K) to (EHGF-K)

$$EIJF = 0.75 \cdot EHGF$$

$$K \cap EIJF = 0.3 K $$

$$P = \frac{(0.75)\cdot n(EHGF) - (0.3)\cdot n(K)}{n(EHGF)-n(K)}$$

Let the proportion of people having credit records at all be $p$. This gives us the missing link

$$P = \frac{(0.75)\cdot p\cdot n(ABCD) - (0.3)\cdot 0.18 \cdot n(ABCD)}{p \cdot n(ABCD)-0.18 \cdot n(ABCD)}$$

$$P = \frac{(0.75)\cdot p - (0.3)\cdot 0.18}{p -0.18}$$

Now as far as I can see, the question does not give us enough data to solve for $p$, assuming that everyone has a credit record, the probability is

$$P(p=1) = 0.848$$

Is this the answer given?