I have problems interpreting the formula of Mutual Information (MI) as shown below. It was taken from this paper http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.56.8930&rep=rep1&type=pdf 
In section 3.1, with the formula (3), I calculate MI of I(T,A), the case of single parent.
$\sum\ P_{Pr}(A)$ = 0.99 + 0.01 = 1
$\sum\ p(T|A)\ log \frac{p(T|A)}{P_{pr}(T)} $ = 0.05(log2(0.05/0.03)) + 0.95(log2(0.95/0.03)) + 0.01(log2(0.01/0.97)) + 0.99(log2(0.99/0.97)) = 4.7356
With $P_{pr}(T)$=(0.03 0.97), The correct answer of I(T,A) is 0.009, which a lot different of my calculation. So, I'm asking if anybody can provide me with a detail calculation?
My next step is to also do the same with the formula (4) in section 3.2, where I can calculate I(O,T). This is the case of multiple parents. According to the paper, the correct answer is 0.602, but no matter I calculated it, I couldn't get the answer.
Thank you very much and I highly appreciate your generous help.
For the first part though, I'm getting 0.01 which is close but not exact. Here are the details:
If $T$ takes on values $t_1$ and $t_2$ (and $A$, $a_1$ and $a_2$)
$$Pr(T=t_1) = Pr(T=t_1,A=a_1)+Pr(T=t_1,A=a_2) = Pr(T=t_1|A=a_1)Pr(A=a_1)+Pr(T=t_2|A=a_2)Pr(A=a_2)$$
Also, $\sum_{i}Pr(T=t_i|A=a_j) = 1$
Using the corresp. conditional probability matrix (i.e., $$Pr(T=t_1|A=a_1)=0.05, Pr(T=t_1|A=a_2) = 0.01, Pr(T=t_2|A=a_1) = 0.95, Pr(T=t_2|A=a_2) = 0.99$$), we get $Pr(A=a_1) = Pr(A=a_2)$ = 0.5.
Then using (3), we get 0.01.