So I have two samples, and it is currently assumed that sample 1 has a higher failure rate than sample 2. I received the data below from experiments:
p1=0.6364, n1=11, p2=0.4444, n2=18
So I constructed a confidence interval using a 5% significance level which yielded -0.114 to 0.498.
So based on this confidence interval, can I conclude that p1 > p2 ?
Is there a better way to verify this claim with the data that I have, I was going to try a one tailed hypothesis test, but I've never done one where the null hypothesis is not p1-p2=0?
Thanks in advance for any help or advice you can provide!
From Sample 1, you have $X_1 = 7$ failures among $n_1 = 11$ items, and from Sample 2, you have $X_2 = 8$ failures among $n_2 = 18$ items. You wish to test the null hypothesis $H_0: p_1 = p_2$ against the one-sided alternative $H_a: p_1 > p_2$ at the 5% level of significance.
While it is true that estimated failure fraction $\hat p_1 = X_1/n_1 = 0.6364$ from the first sample is greater than the the estimated failure fraction $\hat p_2 = 0.4444$ from the second sample, your question is whether the first estimate is significantly larger in a statistical sense.
Because you are doing a one-sided test, this might be decided by looking at a one-sided 95% confidence interval, but not directly by looking at a two-sided confidence interval. Maybe it is better for you to do a one-sided test.
The test statistic is $$Z = \frac{\hat p_1 - \hat p_2}{\sqrt{\hat p(1- \hat p)/n}},$$ were $n = n_1 + n_2$ and $\hat p = (X_1 + X_2)/n.$ You will reject $H_0$ at the 5% level, if $Z > 1.645.$ [You can find this formula in statistics texts and online at in the NIST Engineering Handbook.]
Here is printout from Minitab 17 statistical software that shows the computation, and provides some additional information.
Notice that $Z = 1.00 < 1.645,$ so you do not reject $H_0.$ Your data are consistent with equal failure proportions in the two populations. The P-value is $P(Z > 1.00) = 0.158.$ You could have rejected $H_0$ if the P-value were smaller than 5%.
This test uses a normal approximation to the binomial, which may not be exactly accurate for sample sizes as small as $n_1 = 11$ and $n_2 = 18.$ However, the result is nowhere near the 'borderline' between rejecting and not, so an error in the normal approximation is not likely to have made a difference in your decision not to reject.