Let $ X_1 \sim B(n_1, p_1), \ldots, X_k \sim B(n_k, p_k) $ be independent random variables and $ x_1, \ldots, x_k $ a sequence of realizations of them. I know the value of $n_1, \ldots, n_k $ but not that of $ p_1, \ldots, p_k $ and I'm looking for a way to test $ H_0: p_1 = \ldots = p_k $.
2026-04-01 23:54:27.1775087667
Testing $ H_0: p_1 = \ldots = p_k $ with a sequence $ x_1, \ldots, x_k $ of realizations of $ X_1 \sim B(n_1, p_1), \ldots, X_k \sim B(n_k, p_k) $
110 Views Asked by Bumbble Comm https://math.techqa.club/user/bumbble-comm/detail At
2
There are 2 best solutions below
Related Questions in PROBABILITY
- How to prove $\lim_{n \rightarrow\infty} e^{-n}\sum_{k=0}^{n}\frac{n^k}{k!} = \frac{1}{2}$?
- Is this a commonly known paradox?
- What's $P(A_1\cap A_2\cap A_3\cap A_4) $?
- Prove or disprove the following inequality
- Another application of the Central Limit Theorem
- Given is $2$ dimensional random variable $(X,Y)$ with table. Determine the correlation between $X$ and $Y$
- A random point $(a,b)$ is uniformly distributed in a unit square $K=[(u,v):0<u<1,0<v<1]$
- proving Kochen-Stone lemma...
- Solution Check. (Probability)
- Interpreting stationary distribution $P_{\infty}(X,V)$ of a random process
Related Questions in STATISTICS
- Given is $2$ dimensional random variable $(X,Y)$ with table. Determine the correlation between $X$ and $Y$
- Statistics based on empirical distribution
- Given $U,V \sim R(0,1)$. Determine covariance between $X = UV$ and $V$
- Fisher information of sufficient statistic
- Solving Equation with Euler's Number
- derive the expectation of exponential function $e^{-\left\Vert \mathbf{x} - V\mathbf{x}+\mathbf{a}\right\Vert^2}$ or its upper bound
- Determine the marginal distributions of $(T_1, T_2)$
- KL divergence between two multivariate Bernoulli distribution
- Given random variables $(T_1,T_2)$. Show that $T_1$ and $T_2$ are independent and exponentially distributed if..
- Probability of tossing marbles,covariance
Related Questions in STATISTICAL-INFERENCE
- co-variance matrix of discrete multivariate random variable
- Question on completeness of sufficient statistic.
- Probability of tossing marbles,covariance
- Estimate the square root of the success probability of a Binomial Distribution.
- A consistent estimator for theta is?
- Using averages to measure the dispersion of data
- Confidence when inferring p in a binomial distribution
- A problem on Maximum likelihood estimator of $\theta$
- Derive unbiased estimator for $\theta$ when $X_i\sim f(x\mid\theta)=\frac{2x}{\theta^2}\mathbb{1}_{(0,\theta)}(x)$
- Show that $\max(X_1,\ldots,X_n)$ is a sufficient statistic.
Related Questions in BINOMIAL-DISTRIBUTION
- Given $X$ Poisson, and $f_{Y}(y\mid X = x)$, find $\mathbb{E}[X\mid Y]$
- Estimate the square root of the success probability of a Binomial Distribution.
- Choosing oranges. I'm going to lose my mind
- Probability:Binomial Distribution Mean and Variance Problem
- Probability Bookings in a Hotel
- Using Binomial Distribution to Find the Probability of Two of the Same evnts ocurring
- uniformly most powerful test: binomial distribution
- binomial normal with dependent success probability
- A share price grows $1+\epsilon$ times with prob. $p$ or falls $1-\epsilon$ times with prob. $1-p$ each day, what's its expected value after $n$ days?
- A baseball player hits the ball 35% of the time. In 10 opportunities, what is the probability of connecting more than 2 hits?
Trending Questions
- Induction on the number of equations
- How to convince a math teacher of this simple and obvious fact?
- Find $E[XY|Y+Z=1 ]$
- Refuting the Anti-Cantor Cranks
- What are imaginary numbers?
- Determine the adjoint of $\tilde Q(x)$ for $\tilde Q(x)u:=(Qu)(x)$ where $Q:U→L^2(Ω,ℝ^d$ is a Hilbert-Schmidt operator and $U$ is a Hilbert space
- Why does this innovative method of subtraction from a third grader always work?
- How do we know that the number $1$ is not equal to the number $-1$?
- What are the Implications of having VΩ as a model for a theory?
- Defining a Galois Field based on primitive element versus polynomial?
- Can't find the relationship between two columns of numbers. Please Help
- Is computer science a branch of mathematics?
- Is there a bijection of $\mathbb{R}^n$ with itself such that the forward map is connected but the inverse is not?
- Identification of a quadrilateral as a trapezoid, rectangle, or square
- Generator of inertia group in function field extension
Popular # Hahtags
second-order-logic
numerical-methods
puzzle
logic
probability
number-theory
winding-number
real-analysis
integration
calculus
complex-analysis
sequences-and-series
proof-writing
set-theory
functions
homotopy-theory
elementary-number-theory
ordinary-differential-equations
circles
derivatives
game-theory
definite-integrals
elementary-set-theory
limits
multivariable-calculus
geometry
algebraic-number-theory
proof-verification
partial-derivative
algebra-precalculus
Popular Questions
- What is the integral of 1/x?
- How many squares actually ARE in this picture? Is this a trick question with no right answer?
- Is a matrix multiplied with its transpose something special?
- What is the difference between independent and mutually exclusive events?
- Visually stunning math concepts which are easy to explain
- taylor series of $\ln(1+x)$?
- How to tell if a set of vectors spans a space?
- Calculus question taking derivative to find horizontal tangent line
- How to determine if a function is one-to-one?
- Determine if vectors are linearly independent
- What does it mean to have a determinant equal to zero?
- Is this Batman equation for real?
- How to find perpendicular vector to another vector?
- How to find mean and median from histogram
- How many sides does a circle have?
Note: This is a revision. In the former version, I tried to generalize the usual test for comparing success probabilities that uses data from two two binomial random variables. A correct version of that is more complicated than I first assumed.
After a discussion via Comments beneath @kimchilover's Anwser (+1) I realized that my method was not the same as his suggested test on a $k \times 2$ table, but that the method in that Answer is correct. So I have changed my answer to discuss that method, which is implemented in R and other statistical software.
If $H_0: p_1 = p_2 = \cdots = p_k$ is true, then your have observed $\sum_{i=1}^k X_{i1}$ successes in $\sum_{i=1}^k n_i$ trials altogether. Also, under $H_0$ you have expected successes $E_{i1} = n_i\hat p,$ for $i = 1, \dots, k,$ to compare with the corresponding $k$ observed numbers $X_{i1}$ of successes. Similarly, you have observed failure counts $X_{i2} = n_i - X_{i1}$ and corresponding expected failure counts $E_{i2} = n_i = E_{i1}.$
If $H_0$ is true, then the goodness-of-fit statistic $$Q = \sum_{j=1}^2\sum_{i=1}^k \frac{(X_{ij}-E_{ij})^2}{E_{ij}} \stackrel{\text{aprx}}{\sim} \mathsf{Chisq}(k-1),$$
provided that all $E_i > 5$ (some texts say provided that "most" $E_i > 5$ and all $E_i > 3.$)
Notice that small values of $Q$ correspond to 'good fits' of observed counts to corresponding expected counts. You will reject $H_0$ at the 5% level of significance if $Q \ge q^*,$ where $q^*$ cuts 5% of the probability from the upper tail of $\mathsf{Chisq}(k-1).$
Example: Suppose $k = 5;\; n_1 = n_2 = 20;\; n_3 = n_4 = n_5 = 30.$ Also suppose $\mathbf{X_1} = (5,3,5,8,8),$ so that $\hat p= 29/130 = 0.2231$ and $E_{11} = E_{21} = 20\hat p = 4.462,$ $E_{31} = E_{41} = E_{51} = 30\hat p = 6.692.$ Quantities for the second column are found accordingly.
The computation of Q in R statistical software is shown below:
Thus $Q = 1.9085$ and we cannot reject $H_0$ because the 95th percentile of $\mathsf{Chisq}(4)$ is $q^* = 9.488.$ The warning message is generated because $E_{11} = E_{21} \approx 4.46,$ which is barely below $5$. One would reject for a P-value below 0.05, so we are not anywhere close to rejection. [The code
chisq.test(MAT)$expprints the matrix of $E_{ij}$'s. The codechisq.test(MAT, sim=T)simulates the more reliable P-value $\approx 0.77.$][For this example, I simulated the five $X_i$'s using all five $p_i = 1/4,$ so that $H_0$ is true. For real data, of course, one never knows absolutely for sure whether $H_0$ is true or not.]
Consider an experiment with $k = 5$ independent random variables $X_i$ each $\mathsf{Binom}(50,.7),$ so that $H_0$ is true. A histogram of 100,000 values of $Q$ from repeated performances of this experiment is shown below, along with the density function of $\mathsf{Chisq}(4).$ The $Q$-value led to rejection is about 5% of the iterations.
By contrast consider an experiment with $k = 5$ independent binomial random variables $X_i$ each with $n = 50,$ but with $p = .3, .4, .5, .6$ and $.7,$ so that $H_0$ is not true. Here about 97% of 100,000 iterations led to rejection.