Understanding the Difference of Variance Formulas

I have been catching up on my statistics studies and have come to think of the variance of a distribution as $\frac{\sum_i{(x_i-\mu)^2}}{N}$, which makes sense to me as an average of the squared distances from the mean. But now, studying the binomial and Bernoulli distributions, I am seeing the variance expressed as $$E[(X-E[X])^2] = \sum_i{\left(x_i-\sum_j{x_jp(x_j)}\right)^2p(x_i)}$$ I understand that the sample mean is $\frac{\sum_i{x_i}}{N}$, while the expected value is a weighted version where probability is not evenly distributed, so $E[X] = \sum_i{x_ip(x_i)}$. But after so long seeing the variance expressed in the first form, I don't understand how this second equation makes sense, or how probability fits into variance.
So for any discrete random variable $X$ that takes values $x_i$ with probabilities $p(x_i)$, the mean value is given by:
\begin{equation} E[X] = \sum_i{x_ip(x_i)} \end{equation}
see here: https://en.wikipedia.org/wiki/Expected_value#Finite_case
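As a quick sanity check, here is a minimal sketch of that weighted sum in Python (the three-point distribution is an arbitrary example I made up, not anything from the question):

```python
# Expected value as a probability-weighted sum: E[X] = sum_i x_i * p(x_i).
# The values and probabilities below are an arbitrary made-up distribution.
values = [0, 1, 2]
probs = [0.5, 0.3, 0.2]  # must sum to 1

expected_value = sum(x * p for x, p in zip(values, probs))
print(expected_value)  # 0*0.5 + 1*0.3 + 2*0.2 = 0.7
```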
Now for the Bernoulli distribution in particular, the theoretical mean value is given by (as we have $x_0=0,\ x_1=1$):
\begin{equation} E[X] = p \end{equation}
see here: https://en.wikipedia.org/wiki/Bernoulli_distribution
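Writing the weighted sum out for the two Bernoulli outcomes makes the weighting explicit:
\begin{equation} E[X] = \sum_i{x_ip(x_i)} = 0\cdot(1-p) + 1\cdot p = p \end{equation}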
The variance for the Bernoulli is:
\begin{equation} Var(X) = E[(X-E[X])^2] = p(1-p) \end{equation}
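Expanding the definition over the same two outcomes shows where the $p(1-p)$ comes from:
\begin{equation} Var(X) = \sum_i{(x_i-p)^2p(x_i)} = (0-p)^2(1-p) + (1-p)^2p = p(1-p) \end{equation}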
Ultimately I believe you are confusing an empirical approximation of these statistics with their true, theoretical values, which we get by assuming $X$ is distributed according to that pmf.
Empirically, the variance of $N$ samples of any random variable $X$, whatever distribution it follows, is given by
\begin{equation} \frac{\sum_i{(x_i-\mu)^2}}{N} \end{equation}
where $\mu$ is the sample mean.
And if $X$ did indeed come from a Bernoulli distribution, we hope that as $N\rightarrow\infty$ the value of this empirical variance approaches $p(1-p)$.
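A short simulation illustrates that convergence (a minimal sketch using numpy; $p = 0.3$ and the sample sizes are arbitrary choices for illustration):

```python
import numpy as np

rng = np.random.default_rng(0)
p = 0.3  # arbitrary Bernoulli parameter for this illustration

for n in (10, 1_000, 100_000):
    samples = rng.binomial(1, p, size=n)  # n Bernoulli(p) draws
    # Empirical variance: sum of squared distances from the sample mean, over N
    empirical_var = np.mean((samples - samples.mean()) ** 2)
    print(n, round(empirical_var, 4), "theoretical:", p * (1 - p))
```

As $n$ grows, the printed empirical variance settles toward the theoretical $p(1-p) = 0.21$.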
I do believe you are conflating these two ideas, in part because they have such a similar form, especially for the Bernoulli.
As for "I don't get how probability fits into variance": you will find that for finite (discrete) distributions probability enters explicitly because the outcomes occur discretely, each with its own weight. Also remember that probability is just a model for our understanding of the world.
Implicitly, when you calculate the empirical variance you are assuming each $x_i$ was sampled in proportion to how often it occurs (i.e. according to its unseen, underlying distribution). In real life that sampling is done for us, so we don't need to include it in our formulation: each $x_i$ we observe is implicitly given to us according to some unseen $p(x_i)$. But in a theoretical model we have to describe this data-generation process ourselves, so we include a $p(x)$ explicitly.
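To make that last point concrete, here is a minimal sketch (the 0/1 data are arbitrary): if you estimate $p(x_i)$ by the observed frequency of each value, the weighted formula reproduces the plain empirical variance exactly.

```python
from collections import Counter

samples = [1, 0, 0, 1, 1, 0, 1, 1]  # arbitrary 0/1 data for illustration
N = len(samples)

# Estimate the unseen p(x_i) by the observed frequency of each value
p_hat = {x: c / N for x, c in Counter(samples).items()}

mu = sum(x * p for x, p in p_hat.items())  # weighted mean
var_weighted = sum((x - mu) ** 2 * p for x, p in p_hat.items())

# The plain "sum of squared distances over N" formula gives the same number
var_plain = sum((x - mu) ** 2 for x in samples) / N
print(var_weighted, var_plain)  # both 0.234375
```

So the two formulas are the same computation: the first form weights each observation equally by $\frac{1}{N}$, which is exactly the frequency weighting the second form writes out as $p(x_i)$.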