Standard deviation of length of compressed file with arithmetic coding

I am working through the exercises in David MacKay's book "Information Theory, Inference, and Learning Algorithms" (https://www.inference.org.uk/itprnn/book.pdf) and I am stuck on problem 6.9. That exercise asks for an estimate of the mean and the standard deviation of the length of a compressed file consisting of $N = 1000$ samples from a Bernoulli distribution with $p = 0.01$. I know that the mean length is about 83 bits, because the entropy of the random variable is $H(X) \approx 0.081$ bits, and the length of a file compressed with arithmetic coding is the entropy times the number of samples, plus about 2 bits of overhead. However, I am stuck on the standard deviation: I know from page 128 that the answer is 21 bits, but I don't understand why.
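For reference, the entropy and mean-length figures quoted in the question can be reproduced with a few lines of Python (a minimal sketch, not from the book; it just evaluates the formulas above):

```python
import math

p, N = 0.01, 1000

# Binary entropy of a Bernoulli(p) source, in bits per sample
H = -p * math.log2(p) - (1 - p) * math.log2(1 - p)

print(H)          # ~0.0808, the H(X) = 0.081 quoted above
print(N * H + 2)  # ~82.8, i.e. about 83 bits for the compressed file
```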
Edited: improved following OP's comment
For an arithmetic binary code of block length $N=1000$: since the arithmetic code is essentially a quasi-optimal code for the extended (joint) sequence, we can expect the length of any input to be very close to the "ideal" length $-\log_2(p_i)$ plus an excess of $\epsilon \approx 2$ bits, where $p_i$ is the joint probability of the full sequence. That is why we get a mean value of $N H(X) + \epsilon$.
To compute the variance, we can disregard the $\epsilon$ term and assume that the code length of each sequence is determined by its probability, and hence by its number of ones (which we call $k$):
$$ L = -\log_2\!\left[p^{k}(1-p)^{N-k}\right] = -N\log_2(1-p) + k \log_2(1/p-1) = a + b \, k \tag 1$$
where $a,b$ are given constants and $k$ and $L$ are random variables. In particular, $k$ is a Binomial with mean $Np$ and variance $N p (1-p)$.
Hence, with $b = \log_2(1/p-1) = \log_2 99 \approx 6.63$, the variance of $L$ is $$ \sigma_L^2 = b^2 \sigma_k^2 = b^2 \, N p (1-p) = 435.08\cdots$$
with a standard deviation of $\sqrt{435.08\cdots} \approx 20.9$, i.e. around $21$ bits.
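As a sanity check, here is a minimal Python sketch of this derivation (my own illustration, assuming NumPy is available; the Monte Carlo part samples $k$ from the Binomial model directly rather than running an actual arithmetic coder):

```python
import math
import numpy as np

p, N = 0.01, 1000
a = -N * math.log2(1 - p)   # constant term in L = a + b*k
b = math.log2(1 / p - 1)    # coefficient of k: log2(99) ~ 6.63

# Analytic variance: Var(L) = b^2 * Var(k) with k ~ Binomial(N, p)
var_L = b**2 * N * p * (1 - p)
print(var_L, math.sqrt(var_L))   # ~435.1 and ~20.9

# Monte Carlo check on the linear model L = a + b*k
rng = np.random.default_rng(0)
k = rng.binomial(N, p, size=100_000)
L = a + b * k
print(L.mean(), L.std())         # mean ~80.8 bits, std ~21 bits
```

The simulated mean is $N H(X) \approx 80.8$ bits, which matches the $83$ bits in the question once the $\epsilon \approx 2$ bits of coding overhead are added back.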