In the context of information theory, I am trying to maximize the following function (mutual information of the Z-channel's input and output) with respect to $p$ in order to derive Z-channel's capacity: $$I(X;Y)=\mathit{H}(ap)-\mathit{H}(1-a)p$$ where $\mathit{H}(x)=-xlog_2(x)-(1-x)log_2(1-x)$, $0<x<1$ is known as the binary entropy function. So the function is: $$I(X;Y)=-aplog_2(ap)-(1-ap)log_2(1-ap)-\mathit{H}(1-a)p$$ Differentiating with respect to $p$ I get: \begin{eqnarray*} \frac{\partial I(X;Y)}{\partial p}&=&-alog_2(ap)-ap\frac{1}{pln2}+alog_2(1-ap)-(1-ap)\frac{a}{ln2(ap-1)}-\mathit{H}(1-a)\\ &=&log_2((ap)^{-a}e^{-a}(1-ap)^{-a}e^aa^a(1-a)^{1-a})\\ &=&log_2((ap)^{-a}(1-ap)^{-a}a^a(1-a)^{1-a}) \end{eqnarray*} Then: \begin{eqnarray*} \frac{\partial I(X;Y)}{\partial p}=0&\Rightarrow &log_2((ap)^{-a}(1-ap)^{-a}a^a(1-a)^{1-a})=0\\ &\Rightarrow &(ap)^{-a}(1-ap)^{-a}a^a(1-a)^{1-a}=1\\ \end{eqnarray*} I can't solve this. Eventually, I am trying to prove that the value of $p$ that maximizes the function is: $$p=\frac{1}{a(1+2^{\mathit{H}(1-a)/a})}$$ More information on the Z-channel can be found here, but I am using different notation regarding the probabilities.
2026-04-06 01:22:51.1775438571
Maximizing sum of logarithms (Z-channel capacity)
399 Views Asked by Bumbble Comm https://math.techqa.club/user/bumbble-comm/detail At
1
There are 1 best solutions below
Related Questions in PROBABILITY
- How to prove $\lim_{n \rightarrow\infty} e^{-n}\sum_{k=0}^{n}\frac{n^k}{k!} = \frac{1}{2}$?
- Is this a commonly known paradox?
- What's $P(A_1\cap A_2\cap A_3\cap A_4) $?
- Prove or disprove the following inequality
- Another application of the Central Limit Theorem
- Given is $2$ dimensional random variable $(X,Y)$ with table. Determine the correlation between $X$ and $Y$
- A random point $(a,b)$ is uniformly distributed in a unit square $K=[(u,v):0<u<1,0<v<1]$
- proving Kochen-Stone lemma...
- Solution Check. (Probability)
- Interpreting stationary distribution $P_{\infty}(X,V)$ of a random process
Related Questions in FUNCTIONS
- Functions - confusion regarding properties, as per example in wiki
- Composition of functions - properties
- Finding Range from Domain
- Why is surjectivity defined using $\exists$ rather than $\exists !$
- What are the functions satisfying $f\left(2\sum_{i=0}^{\infty}\frac{a_i}{3^i}\right)=\sum_{i=0}^{\infty}\frac{a_i}{2^i}$
- Lower bound of bounded functions.
- Does there exist any relationship between non-constant $N$-Exhaustible function and differentiability?
- Given a function, prove that it's injective
- Surjective function proof
- How to find image of a function
Related Questions in DERIVATIVES
- Derivative of $ \sqrt x + sinx $
- Second directional derivative of a scaler in polar coordinate
- A problem on mathematical analysis.
- Why the derivative of $T(\gamma(s))$ is $T$ if this composition is not a linear transformation?
- Does there exist any relationship between non-constant $N$-Exhaustible function and differentiability?
- Holding intermediate variables constant in partial derivative chain rule
- How would I simplify this fraction easily?
- Why is the derivative of a vector in polar form the cross product?
- Proving smoothness for a sequence of functions.
- Gradient and Hessian of quadratic form
Related Questions in LOGARITHMS
- Confirmation of Proof: $\forall n \in \mathbb{N}, \ \pi (n) \geqslant \frac{\log n}{2\log 2}$
- Extracting the S from formula
- How to prove the following inequality (log)
- Rewriting $(\log_{11}5)/(\log_{11} 15)$
- How to solve this equation with $x$ to a logarithmic power?
- Show that $\frac{1}{k}-\ln\left(\frac{k+1}{k}\right)$ is bounded by $\frac{1}{k^2}$
- Why do we add 1 to logarithms to get number of digits?
- Is my method correct for to prove $a^{\log_b c} = c^{\log_b a}$?
- How to prove the inequality $\frac{1}{n}+\frac{1}{n+1}+\cdots+\frac{1}{2n-1}\geq \log (2)$?
- Unusual Logarithm Problem
Related Questions in INFORMATION-THEORY
- KL divergence between two multivariate Bernoulli distribution
- convexity of mutual information-like function
- Maximizing a mutual information w.r.t. (i.i.d.) variation of the channel.
- Probability of a block error of the (N, K) Hamming code used for a binary symmetric channel.
- Kac Lemma for Ergodic Stationary Process
- Encryption with $|K| = |P| = |C| = 1$ is perfectly secure?
- How to maximise the difference between entropy and expected length of an Huffman code?
- Number of codes with max codeword length over an alphabet
- Aggregating information and bayesian information
- Compactness of the Gaussian random variable distribution as a statistical manifold?
Trending Questions
- Induction on the number of equations
- How to convince a math teacher of this simple and obvious fact?
- Find $E[XY|Y+Z=1 ]$
- Refuting the Anti-Cantor Cranks
- What are imaginary numbers?
- Determine the adjoint of $\tilde Q(x)$ for $\tilde Q(x)u:=(Qu)(x)$ where $Q:U→L^2(Ω,ℝ^d$ is a Hilbert-Schmidt operator and $U$ is a Hilbert space
- Why does this innovative method of subtraction from a third grader always work?
- How do we know that the number $1$ is not equal to the number $-1$?
- What are the Implications of having VΩ as a model for a theory?
- Defining a Galois Field based on primitive element versus polynomial?
- Can't find the relationship between two columns of numbers. Please Help
- Is computer science a branch of mathematics?
- Is there a bijection of $\mathbb{R}^n$ with itself such that the forward map is connected but the inverse is not?
- Identification of a quadrilateral as a trapezoid, rectangle, or square
- Generator of inertia group in function field extension
Popular # Hahtags
second-order-logic
numerical-methods
puzzle
logic
probability
number-theory
winding-number
real-analysis
integration
calculus
complex-analysis
sequences-and-series
proof-writing
set-theory
functions
homotopy-theory
elementary-number-theory
ordinary-differential-equations
circles
derivatives
game-theory
definite-integrals
elementary-set-theory
limits
multivariable-calculus
geometry
algebraic-number-theory
proof-verification
partial-derivative
algebra-precalculus
Popular Questions
- What is the integral of 1/x?
- How many squares actually ARE in this picture? Is this a trick question with no right answer?
- Is a matrix multiplied with its transpose something special?
- What is the difference between independent and mutually exclusive events?
- Visually stunning math concepts which are easy to explain
- taylor series of $\ln(1+x)$?
- How to tell if a set of vectors spans a space?
- Calculus question taking derivative to find horizontal tangent line
- How to determine if a function is one-to-one?
- Determine if vectors are linearly independent
- What does it mean to have a determinant equal to zero?
- Is this Batman equation for real?
- How to find perpendicular vector to another vector?
- How to find mean and median from histogram
- How many sides does a circle have?
It is easier to separate the process using the chain rule instead of using the full equation for I: $$I=H(ap)-H(1-a)p$$ $$\frac{dI}{dp}=\frac{dH(ap)}{d(ap)}\frac{d(ap)}{dp}-H(1-a)=\frac{dH(ap)}{d(ap)}a-H(1-a)$$ Now we need to find $H'$ $$\frac{dH}{dx}=-log_2(x)-x\frac{1}{x ln2}+log_2(1-x)+(1-x)\frac{1}{(1-x)ln2}=\frac{-ln(x)-1+ln(1-x)+1}{ln2}=log_2(1-x)-log_2(x)$$ Substituting in the previous equation with $x=ap$: $$\frac{dI}{dp}=a(log_2(1-ap)-log_2(ap))-H(1-a)$$ Setting $dI/dp=0$ $$\frac{H(1-a)}{a}=log_2(1-ap)-log_2(ap)$$ $$2^{\frac{H(1-a)}{a}}=2^{log_2(1-ap)-log_2(ap)}=\frac{2^{log_2(1-ap)}}{2^{log_2(ap)}}=\frac{1-ap}{ap}$$ $$ap2^{\frac{H(1-a)}{a}}+ap=1$$ $$ap(2^{\frac{H(1-a)}{a}}+1)=1$$ $$p=\frac{1}{a(2^{\frac{H(1-a)}{a}}+1)}$$