Orthogonal diagonalization of dummy variables vectors covariance matrix

70 Views Asked by Bumbble Comm At 28 Mar 2026 - 11:27

Let $n$ be a positive integer, and let $(p_i)_{i \in \{1,\cdots,n\}}$ be a finite sequence of real numbers, that are assumed to be nonnegative and sum to $1$. Let us denote by $\mathbf{p}$ the column vector of the $p_i$'s.

We consider the $n\times n$ matrix $CV_p := Diag(p_1,\cdots ,p_n) - \mathbf{p}\mathbf{p}^T$.

Let also $Y$ be a random variable such that for all $i \in \{1, \cdots, n\}$, $\mathbb{P}[Y = i] = p_i$. Let us denote by $X$ the dummy-variable encoding of $Y$, that is, $X_i := \delta_{Y,i}$.

Then $CV_p$ is the covariance matrix of $X$: indeed, if $i,j \in \{1,\cdots,n\}$, $\mathbb{E}[(\delta_{Y,i} - \mathbb{E}[\delta_{Y,i}] )(\delta_{Y,j} - \mathbb{E}[\delta_{Y,j}])]$ equals $p_i(1-p_i)$ if $i=j$ and $-p_ip_j$ if $i \neq j$.

Indeed, $\mathbb{E}[\delta_{Y,i}] = p_i$, and

$\mathbb{E}[(\delta_{Y,i} - \mathbb{E}[\delta_{Y,i}] )(\delta_{Y,j} - \mathbb{E}[\delta_{Y,j}])]\\ = \mathbb{E}[(\delta_{Y,i} - p_i )(\delta_{Y,j} - p_j)]\\ = \mathbb{E}[\delta_{Y,i}\delta_{Y,j}] - p_ip_j - p_jp_i + p_ip_j\\ = \mathbb{E}[\delta_{Y,i}\delta_{Y,j}] - p_ip_j.$

Now, if $i=j$, $\mathbb{E}[\delta_{Y,i}\delta_{Y,j}] = \mathbb{E}[\delta_{Y,i}] = p_i$, so we get $\mathbb{E}[(\delta_{Y,i} - \mathbb{E}[\delta_{Y,i}] )(\delta_{Y,j} - \mathbb{E}[\delta_{Y,j}])] = \mathbb{E}[\delta_{Y,i}\delta_{Y,j}] - p_ip_j = p_i(1-p_i)$.

And if $i\neq j$, $\delta_{Y,i}\delta_{Y,j} = 0$, so $\mathbb{E}[(\delta_{Y,i} - \mathbb{E}[\delta_{Y,i}] )(\delta_{Y,j} - \mathbb{E}[\delta_{Y,j}])] = \mathbb{E}[\delta_{Y,i}\delta_{Y,j}] - p_ip_j = -p_ip_j$.

Is it possible to provide explicit (in terms of the $p_i$'s) and beautiful formulae for the spectrum of $CV_p$? Of an orthogonal diagonalization matrix?

Original Q&A

There are 1 best solutions below

Bumbble Comm On 30 Jan 2023 - 11:41 BEST ANSWER

This is an instance of a generally intractable problem (see here and here). Numerically, we can make some use of the structure of the matrix in order to quickly find eigenvalues using the BNS algorithm, as is explained here.

A few things that can be said about $CV_p$: $CV_p$ has rank $n-1$ as long as all $p_i$ are non-zero (more generally, the rank is equal to the size of the support of the distribution of $Y$). Its kernel is spanned by the vector $(1,1,\dots,1)$.

One strategy that might yield some insight is looking at the similar matrix $M = W^*C_pW$, where $W$ denotes the DFT matrix. The fact that the first column of $W$ spans the kernel means that the first row and column of $M$ will be zero. Empirically, there seems to be some kind of structure in the entries of the resulting matrix (for instance, repeated entries along the diagonal).

Orthogonal diagonalization of dummy variables vectors covariance matrix

There are 1 best solutions below

Related Questions in LINEAR-ALGEBRA

Related Questions in STATISTICS

Related Questions in DIAGONALIZATION

Related Questions in SYMMETRIC-MATRICES

Related Questions in ORTHOGONAL-MATRICES

Trending Questions

Popular # Hahtags

Popular Questions