The linear ridge regression loss function: $$ J(\beta)=\sum_{i=1}^n(x_i^T\beta-y_i)^2+\lambda\sum_{j=1}^p\beta_j^2= \Vert X\beta-Y \Vert^2 + \lambda\beta^T\beta \text{ (matrix form)} $$ where the $x_i$'s are the input vectors, the $y_i$'s are the outputs (observations), $\beta$ is the vector of coefficients, and the $\beta_j$'s are the elements of $\beta$, has the solution: $$ \hat{\beta}=(X^TX+\lambda I)^{-1}X^TY $$
On the other hand, in my textbook, it is said that by setting the derivative of $J(\beta)$ to $0$, we can obtain the solution $\hat{\beta}$ of the form: $$ \hat{\beta}=\Sigma_{i=1}^n \alpha_ix_i \tag{*} $$ where: $$ \alpha_i=\frac{-1}{\lambda}(x_i^T\beta-y_i) $$
How do we obtain (*)?
Minimizing $J(\beta)$ is equivalent to minimizing $\tilde{J}(\beta) = \frac{1}{2}\sum_{i=1}^n(x_i^T\beta-y_i)^2 + \frac{1}{2}\lambda\beta^T\beta$, since scaling by $\frac{1}{2}$ does not change the minimizer. Its gradient is $\nabla \tilde{J}(\beta)=\sum_{i=1}^n(x_i^T\beta-y_i)x_i + \lambda \beta$. Setting $\nabla \tilde{J}(\beta)$ to zero gives $\sum_{i=1}^n(x_i^T\beta-y_i)x_i + \lambda \beta = 0$, that is, $\beta = -\frac{1}{\lambda}\sum_{i=1}^n(x_i^T\beta-y_i)x_i$, a fixed-point equation for the coefficients $\beta$. Evaluated at the minimizer $\hat{\beta}$, this reads $\hat{\beta} = \sum_{i=1}^n \alpha_i x_i$ with $\alpha_i = -\frac{1}{\lambda}(x_i^T\hat{\beta}-y_i)$, which is exactly (*).
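A quick numeric sanity check (a minimal sketch with randomly generated data, all names hypothetical): compute the closed-form ridge solution $\hat{\beta}=(X^TX+\lambda I)^{-1}X^TY$, form the $\alpha_i$'s from the fixed-point equation, and confirm that $\sum_i \alpha_i x_i$ reproduces $\hat{\beta}$:

```python
import numpy as np

# Hypothetical small problem: n observations, p features
rng = np.random.default_rng(0)
n, p, lam = 20, 5, 0.7
X = rng.normal(size=(n, p))
Y = rng.normal(size=n)

# Closed-form (primal) ridge solution: (X^T X + lambda I)^{-1} X^T Y
beta_hat = np.linalg.solve(X.T @ X + lam * np.eye(p), X.T @ Y)

# Dual coefficients alpha_i = -(1/lambda)(x_i^T beta_hat - y_i)
alpha = -(X @ beta_hat - Y) / lam

# beta_hat as a linear combination of the input vectors: sum_i alpha_i x_i = X^T alpha
beta_dual = X.T @ alpha
print(np.allclose(beta_hat, beta_dual))  # True
```

The last line is just the fixed-point equation rearranged, so the two expressions agree to machine precision.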