I try to understand a specific part of the paper "Consistent shape maps via semidefinite programming", where a binary symmetric Input matrix $X^{in}$ is given with $X^{in} \in \{0,1\}^{nm \times nm}$ and an output matrix $X$, which has to fulfill certain constraints, should be found. They introduce an objective function $f$, summing up the element wise L1-norms between the input maps and output maps, where $X_{i,j}^{(in)} \in \{0,1\}^{m \times m}$ are again matrices:
$$
f=\sum_{i,j} \|X_{i,j}^{in}-X_{i,j}\|
$$
Now they rewrite $f$ and formulate out of this an optimization problem (which they then solve with convex relaxation). My problem is to understand why they introduce the rewriting with the following sentence: " As $X_{i,j}^{in}$ are binary matrices, it turns out whenever the elements of $X_{i,j}$are between $0$ and $1$ (i.e. when they are relaxed), we can rewrite $f$ as a linear function over $X$:
$$
f=\sum_{i,j}(\sum_{X_{i,j}^{in}(s,s')=0}X_{i,j}(s,s') + \sum_{X_{i,j}^{in}(s,s')=1}(1-X_{i,j}(s,s')) )
$$
So they allow the output matrix to take values between $0$ and $1$, but I don't get why this is needed! I would be happy about any kind of help, thanks a lot!
2026-04-20 13:56:34.1776693394
Relaxing the elements of a matrix
106 Views Asked by Bumbble Comm https://math.techqa.club/user/bumbble-comm/detail At
1
The reason they allow the output matrix to take values between 0 and 1 is simple: the problem is intractable if they do not. Binary programming problems are non-convex, and in the worst case solving them would require examining all $2^{n^2m^2}$ possible $\{0,1\}$ combinations. That quickly becomes intractable for reasonable values of $m$ and $n$.
One approach is to relax the problem by allowing the binary values to take any value in the interval $[0,1]$. The resulting problem is tractable. In some cases, you will find that the answer happens to satisfy the original $\{0,1\}$ requirements anyway. If so, congratulations, you won the lottery! You have found the solution to the original, non-relaxed problem.
In all likelihood, this will not happen; some of the values of $X_{ij}$ will be fractional. In that case, you have a few options:
As for why $f$ can be rewritten the way they've done it, that is straightforward. After all, if $y\in[0,1]$, $$x=0\quad\Longrightarrow\quad |x-y|=|-y| = y$$ $$x=1\quad\Longrightarrow\quad |x-y|=|1-y| = 1-y$$ Note that these simple rewrite rules don't actually require the relaxation! They are true even for the original binary problem. Since it's generally good to remove unnecessary nonlinearities from any optimization problem (even simple, polyhedral ones), this is a step worth taking.