So say you have the set of all unsigned $8$ bit grayscale, $N\times M$ images. This means there are $256^{NM}$ images in this space. If these images were binary, you could represent them with an $NM$ bit binary digit, but since your intensity values range from $0$ to $255$ this is not possible. I've been trying to come up with a way to order these images such that I can calculate a distance metric between two images. Intuitively it seems that would be possible because if you could come up with an ordering you could just find the distance from the origin (the origin would likely be the image with all $0$ values). I've looked around online but I've found nothing helpful so far.
2025-01-13 02:22:27.1736734947
Need help coming up with (or finding) an image metric for $N\times M$ image.
43 Views Asked by GregoryNeal https://math.techqa.club/user/gregoryneal/detail At
1
There are 1 best solutions below
Related Questions in IMAGE-PROCESSING
- Matrix values increasing after SVD, singular value decomposition
- Energy functions for CRF/MRF
- Need help coming up with (or finding) an image metric for $N\times M$ image.
- How to measure the irregularity of a hexagon?
- How to represent a given equation more clear, professional and short form?
- Solution for multiple level set equation
- Cauchy Schwarz inequality in Normalized Cross Correlation
- Gradient transpose along $x$ or $y$ direction in images
- Why do we scale by $\frac{1}{N-1}$ while calculating the covariance matrix in PCA?
- what do the coefficients means in this matrix multiplication?
Trending Questions
- Induction on the number of equations
- How to convince a math teacher of this simple and obvious fact?
- Refuting the Anti-Cantor Cranks
- Find $E[XY|Y+Z=1 ]$
- Determine the adjoint of $\tilde Q(x)$ for $\tilde Q(x)u:=(Qu)(x)$ where $Q:U→L^2(Ω,ℝ^d$ is a Hilbert-Schmidt operator and $U$ is a Hilbert space
- Why does this innovative method of subtraction from a third grader always work?
- What are the Implications of having VΩ as a model for a theory?
- How do we know that the number $1$ is not equal to the number $-1$?
- Defining a Galois Field based on primitive element versus polynomial?
- Is computer science a branch of mathematics?
- Can't find the relationship between two columns of numbers. Please Help
- Is there a bijection of $\mathbb{R}^n$ with itself such that the forward map is connected but the inverse is not?
- Identification of a quadrilateral as a trapezoid, rectangle, or square
- A community project: prove (or disprove) that $\sum_{n\geq 1}\frac{\sin(2^n)}{n}$ is convergent
- Alternative way of expressing a quantied statement with "Some"
Popular # Hahtags
real-analysis
calculus
linear-algebra
probability
abstract-algebra
integration
sequences-and-series
combinatorics
general-topology
matrices
functional-analysis
complex-analysis
geometry
group-theory
algebra-precalculus
probability-theory
ordinary-differential-equations
limits
analysis
number-theory
measure-theory
elementary-number-theory
statistics
multivariable-calculus
functions
derivatives
discrete-mathematics
differential-geometry
inequality
trigonometry
Popular Questions
- How many squares actually ARE in this picture? Is this a trick question with no right answer?
- What is the difference between independent and mutually exclusive events?
- Visually stunning math concepts which are easy to explain
- taylor series of $\ln(1+x)$?
- Determine if vectors are linearly independent
- What does it mean to have a determinant equal to zero?
- How to find mean and median from histogram
- Difference between "≈", "≃", and "≅"
- Easy way of memorizing values of sine, cosine, and tangent
- How to calculate the intersection of two planes?
- What does "∈" mean?
- If you roll a fair six sided die twice, what's the probability that you get the same number both times?
- Probability of getting exactly 2 heads in 3 coins tossed with order not important?
- Fourier transform for dummies
- Limit of $(1+ x/n)^n$ when $n$ tends to infinity
Trying to impose a total ordering on the set of images sounds like a really strange idea.
A slightly better idea would be to treat the images as if they were vectors in $\mathbb{R}^{NM}$ (if you make them grayscale), or in $\mathbb{R}^{3NM}$ (if you treat RGB channels separately). Now you can simply take Euclidean distance, or supremum norm, or any other metric already defined on $\mathbb{R}^d$. Depending on the application, you might consider transforming the colors into some other color space, like HSV, and only then treating the resulting image as a vector in $\mathbb{R}^d$.
An even better idea would be to look for more advanced feature selection methods, and then base your metric on them. You might start with a simple PCA (apparently, it was good enough for simple face recognition), and end with an arbitrarily complex machine learning algorithm that can actually understand what the image shows.