I understand that a 2D Affine Transformation has 6 DOF and a 2D Homogeneous Transformation has 8 DOF. However, how can I identify what those independent paramters are?
If we consider Euclidean Transformation, it has 3 DOF: rotation, translation in x and translation in y. \begin{bmatrix}C_\theta&-S_\theta&t_x\\S_\theta&C_\theta&t_y\\0&0&1\end{bmatrix} If we consider Similarity transform, it has 4 DOF: rotation, translation in x, translation in y and scaling. \begin{bmatrix}sC_\theta&-sS_\theta&t_x\\sS_\theta&sC_\theta&t_y\\0&0&1\end{bmatrix}
1) Similarily, what makes up the 6 DOF of Affine matrix and 8 DOF of Homogeneous matrix?
2) Unlike the Euclidean and Similarity Transformation, is there no fixed set of DOF?
3) Can they be any six (if we take Affine as example) of rotation, translation (in x, y), scaling (in x, y), shearing, reflection etc. depending on the application?
4) If so, given an Affine matrix, can we know what the DOF are without knowledge of application?
Link1 says Affine transformation is a combination of translation, rotation, scale, aspect ratio and shear. Link2 says it consists of 2 rotations, 2 scaling and traslations (in x, y). Link3 indicates that it can be a combination of various different transformations.
I am a little confused about the whole idea. Thanks in advance.
I have finally come across an answer which I find convincing and satisfactory. The actual explanation is available in Page 40 of the book Hartley, Richard, and Andrew Zisserman. Multiple view geometry in computer vision. Cambridge university press, 2003..
Consider a planar (2D) affine transformation,
$H_A = \begin{bmatrix}A& t\\0&1\end{bmatrix}$.
The element
Ais a 2 x 2 non-singular matrix. This can be decomposed as:$A = R(\theta)R(-\phi)DR(\phi)$
where R is a rotation by the angle shown in the argument and $D = \begin{bmatrix}\lambda_1& 0\\0&\lambda_2\end{bmatrix}$.
Note that since there is a rotation first, the non-isotropic scaling is applied in directions different from those of the original geometry (so instead of scaling in the default x and y axis directions, scale at angle $\phi$ (wrt. x axis) and its orthogonal direction). Once scaling is applied in the required direction, the geometry/ shape is rotated back and then a required rotation is applied.
Compared to a similarity transformation the only new geometry is the non-isotropic scaling. This accounts for the two extra degrees of freedom: $phi$ and scaling ratio, which gives a total of 6 DoF (apart from 2 parameters for translation, 1 for rotation and 1 to fix the scaling).