I'm a learner of machine learning and statistics and I have some experience with both of the subjects. However, until this day it has not yet been fully revealed to me what is the fundamental difference between the four following variables:
- Response variable
- Output variable
- Hidden variable
- Latent variable
To my knowledge, in data analysis our task in general is to find the functional relationship between explanatory observed data $x$ and the variable of interest $y$. That is, in general our goal in many modeling applications is to find a function $f$ such that:
$$f(x)=y,$$
using some data set $D={(x_1,y_1), (x_2,y_2), ..., (x_n, y_n)}$. I think the "response" and "output" variables are synonyms and generally refer to the $y$ variable.
But what about "hidden" and "latent" variables? What do they refer to? Are they also synonyms for $y$ or do they refer to the parameters of $f$?
Concrete and simple examples would be both sufficient and excellent answers, thank you!
UPDATE:
As requested, I will also add the following into the list of variables to be explained:
- independent variable
- dependent variable
- confound variable
A hidden variable is a variable that you cannot measure by a sample process. For instance, this appears in the hidden Markov model (HMM), where you can sample output data $(y_1,\ldots,y_n)$ but not the states $(x_1,\ldots,x_n)$ which led to the output.
Note that the difference is between observable and hidden variables. The observable variables are instantiated (there are values for these variables) while there are no values for the hidden variables.
In the HMM (which is a specific Bayesian network), given the observable (here output) variables (values for them), estimate the most probable values of the state variable that led to the output.
The initial application of the HMM is speech recognition by Viterbi in the 1960s. Basically, the output are spoken siblings (output observed) and the corresponding states (hidden) are the written (real) siblings. The aim is to find the most probable sequence of written siblings (sentences) corresponding to the spoken words. The associated algorithm is named after its inventor Viterbi.