Assume that we have a nonlinear dynamical model, e.g called transition function
$$\hat x = f(x, u)$$
And $y = x$ as our observability function.
According to Mathworks
Kalman filters are used to optimally estimate the variables of interests when they can't be measured directly, but an indirect measurement is available. They are also used to find the best estimate of states by combining measurements from various sensors in the presence of noise.
In this case, practical case, the observability function is often an identity matrix because states are assumed as they were close as measurements.
So my question is:
If Mathworks says that Kalman filters are used when the user need to measure the states, when the output is only known. Then what if the states is the outputs. I'm most cases, the states are the output. Especially in system identification.
So what's is the use case for kalman filter then?
As stated by the second sentence of the quoted text, in this case a Kalman filter can still be beneficial to minimize the impact of noise. Noises can both act on the state dynamics as the measurements. If there is no noise on the measurements then directly using the measurements will be best. If there is no noise on the dynamics then a Kalman filter will eventually use the measurements less and less and more and more rely on the prediction steps. However, in practice both will act to some degree on actual systems. Though, it might be that the not all assumptions need for a Kalman filter are satisfied, such as that noises are Gaussian zero mean white noise and that dynamics can be described by a linear model. But in many cases those assumptions give a close enough approximation of the actual system and thus Kalman filters at least yield some improvement over just using the output directly in this case.