A simple SDE is given as: $$dX_t=\sigma X_tdW_t$$ Ito's lemma confirms that the following is a solution: $$X_t=X_0e^{\sigma W_t-\frac{1}{2}\sigma^2t}$$
I understand how this result is correct according to Ito's lemma, but I don't see why it makes any sense that the $-\frac{1}{2}\sigma^2t$ is there.
For example: Assume that a very unlikely thing happens, namely $W_t=0$ on some interval $t\in[0,a]$. I know that this will Almost Surely not happen, but assume it nontheless (I think the same conclusion should apply if $W_t$ simply coincidentally stays very very close to $0$).
Then we have a situation where $dW_t = 0$ over this interval, and therefore $dX_t=\sigma X_t*0=0$ over this interval. Hence $X_t$ should remain constant over this interval.
Nevertheless, we have $X_a=X_0e^{-\frac{1}{2}\sigma^2 * a}<X_0$, so that $X_t$ does not remain constant on the interval.
If we let go of the assumption that $W_t$ is exactly equal to $0$, but assume that it fluctuates extremely closely around $0$ for a relatively large amount of time, then we still get the seemingly incorrect result that $X_t$ drops to zero over time.
How do we explain this?
Let me formulate my question in less technical terms: If the change of $X$ depends only on the change of the brownian motion $W$, then why does $X$ go to zero over time according to a factor $e^{-\frac{1}{2}\sigma^2}$, regardless of how the Brownian motion moves? i.e. regardless of how close the Brownian motion remains to zero, $X$ still moves exponentially fast towards zero.
The first equation you've written, namely $$ dX_t = \sigma X_t dW_t \tag{1}$$ is simply a convenient, yet informal, way to denote the corresponding integral equation, it has no proper understanding apart from that. Without getting into strong versus weak solutions (which have to do with what probability spaces the processes are defined on), we say that $X_t$ solves $(1)$, with initial condition $x\in \mathbb{R}$, if it satisfies $$X_t = x + \int_0^t \sigma X_sdW_s \tag{2}$$ for all $t \geq 0,$ almost surely. That is, solutions to the differential equation are only taken to hold on a set of probability one, so the situation you're describing is a moot point. Moreover, it is improper to think about $dX_t(\omega)$ being a path-wise slope, it is simply a notational shorthand.