My question is how to derive we can use sigmoid function 1/(1+e^(-w*x)) to map w*x to probability space?
what's the theory behind to prove sigmoid function is optimal method for mapping?
Thanks for you help.
2026-03-31 10:06:54.1774951614
How to derive the logistic regression formula?
73 Views Asked by Bumbble Comm https://math.techqa.club/user/bumbble-comm/detail At
1
The purpose of sigmoid function is not to "map $w^*x$ to probability space", it is to simulate threshold behaviour for the activation of perceptrons. It is a nice function $\sigma : \mathbb{R}^{length(w)}\mapsto [0 , 1]^{length(w)}$.
But those occurrences of the interval $[0 , 1]$ aren't meaningful probability spaces, they are just activation range of some perceptrons.