Derivative of Softmax loss function issue


I am trying to use the chain rule to derive the gradient of the loss for the Softmax function, but I am stuck. In this great answer: I can't see why the derivative of log(p_k) with respect to o is taken to be 1/p_k and not 1/(p_k * ln(10)).
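A quick numerical check may help here. This sketch assumes the log in the linked answer is the natural logarithm, which is the usual convention for cross-entropy loss, though the answer itself would have to confirm it. The helper `numeric_derivative` and the value `p_k = 0.3` are made up for illustration. It compares the slope of ln(p) and of log10(p) at p_k against the two candidate formulas.

```python
import math

def numeric_derivative(f, x, h=1e-6):
    # Central finite-difference approximation of f'(x).
    return (f(x + h) - f(x - h)) / (2 * h)

p_k = 0.3  # arbitrary example probability, chosen only for illustration

# Slope of the natural log at p_k; this should match 1 / p_k.
d_ln = numeric_derivative(math.log, p_k)

# Slope of the base-10 log at p_k; this should match 1 / (p_k * ln(10)).
d_log10 = numeric_derivative(math.log10, p_k)

print("d/dp ln(p):   ", d_ln, "vs 1/p_k =", 1 / p_k)
print("d/dp log10(p):", d_log10, "vs 1/(p_k*ln(10)) =", 1 / (p_k * math.log(10)))
```

If the loss is defined with the natural log, the factor 1/ln(10) never appears. With a base-10 log it would appear, but only as a constant that rescales the whole gradient. Note also that 1/p_k is the derivative with respect to p_k; the derivative with respect to o still needs the chain-rule factor dp_k/do.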