This is the question from the book Mathematics for machine learning 
because using the identities from this book
the derivative of $g(y)$ w.r.t. $y$ should be $(y^T(S^{-1} + (S^T)^{-1})$ . which is not same as the answer so in the given identities what are the conditions on $X$ matrix
