Approximating the gradient of cross-entropy loss function (Academic Article)