Confusing definition of cross-entropy loss. Referring to the Stanford lecture notes (http://cs231n.github.io/linear-classify/): you are calling log loss the same thing as cross-entropy loss.
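
For context, here is a minimal sketch of how the two terms are commonly related. It assumes the usual textbook definitions: cross-entropy between a true distribution p and a predicted distribution q, and log loss for a single binary label. The function names `cross_entropy` and `binary_log_loss` are made up for illustration and are not taken from the lecture notes or from any library.

```python
import math

def cross_entropy(p_true, q_pred):
    """Cross-entropy H(p, q) = -sum_k p_k * log(q_k) between two distributions."""
    # Terms with p_k == 0 contribute nothing, so they are skipped.
    return -sum(p * math.log(q) for p, q in zip(p_true, q_pred) if p > 0)

def binary_log_loss(y, p):
    """Log loss for one binary label y in {0, 1} with predicted P(y=1) = p."""
    return -(y * math.log(p) + (1 - y) * math.log(1 - p))

# Hypothetical example values: true label 1, predicted probability 0.8.
y, p = 1, 0.8

# Write the binary case as two-class distributions: [P(class 0), P(class 1)].
ce = cross_entropy([1 - y, y], [1 - p, p])
ll = binary_log_loss(y, p)

print(ce, ll)
```

Under these assumptions the two values agree in the two-class case, which may be why the terms get used interchangeably. With more than two classes, `cross_entropy` still applies to a one-hot label and a softmax output, while the binary formula does not, so it would help to say which one is meant.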