Mini-Batching, Gradient-Clipping, First-versus Second-Order