How does mini-batching affect Curvature information for second order deep learning optimization?

Publication
In NeurIPS workshop on Beyond First Order Methods in Machine Learning
Date
Links

* Equal first authorship