Empirical Analysis Of Optimization And Generalization Of Deep Neural Networks