How does second-order derivative information affect generalization (test) error?

The goal of this project is to empirically compare the generalization error of two stochastic optimization algorithms used in reinforcement learning, stochastic cubic regularized Newton (SCRN) and momentum-based SGD, on cost functions that satisfy the gradient dominance property.
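
For reference, one common way to state gradient dominance is sketched below. This is an assumed formulation, not necessarily the exact variant the project uses: the symbols $f^*$ (the minimum value of the cost $f$), $\tau > 0$, and $\alpha \in [1, 2]$ are introduced here for illustration only.

```latex
% Assumed formulation of gradient dominance (symbols are illustrative):
% f^* = \min_x f(x), \tau > 0 is a constant, and \alpha \in [1, 2].
\[
  f(x) - f^* \;\le\; \tau \, \lVert \nabla f(x) \rVert^{\alpha}
  \qquad \text{for all } x .
\]
% Special case \alpha = 2: this is the Polyak--Lojasiewicz (PL) condition
% \tfrac{1}{2} \lVert \nabla f(x) \rVert^{2} \ge \mu \, (f(x) - f^*)
% with \tau = 1 / (2\mu).
```

Under a condition of this form, every stationary point is a global minimizer, which is why such cost functions can be optimized to global optimality even when they are non-convex.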

