Monthly Archives: June 2015
Revisiting Nesterov’s Acceleration
Nesterov’s accelerated gradient descent (AGD) is hard to understand. Since Nesterov’s 1983 paper people have tried to explain “why” acceleration is possible, with the hope that the answer would go beyond the mysterious (but beautiful) algebraic manipulations of the original … Continue reading
Posted in Optimization
7 Comments