Understanding the behavior of stochastic gradient descent (SGD) in the context of deep neural networks has raised lots of concerns recently. Along this line, we theoretically study a general form of gradient based optimization dynamics with unbiased noise, which unifies SGD and standard Langevin dynamics... (read more)
PDF AbstractMETHOD | TYPE | |
---|---|---|
![]() |
Stochastic Optimization |