Stochastic gradient descent with finite samples sizes | IEEE Conference Publication | IEEE Xplore

Stochastic gradient descent with finite samples sizes


Abstract:

The minimization of empirical risks over finite sample sizes is an important problem in large-scale machine learning. A variety of algorithms has been proposed in the lit...Show More

Abstract:

The minimization of empirical risks over finite sample sizes is an important problem in large-scale machine learning. A variety of algorithms has been proposed in the literature to alleviate the computational burden per iteration at the expense of convergence speed and accuracy. Many of these approaches can be interpreted as stochastic gradient descent algorithms, where data is sampled from particular empirical distributions. In this work, we leverage this interpretation and draw from recent results in the field of online adaptation to derive new tight performance expressions for empirical implementations of stochastic gradient descent, mini-batch gradient descent, and importance sampling. The expressions are exact to first order in the step-size parameter and are tighter than existing bounds. We further quantify the performance gained from employing mini-batch solutions, and propose an optimal importance sampling algorithm to optimize performance.
Date of Conference: 13-16 September 2016
Date Added to IEEE Xplore: 10 November 2016
ISBN Information:
Conference Location: Vietri sul Mare, Italy

Contact IEEE to Subscribe

References

References is not available for this document.