Hyper-parameter optimization for support vector machines using stochastic gradient descent and dual coordinate descent

W.e.i. Jiang; Sauleh Siddiqui

doi:10.1007/s13675-019-00115-7

EURO Journal on Computational Optimization (Mar 2020)

Hyper-parameter optimization for support vector machines using stochastic gradient descent and dual coordinate descent

W.e.i. Jiang,
Sauleh Siddiqui

Affiliations

W.e.i. Jiang: Department of Civil Engineering, Johns Hopkins System Institute, Johns Hopkins University, 3400 N Charles St, MD 21218, Baltimore, USA.
Sauleh Siddiqui: Department of Civil Engineering, Johns Hopkins University, MD 21218, Baltimore, USA.

DOI: https://doi.org/10.1007/s13675-019-00115-7
Journal volume & issue: Vol. 8, no. 1
pp. 85 – 101

Abstract

Read online

We developed a gradient-based method to optimize the regularization hyper-parameter, C, for support vector machines in a bilevel optimization framework. On the upper level, we optimized the hyper-parameter C to minimize the prediction loss on validation data using stochastic gradient descent. On the lower level, we used dual coordinate descent to optimize the parameters of support vector machines to minimize the loss on training data. The gradient of the loss function on the upper level with respect to the hyper-parameter, C, was computed using the implicit function theorem combined with the optimality condition of the lower-level problem, i.e., the dual problem of support vector machines. We compared our method with the existing gradient-based method in the literature on several datasets. Numerical results showed that our method converges faster to the optimal solution and achieves better prediction accuracy for large-scale support vector machine problems.

90–08

Published in EURO Journal on Computational Optimization

ISSN: 2192-4406 (Print); 2192-4414 (Online)
Publisher: Elsevier
Country of publisher: United Kingdom
LCC subjects: Technology: Technology (General): Industrial engineering. Management engineering: Applied mathematics. Quantitative methods; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://www.journals.elsevier.com/euro-journal-on-computational-optimization

About the journal

Abstract

Keywords