IEEE Access (Jan 2021)
Efficient Distributed Learning for Large-Scale Expectile Regression With Sparsity
Abstract
High-dimensional datasets often display heterogeneity due to heteroskedasticity or other forms of non-location-scale covariate effects. When such datasets become very large, it may be infeasible to store the entire dataset on a single machine, let alone keep it in memory. In this paper, we consider penalized expectile regression with smoothly clipped absolute deviation (SCAD) and adaptive LASSO penalties, which can effectively detect the heteroskedasticity of high-dimensional data. We propose a communication-efficient approach to distributed sparsity learning in which observations are randomly partitioned across machines. We show that, with appropriately chosen tuning parameters, the proposed estimators enjoy the oracle property. Extensive numerical experiments on both synthetic and real data validate the theoretical results and demonstrate the superior performance of the proposed method.
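To make the core object concrete: expectile regression replaces the symmetric squared loss with an asymmetrically weighted one, which is what lets the fitted coefficients vary across expectile levels and thereby reveal heteroskedasticity. The sketch below is illustrative only, not the paper's distributed or penalized estimator; it shows the asymmetric squared loss and a standard weighted-mean fixed-point iteration for the sample τ-expectile (function names are our own).

```python
import numpy as np

def expectile_loss(r, tau):
    # Asymmetric squared loss: residuals above zero get weight tau,
    # residuals below zero get weight 1 - tau. At tau = 0.5 this is
    # (half) the ordinary squared loss.
    w = np.where(r >= 0, tau, 1.0 - tau)
    return np.mean(w * r ** 2)

def sample_expectile(y, tau, n_iter=200):
    # The tau-expectile mu solves a weighted-mean fixed point:
    #   mu = sum(w_i * y_i) / sum(w_i),  w_i = tau if y_i >= mu else 1 - tau.
    # Iterating this (a simple IRLS scheme) converges to the expectile.
    y = np.asarray(y, dtype=float)
    mu = y.mean()
    for _ in range(n_iter):
        w = np.where(y >= mu, tau, 1.0 - tau)
        mu = np.sum(w * y) / np.sum(w)
    return mu
```

At τ = 0.5 the weights are constant and the expectile reduces to the mean; larger τ pulls the expectile toward the upper tail, which is why a family of expectile fits across τ exposes non-constant error variance.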
Keywords