TY - GEN
T1 - Fast training for Large-Scale One-versus-All linear classifiers using Tree-Structured initialization
AU - Fang, Huang
AU - Cheng, Minhao
AU - Hsieh, Cho Jui
AU - Friedlander, Michael
N1 - Publisher Copyright:
Copyright © 2019 by SIAM.
PY - 2019
Y1 - 2019
N2 - We consider the problem of training one-versus-all (OVA) linear classifiers for multiclass or multilabel classification when the number of labels is large. A naive extension of OVA to this problem, even with hundreds of cores, usually requires hours for training on large real world datasets. We propose a novel algorithm called OVA-Primal++ that speeds up the training of OVA by using a tree-structured training order, where each classifier is trained using its parent’s classifier as initialization. OVA-Primal++ is both theoretically and empirically faster than the naive OVA algorithm, and yet still enjoys the same highly parallelizability and small memory footprint. Extensive experiments on multiclass and multilabel classification datasets validate the effectiveness of our method.
AB - We consider the problem of training one-versus-all (OVA) linear classifiers for multiclass or multilabel classification when the number of labels is large. A naive extension of OVA to this problem, even with hundreds of cores, usually requires hours for training on large real world datasets. We propose a novel algorithm called OVA-Primal++ that speeds up the training of OVA by using a tree-structured training order, where each classifier is trained using its parent’s classifier as initialization. OVA-Primal++ is both theoretically and empirically faster than the naive OVA algorithm, and yet still enjoys the same highly parallelizability and small memory footprint. Extensive experiments on multiclass and multilabel classification datasets validate the effectiveness of our method.
UR - http://www.scopus.com/inward/record.url?scp=85066105980&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85066105980&partnerID=8YFLogxK
U2 - 10.1137/1.9781611975673.32
DO - 10.1137/1.9781611975673.32
M3 - Conference contribution
AN - SCOPUS:85066105980
T3 - SIAM International Conference on Data Mining, SDM 2019
SP - 280
EP - 288
BT - SIAM International Conference on Data Mining, SDM 2019
PB - Society for Industrial and Applied Mathematics Publications
T2 - 19th SIAM International Conference on Data Mining, SDM 2019
Y2 - 2 May 2019 through 4 May 2019
ER -