TY - JOUR
T1 - ROPU
T2 - A robust online positive-unlabeled learning algorithm
AU - Liang, Xijun
AU - Zhu, Kaili
AU - Xiao, An
AU - Wen, Ya
AU - Zhang, Kaili
AU - Wang, Suhang
AU - Jian, Ling
N1 - Publisher Copyright:
© 2024 Elsevier B.V.
PY - 2025/1/30
Y1 - 2025/1/30
N2 - Positive-unlabeled (PU) learning aims to train classifiers using positive and unlabeled samples. Most methods assume a selected completely at random labeling scenario, which may not reflect real-world PU learning conditions. Our investigation across multiple peptide spectrum match datasets reveals a nonuniform distribution of labeled positive samples, concentrated in specific subsets. To address this, we propose a “missing in a subset area” labeling assumption and analyze resulting model biases. Furthermore, we introduce nonconvex loss functions and develop a robust online positive-unlabeled (ROPU)classification algorithm using gradient descent. Theoretically, ROPU achieves sublinear nonstationary regret bounds under mild conditions. Experimental results demonstrate the effectiveness of ROPU across various simulated and practical PU learning datasets. The source code is available at https://github.com/Isaac-QiXing/ROPU.
AB - Positive-unlabeled (PU) learning aims to train classifiers using positive and unlabeled samples. Most methods assume a selected completely at random labeling scenario, which may not reflect real-world PU learning conditions. Our investigation across multiple peptide spectrum match datasets reveals a nonuniform distribution of labeled positive samples, concentrated in specific subsets. To address this, we propose a “missing in a subset area” labeling assumption and analyze resulting model biases. Furthermore, we introduce nonconvex loss functions and develop a robust online positive-unlabeled (ROPU)classification algorithm using gradient descent. Theoretically, ROPU achieves sublinear nonstationary regret bounds under mild conditions. Experimental results demonstrate the effectiveness of ROPU across various simulated and practical PU learning datasets. The source code is available at https://github.com/Isaac-QiXing/ROPU.
UR - http://www.scopus.com/inward/record.url?scp=85211044251&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85211044251&partnerID=8YFLogxK
U2 - 10.1016/j.knosys.2024.112808
DO - 10.1016/j.knosys.2024.112808
M3 - Article
AN - SCOPUS:85211044251
SN - 0950-7051
VL - 309
JO - Knowledge-Based Systems
JF - Knowledge-Based Systems
M1 - 112808
ER -