TY - CHAP
T1 - Job Offers Classifier Using Neural Networks and Oversampling Methods
AU - Ortiz, Germán
AU - Enguix, Gemma Bel
AU - Gómez-Adorno, Helena
AU - Ameer, Iqra
AU - Sidorov, Grigori
N1 - Publisher Copyright:
© 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.
PY - 2023
Y1 - 2023
N2 - Both policy and research benefit from a better understanding of individuals’ jobs. However, as large-scale administrative records are increasingly employed to represent labor market activity, new automatic methods to classify jobs will become necessary. We developed an automatic job offers classifier using a dataset collected from the largest job bank of Mexico known as Bumeran. We applied machine learning algorithms such as Support Vector Machines, Naive-Bayes, Logistic Regression, Random Forest, and deep learning Long-Short Term Memory (LSTM). Using these algorithms, we trained multi-class models to classify job offers in one of the 23 classes (not uniformly distributed): Sales, Administration, Call Center, Technology, Trades, Human Resources, Logistics, Marketing, Health, Gastronomy, Financing, Secretary, Production, Engineering, Education, Design, Legal, Construction, Insurance, Communication, Management, Foreign Trade, and Mining. We used the SMOTE, Geometric-SMOTE, and ADASYN synthetic oversampling algorithms to handle imbalanced classes. The proposed convolutional neural network architecture achieved the best results when applied the Geometric-SMOTE algorithm.
AB - Both policy and research benefit from a better understanding of individuals’ jobs. However, as large-scale administrative records are increasingly employed to represent labor market activity, new automatic methods to classify jobs will become necessary. We developed an automatic job offers classifier using a dataset collected from the largest job bank of Mexico known as Bumeran. We applied machine learning algorithms such as Support Vector Machines, Naive-Bayes, Logistic Regression, Random Forest, and deep learning Long-Short Term Memory (LSTM). Using these algorithms, we trained multi-class models to classify job offers in one of the 23 classes (not uniformly distributed): Sales, Administration, Call Center, Technology, Trades, Human Resources, Logistics, Marketing, Health, Gastronomy, Financing, Secretary, Production, Engineering, Education, Design, Legal, Construction, Insurance, Communication, Management, Foreign Trade, and Mining. We used the SMOTE, Geometric-SMOTE, and ADASYN synthetic oversampling algorithms to handle imbalanced classes. The proposed convolutional neural network architecture achieved the best results when applied the Geometric-SMOTE algorithm.
UR - http://www.scopus.com/inward/record.url?scp=85162229005&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85162229005&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-20153-0_18
DO - 10.1007/978-3-031-20153-0_18
M3 - Chapter
AN - SCOPUS:85162229005
T3 - Studies in Fuzziness and Soft Computing
SP - 235
EP - 248
BT - Studies in Fuzziness and Soft Computing
PB - Springer Science and Business Media Deutschland GmbH
ER -