TY - CHAP
T1 - Job Offers Classifier Using Neural Networks and Oversampling Methods
AU - Ortiz, Germán
AU - Enguix, Gemma Bel
AU - Gómez-Adorno, Helena
AU - Ameer, Iqra
AU - Sidorov, Grigori
N1 - Funding Information:
Acknowledgements The work was done with partial support from the Mexican Government through the grants A1-S-47854 and CB A1-S-27780 of the CONACYT-Mexico, grants of PAPIIT-UNAM projects TA400121 and TA101722, and grants 20211784, 20211884, and 20211178 of the Secretaría de Investigación y Posgrado of the Instituto Politécnico Nacional, Mexico. The authors also thank the CONACYT for the computing resources brought to them through the Plataforma de Aprendizaje Profundo para Tecnologías del Lenguaje of the Laboratorio de Supercómputo of the INAOE, Mexico. The authors also acknowledge the support of the DiCoDe project.
Publisher Copyright:
© 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.
PY - 2023
Y1 - 2023
N2 - Both policy and research benefit from a better understanding of individuals’ jobs. However, as large-scale administrative records are increasingly employed to represent labor market activity, new automatic methods to classify jobs will become necessary. We developed an automatic job offers classifier using a dataset collected from the largest job bank of Mexico known as Bumeran. We applied machine learning algorithms such as Support Vector Machines, Naive-Bayes, Logistic Regression, Random Forest, and deep learning Long-Short Term Memory (LSTM). Using these algorithms, we trained multi-class models to classify job offers in one of the 23 classes (not uniformly distributed): Sales, Administration, Call Center, Technology, Trades, Human Resources, Logistics, Marketing, Health, Gastronomy, Financing, Secretary, Production, Engineering, Education, Design, Legal, Construction, Insurance, Communication, Management, Foreign Trade, and Mining. We used the SMOTE, Geometric-SMOTE, and ADASYN synthetic oversampling algorithms to handle imbalanced classes. The proposed convolutional neural network architecture achieved the best results when applied the Geometric-SMOTE algorithm.
AB - Both policy and research benefit from a better understanding of individuals’ jobs. However, as large-scale administrative records are increasingly employed to represent labor market activity, new automatic methods to classify jobs will become necessary. We developed an automatic job offers classifier using a dataset collected from the largest job bank of Mexico known as Bumeran. We applied machine learning algorithms such as Support Vector Machines, Naive-Bayes, Logistic Regression, Random Forest, and deep learning Long-Short Term Memory (LSTM). Using these algorithms, we trained multi-class models to classify job offers in one of the 23 classes (not uniformly distributed): Sales, Administration, Call Center, Technology, Trades, Human Resources, Logistics, Marketing, Health, Gastronomy, Financing, Secretary, Production, Engineering, Education, Design, Legal, Construction, Insurance, Communication, Management, Foreign Trade, and Mining. We used the SMOTE, Geometric-SMOTE, and ADASYN synthetic oversampling algorithms to handle imbalanced classes. The proposed convolutional neural network architecture achieved the best results when applied the Geometric-SMOTE algorithm.
UR - http://www.scopus.com/inward/record.url?scp=85162229005&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85162229005&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-20153-0_18
DO - 10.1007/978-3-031-20153-0_18
M3 - Chapter
AN - SCOPUS:85162229005
T3 - Studies in Fuzziness and Soft Computing
SP - 235
EP - 248
BT - Studies in Fuzziness and Soft Computing
PB - Springer Science and Business Media Deutschland GmbH
ER -