TY - GEN
T1 - JAMES
T2 - 10th IEEE International Conference on Data Science and Advanced Analytics, DSAA 2023
AU - Yamashita, Michiharu
AU - Shen, Jia Tracy
AU - Tran, Thanh
AU - Ekhtiari, Hamoon
AU - Lee, Dongwon
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - In online job marketplaces, it is important to establish a well-defined job title taxonomy for various downstream tasks (e.g., job recommendation, users' career analysis, and turnover prediction). Job Title Normalization (JTN) is such a cleaning step to classify user-created non-standard job titles into normalized ones. However, solving the JTN problem is non-trivial with challenges: (1) semantic similarity of different job titles, (2) non-normalized user-created job titles, and (3) large-scale and long-tailed job titles in real-world applications. To this end, we propose a novel solution, named JAMES, that constructs three unique embeddings (i.e., graph, contextuat, and syntactic) of a target job title to effectively capture its various traits. We further propose a multi-aspect co-attention mechanism to attentively combine these embeddings, and employ neural logical reasoning representations to collaboratively estimate similarities between messy job titles and normalized job titles in a reasoning space. To evaluate JAMES, we conduct comprehensive experiments against ten competing models on a large-scale real-world dataset with over 350,000 job titles. Our experimental results show that JAMES significantly outperforms the best baseline by 10.06% in Precision@10 and by 17.52% in NDCG@10, respectively. To further facilitate the acquisition of normalized job titles for job-domain applications, our JAMES API is available at: https://tinyurl.con JAMES-job-title-mapping.
AB - In online job marketplaces, it is important to establish a well-defined job title taxonomy for various downstream tasks (e.g., job recommendation, users' career analysis, and turnover prediction). Job Title Normalization (JTN) is such a cleaning step to classify user-created non-standard job titles into normalized ones. However, solving the JTN problem is non-trivial with challenges: (1) semantic similarity of different job titles, (2) non-normalized user-created job titles, and (3) large-scale and long-tailed job titles in real-world applications. To this end, we propose a novel solution, named JAMES, that constructs three unique embeddings (i.e., graph, contextuat, and syntactic) of a target job title to effectively capture its various traits. We further propose a multi-aspect co-attention mechanism to attentively combine these embeddings, and employ neural logical reasoning representations to collaboratively estimate similarities between messy job titles and normalized job titles in a reasoning space. To evaluate JAMES, we conduct comprehensive experiments against ten competing models on a large-scale real-world dataset with over 350,000 job titles. Our experimental results show that JAMES significantly outperforms the best baseline by 10.06% in Precision@10 and by 17.52% in NDCG@10, respectively. To further facilitate the acquisition of normalized job titles for job-domain applications, our JAMES API is available at: https://tinyurl.con JAMES-job-title-mapping.
UR - https://www.scopus.com/pages/publications/85178995504
UR - https://www.scopus.com/pages/publications/85178995504#tab=citedBy
U2 - 10.1109/DSAA60987.2023.10302559
DO - 10.1109/DSAA60987.2023.10302559
M3 - Conference contribution
AN - SCOPUS:85178995504
T3 - 2023 IEEE 10th International Conference on Data Science and Advanced Analytics, DSAA 2023 - Proceedings
BT - 2023 IEEE 10th International Conference on Data Science and Advanced Analytics, DSAA 2023 - Proceedings
A2 - Manolopoulos, Yannis
A2 - Zhou, Zhi-Hua
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 9 October 2023 through 12 October 2023
ER -