TY - JOUR
T1 - Leveraging Semantics in WordNet to Facilitate the Computer-Assisted Coding of ICD-11
AU - Chen, Donghua
AU - Zhang, Runtong
AU - Qiu, Robin G.
N1 - Funding Information:
Manuscript received March 29, 2019; revised September 16, 2019; accepted October 19, 2019. Date of publication October 25, 2019; date of current version May 6, 2020. This work was supported in part by a key project of National Natural Science Foundation of China with Grant Number 71532002 and in part by a major project of the National Social Science Foundation of China with Grant Number 18ZDA086. (Corresponding author: Runtong Zhang.) D. Chen and R. Zhang are with the Department of Information Management, School of Economics and Management, Beijing Jiaotong University, Beijing 100044, China (e-mail: [email protected]; [email protected]).
Funding Information:
This work was supported in part by a key project of National Natural Science Foundation of China with Grant Number 71532002 and in part by a major project of the National Social Science Foundation of China with Grant Number 18ZDA086. (Corresponding author: Runtong Zhang.)
Publisher Copyright:
© 2013 IEEE.
Copyright:
Copyright 2020 Elsevier B.V., All rights reserved.
PY - 2020/5
Y1 - 2020/5
N2 - The International Classification of Diseases (ICD) not only serves as the bedrock for health statistics but also provides a holistic overview of every health aspect of life. This study aims to facilitate the computer-assisted coding of the 11th revision of the ICD (ICD-11) by leveraging the data structures of ICD-11 and semantics in WordNet. First, a computer-assisted coding framework using WordNet and ICD-11 application programming interface (API) is proposed. Secondly, a network based on entity relations in ICD-11 and synonym sets in WordNet, called CodeNet, is developed. Thirdly, an algorithm for generating ICD-11 code candidates from CodeNet with two tuning parameters on the basis of the input of disease-related text is illustrated. Finally, the discharge summaries in the Medical Information Mart for Intensive Care III database and textual information from ICD-11 entities are used to evaluate the proposed method. Experimental results indicate that the proposed coding method achieves a precision of 84% and a recall of 89% relative to a precision of 65% and a recall of 81% achieved with the existing ICD-11 API. The proposed method also outperforms other methods in the literature by reducing a failure rate of up to 8% in ICD-11 coding. The proposed thresholds of similarity and percentage can be applied to tuning the performance of our method to meet different coding needs. In sum, improving the new structures of ICD-11 with the use of semantics in WordNet can help develop more reliable computer-aided coding systems for ICD-11 coders.
AB - The International Classification of Diseases (ICD) not only serves as the bedrock for health statistics but also provides a holistic overview of every health aspect of life. This study aims to facilitate the computer-assisted coding of the 11th revision of the ICD (ICD-11) by leveraging the data structures of ICD-11 and semantics in WordNet. First, a computer-assisted coding framework using WordNet and ICD-11 application programming interface (API) is proposed. Secondly, a network based on entity relations in ICD-11 and synonym sets in WordNet, called CodeNet, is developed. Thirdly, an algorithm for generating ICD-11 code candidates from CodeNet with two tuning parameters on the basis of the input of disease-related text is illustrated. Finally, the discharge summaries in the Medical Information Mart for Intensive Care III database and textual information from ICD-11 entities are used to evaluate the proposed method. Experimental results indicate that the proposed coding method achieves a precision of 84% and a recall of 89% relative to a precision of 65% and a recall of 81% achieved with the existing ICD-11 API. The proposed method also outperforms other methods in the literature by reducing a failure rate of up to 8% in ICD-11 coding. The proposed thresholds of similarity and percentage can be applied to tuning the performance of our method to meet different coding needs. In sum, improving the new structures of ICD-11 with the use of semantics in WordNet can help develop more reliable computer-aided coding systems for ICD-11 coders.
UR - http://www.scopus.com/inward/record.url?scp=85077756627&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85077756627&partnerID=8YFLogxK
U2 - 10.1109/JBHI.2019.2949567
DO - 10.1109/JBHI.2019.2949567
M3 - Article
C2 - 31670684
AN - SCOPUS:85077756627
SN - 2168-2194
VL - 24
SP - 1469
EP - 1476
JO - IEEE Journal of Biomedical and Health Informatics
JF - IEEE Journal of Biomedical and Health Informatics
IS - 5
M1 - 8883053
ER -