Abstract
ICD coding aims to automatically assign International Classification of Diseases (ICD) codes from unstructured clinical notes or discharge summaries, which saves human labor and reduces errors. Although several studies are proposed to solve this challenging task, none distinguishes the importance of different phrases with a word window. Intuitively, informative phrases should be more useful for the prediction. This paper proposes a feature compressed ICD coding model named Fusion to address this issue. In particular, we propose an attentive soft-pooling approach to compress the sparse and redundant word representations into informative and dense ones as local features. Besides, we use the key-query attention mechanism for modeling the inner relations among local features to generate the global features, which are further used to predict ICD codes. Experiments on two widely used datasets demonstrate that Fusion outperforms baselines. However, on the MIMIC-III Full dataset, we find that none of the state-of-the-art approaches significantly perform better than others. Thus, automated ICD coding is still a challenging task.
| Original language | English (US) |
|---|---|
| Title of host publication | Findings of the Association for Computational Linguistics |
| Subtitle of host publication | ACL-IJCNLP 2021 |
| Editors | Chengqing Zong, Fei Xia, Wenjie Li, Roberto Navigli |
| Publisher | Association for Computational Linguistics (ACL) |
| Pages | 2096-2101 |
| Number of pages | 6 |
| ISBN (Electronic) | 9781954085541 |
| DOIs | |
| State | Published - 2021 |
| Event | Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 - Virtual, Online Duration: Aug 1 2021 → Aug 6 2021 |
Publication series
| Name | Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 |
|---|
Conference
| Conference | Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 |
|---|---|
| City | Virtual, Online |
| Period | 8/1/21 → 8/6/21 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 3 Good Health and Well-being
All Science Journal Classification (ASJC) codes
- Language and Linguistics
- Linguistics and Language
Fingerprint
Dive into the research topics of 'Fusion: Towards Automated ICD Coding via Feature Compression'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver