TY - JOUR
T1 - A Novel Approach for Perceptions of Physician Decision-Making and Latent Topic Refinement in Large Language Model-Enhanced Medical Dialog Generation
AU - Zhang, Jinli
AU - Jiang, Junzhe
AU - Ma, Fenglong
AU - Jiang, Zongli
AU - Tan, Qi
AU - Zhou, Yongcheng
N1 - Publisher Copyright:
© 2012 IEEE.
PY - 2025
Y1 - 2025
N2 - The rapid advancement of medical dialog generation (MDG) techniques has enabled medical dialog systems (MDSs) to generate high-quality responses rich in medical expertise by integrating diverse medical information. However, they still encounter several challenges, including generic response generation, lack of semantic precision, and imprecise dialog topic extraction. This study aims to design a novel model to address these challenges simultaneously. Correspondingly, we propose the TRL-HMIE model, which represents transformer reinforcement learning (RL) for heterogeneous medical information extraction. In particular, we incorporate GPT-3 from transformer-related models as the reference language model. Our enhancements focused on three key aspects. First, we developed a conversation-topic classifier to precisely categorize conversation topics, supporting the conversation-topic locator module in generating reliable conversation topics. Second, the model employs a multihead attention mechanism to capture crucial information from the dialog context, facilitating the extraction of key dialog information and enhancing the accuracy of heterogeneous information extraction. Finally, the model integrates RL and a reward fusion mechanism, which, combined with its ability to handle multisource information and long dialog contexts, generates optimized rewards for the TRL-HMIE model, encouraging the production of doctor responses with precise semantics and dialog topics. The experimental results demonstrate that the proposed method achieves a 6.07% improvement over the benchmark model on the MedDG and MedDialog datasets. The experimental results demonstrate that the proposed method achieves a 6.07% improvement over the benchmark model on the MedDG and MedDialog datasets.
AB - The rapid advancement of medical dialog generation (MDG) techniques has enabled medical dialog systems (MDSs) to generate high-quality responses rich in medical expertise by integrating diverse medical information. However, they still encounter several challenges, including generic response generation, lack of semantic precision, and imprecise dialog topic extraction. This study aims to design a novel model to address these challenges simultaneously. Correspondingly, we propose the TRL-HMIE model, which represents transformer reinforcement learning (RL) for heterogeneous medical information extraction. In particular, we incorporate GPT-3 from transformer-related models as the reference language model. Our enhancements focused on three key aspects. First, we developed a conversation-topic classifier to precisely categorize conversation topics, supporting the conversation-topic locator module in generating reliable conversation topics. Second, the model employs a multihead attention mechanism to capture crucial information from the dialog context, facilitating the extraction of key dialog information and enhancing the accuracy of heterogeneous information extraction. Finally, the model integrates RL and a reward fusion mechanism, which, combined with its ability to handle multisource information and long dialog contexts, generates optimized rewards for the TRL-HMIE model, encouraging the production of doctor responses with precise semantics and dialog topics. The experimental results demonstrate that the proposed method achieves a 6.07% improvement over the benchmark model on the MedDG and MedDialog datasets. The experimental results demonstrate that the proposed method achieves a 6.07% improvement over the benchmark model on the MedDG and MedDialog datasets.
UR - https://www.scopus.com/pages/publications/105020288895
UR - https://www.scopus.com/pages/publications/105020288895#tab=citedBy
U2 - 10.1109/TNNLS.2025.3569168
DO - 10.1109/TNNLS.2025.3569168
M3 - Article
C2 - 41160764
AN - SCOPUS:105020288895
SN - 2162-237X
VL - 36
SP - 19974
EP - 19985
JO - IEEE Transactions on Neural Networks and Learning Systems
JF - IEEE Transactions on Neural Networks and Learning Systems
IS - 12
ER -