Skip to main navigation Skip to search Skip to main content

Comparative Fine-Tuning of GPT-2 on Question Answering and Dialogue Datasets for Medical Text Generation

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Fine-tuning large language models (LLM) using medical data-sets presents significant opportunities for developing reliable and informative AI-driven health applications. This research investigates how different dataset structures (formatted question-answer (QA) pairs versus conversational doctor-patient dialogues) influence the effectiveness of a GPT-2-based generative model. Models trained on each dataset were evaluated using established NLP metrics (BLEU, ROUGE-1, ROUGE-L, BERTScore) and qualitative evaluations covering sentiment alignment, factual consistency (assessed via natural language inference), and readability. The results indicate that the QA-trained model achieves superior performance in semantic accuracy and sentiment alignment compared to the dialogue-based model, which produced responses that were marginally more readable. However, both models exhibited notably low factual entailment scores, highlighting an essential area for further improvement. These insights emphasize the importance of cautious dataset selection and model assessment strategies in clinical NLP. They also suggest promising directions for enhancing factual accuracy, domain specificity, and explanatory capabilities in future research.

Original languageEnglish (US)
Title of host publicationSEET - Software Engineering for Emerging Technologies - 1st International Conference, SEET 2025, Proceedings
EditorsShahid Hussain, Arif Ali Khan, Muhammad Abdul Basit Ur Rahim, Saif Ur Rehman Khan
PublisherSpringer Science and Business Media Deutschland GmbH
Pages575-590
Number of pages16
ISBN (Print)9783032089762
DOIs
StatePublished - 2026
Event1st International Conference on Software Engineering of Emerging Technologies, SEET 2025 - Long Beach, United States
Duration: Aug 11 2025Aug 12 2025

Publication series

NameCommunications in Computer and Information Science
Volume2725 CCIS
ISSN (Print)1865-0929
ISSN (Electronic)1865-0937

Conference

Conference1st International Conference on Software Engineering of Emerging Technologies, SEET 2025
Country/TerritoryUnited States
CityLong Beach
Period8/11/258/12/25

All Science Journal Classification (ASJC) codes

  • General Computer Science
  • General Mathematics

Fingerprint

Dive into the research topics of 'Comparative Fine-Tuning of GPT-2 on Question Answering and Dialogue Datasets for Medical Text Generation'. Together they form a unique fingerprint.

Cite this