Skip to main navigation Skip to search Skip to main content

A Data-Driven Digital Twin for Student Engagement Prediction in e-Learning Systems

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Machine Learning (ML) models are increasingly applied to Learning Management System (LMS) data to predict student engagement and performance. LMS data often contain missing values that can be informative. However, existing modeling approaches in education remove or impute missing values, which can lead to inaccurate or biased models. In this paper, we propose the use of digital twins to model students' engagement based on their learning activities on LMS while preserving the missingness patterns. We leveraged synthetic data generators such as Conditional Tabular Generative Adversarial Network (CTGAN), Tabular Variational Autoencoder (TVAE), and RealTabFormer with reversible data transformations to create a virtual replica of students' data. The CTGAN and TVAE generated balanced synthetic data that accurately captured the meaningful patterns of the real data. Moreover, XGBoost trained on a balanced virtual replica of the students' learning activities data obtained an F1-score of above 80% in predicting the students' engagement levels when evaluated on real data with both complete and incomplete entries. Our findings demonstrate how digital twins can be used to address the complexities of data in the education sector, improve the generalization of models, and reduce bias in real-world performance.

Original languageEnglish (US)
Title of host publication2025 IEEE 6th Annual World AI IoT Congress, AIIoT 2025
EditorsRajashree Paul
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages560-566
Number of pages7
ISBN (Electronic)9798331525088
DOIs
StatePublished - 2025
Event6th IEEE Annual World AI IoT Congress, AIIoT 2025 - Seattle, United States
Duration: May 28 2025May 30 2025

Publication series

Name2025 IEEE 6th Annual World AI IoT Congress, AIIoT 2025

Conference

Conference6th IEEE Annual World AI IoT Congress, AIIoT 2025
Country/TerritoryUnited States
CitySeattle
Period5/28/255/30/25

All Science Journal Classification (ASJC) codes

  • Artificial Intelligence
  • Computer Networks and Communications
  • Computer Science Applications
  • Computer Vision and Pattern Recognition
  • Safety, Risk, Reliability and Quality
  • Education

Fingerprint

Dive into the research topics of 'A Data-Driven Digital Twin for Student Engagement Prediction in e-Learning Systems'. Together they form a unique fingerprint.

Cite this