Synthetic Data Digital Twins and Data Trusts Control for Privacy in Health Data Sharing

Richard K. Lomotey, Sandra Kumi, Madhurima Ray, Ralph Deters

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Abstract

Health data sharing is very valuable for medical research because it can improve diagnostics, policy, medication, and so on. At the same time, health data must be shared without compromising the privacy of patients and stakeholders. However, recent advances in AI/ML and sophisticated analytics have been shown to introduce biases that make it easy to identify patients from their healthcare data, which violates privacy. In this work, we seek to address this major issue by exploring two emerging topics that are gaining attention from industry, academia, and governments: digital twins and data trusts. First, we propose the use of digital twins (DTs) to generate synthetic records of patients' heart rate data. The DTs are virtual replicas of the actual data and were created using two synthetic data generative models: Gaussian Copula (GC) and Tabular Variational Autoencoder (TVAE). The GC and TVAE achieved maximum data quality scores of 88% and 96%, respectively. Next, we posit that the DTs should be shared with a data trusts layer. Data trusts are fiduciary frameworks that govern multi-party data sharing. The data trusts enforce access controls (location-based, role-based, and policy-based) on the synthetic health data and report to the data subject. Preliminary evaluations show that merging the two techniques (i.e., synthetic data digital twins and data trusts) enforces better privacy for health data access. The synthetic data provides stronger anonymization, while the data trusts provide easy auditing, tracking, and efficient reporting to the patient or data subject. The paper also details the architectural design of the data trusts and evaluates the efficiency of the access control techniques.
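As a rough illustration of the access-control idea described in the abstract, the sketch below shows how a data trusts layer might combine role-based, location-based, and policy-based checks and keep an audit trail that can be reported to the data subject. It is not the authors' implementation: the class names (`AccessRequest`, `DataTrust`), the rule sets, and the sample roles, locations, and purposes are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone


@dataclass(frozen=True)
class AccessRequest:
    """A request to read synthetic health data (all fields are hypothetical)."""
    requester: str
    role: str
    location: str
    purpose: str


@dataclass
class DataTrust:
    """Toy data trusts layer; the rule sets are assumptions, not the paper's."""
    allowed_roles: frozenset
    allowed_locations: frozenset
    allowed_purposes: frozenset
    audit_log: list = field(default_factory=list)

    def decide(self, request: AccessRequest) -> bool:
        # Evaluate each access-control dimension independently.
        checks = {
            "role": request.role in self.allowed_roles,
            "location": request.location in self.allowed_locations,
            "policy": request.purpose in self.allowed_purposes,
        }
        granted = all(checks.values())
        # Record every decision so it can be audited and reported later.
        self.audit_log.append({
            "time": datetime.now(timezone.utc).isoformat(),
            "requester": request.requester,
            "granted": granted,
            "failed_checks": [name for name, ok in checks.items() if not ok],
        })
        return granted

    def report_for_data_subject(self) -> list:
        # One human-readable line per access attempt.
        return [
            f"{e['requester']}: {'granted' if e['granted'] else 'denied'}"
            for e in self.audit_log
        ]


# Hypothetical configuration and requests.
trust = DataTrust(
    allowed_roles=frozenset({"researcher"}),
    allowed_locations=frozenset({"CA", "US"}),
    allowed_purposes=frozenset({"cardiology-study"}),
)
ok = trust.decide(AccessRequest("alice", "researcher", "CA", "cardiology-study"))
denied = trust.decide(AccessRequest("bob", "marketer", "CA", "advertising"))
report = trust.report_for_data_subject()
```

Here a request is granted only when all three checks pass, and every attempt, granted or denied, is logged. A real data trust would likely use richer policies and tamper-resistant logging; this sketch only shows the shape of the decision and reporting flow.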

Original language: English (US)
Title of host publication: SaT-CPS 2024 - Proceedings of the 2024 ACM Workshop on Secure and Trustworthy Cyber-Physical Systems
Publisher: Association for Computing Machinery, Inc
Pages: 1-10
Number of pages: 10
ISBN (Electronic): 9798400705564
State: Published - Jun 21 2024
Event: 4th ACM Workshop on Secure and Trustworthy Cyber-Physical Systems, SaT-CPS 2024, held in conjunction with the 14th ACM Conference on Data and Application Security and Privacy, CODASPY 2024 - Porto, Portugal
Duration: Jun 21 2024 → …

Conference

Conference: 4th ACM Workshop on Secure and Trustworthy Cyber-Physical Systems, SaT-CPS 2024, held in conjunction with the 14th ACM Conference on Data and Application Security and Privacy, CODASPY 2024
Country/Territory: Portugal
City: Porto
Period: 6/21/24 → …

All Science Journal Classification (ASJC) codes

  • Artificial Intelligence
  • Computer Networks and Communications
  • Hardware and Architecture
  • Human-Computer Interaction
  • Control and Optimization
