TY - JOUR
T1 - Pediatric Long COVID Subphenotypes
T2 - An EHR-based study from the RECOVER program
AU - on behalf of the RECOVER Consortium
AU - Lorman, Vitaly
AU - Bailey, L. Charles
AU - Song, Xing
AU - Rao, Suchitra
AU - Hornig, Mady
AU - Utidjian, Levon
AU - Razzaghi, Hanieh
AU - Mejias, Asuncion
AU - Leikauf, John Erik
AU - Brill, Seuli Bose
AU - Allen, Andrea
AU - Bunnell, H. Timothy
AU - Reedy, Cara
AU - Mosa, Abu Saleh Mohammad
AU - Horne, Benjamin D.
AU - Geary, Carol Reynolds
AU - Chuang, Cynthia H.
AU - Williams, David A.
AU - Christakis, Dimitri A.
AU - Chrischilles, Elizabeth A.
AU - Mendonca, Eneida A.
AU - Cowell, Lindsay G.
AU - McCorkell, Lisa
AU - Liu, Mei
AU - Cummins, Mollie R.
AU - Jhaveri, Ravi
AU - Blecker, Saul
AU - Forrest, Christopher B.
N1 - Publisher Copyright:
© 2025 Lorman et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
PY - 2025/4
Y1 - 2025/4
N2 - Pediatric Long COVID has been associated with a wide variety of symptoms, conditions, and organ systems, but distinct clinical presentations, or subphenotypes, are still being elucidated. In this exploratory analysis, we identified a cohort of pediatric (age <21) patients with evidence of Long COVID and no pre-existing complex chronic conditions using electronic health record data from 38 institutions and used an unsupervised machine learning-based approach to identify subphenotypes. Our method, an extension of the Phe2Vec algorithm, uses tens of thousands of clinical concepts from multiple domains to represent patients’ clinical histories to then identify groups of patients with similar presentations. The results indicate that cardiorespiratory presentations are most common (present in 54% of patients) followed by subphenotypes marked (in decreasing order of frequency) by musculoskeletal pain, neuropsychiatric conditions, gastrointestinal symptoms, headache, and fatigue.
AB - Pediatric Long COVID has been associated with a wide variety of symptoms, conditions, and organ systems, but distinct clinical presentations, or subphenotypes, are still being elucidated. In this exploratory analysis, we identified a cohort of pediatric (age <21) patients with evidence of Long COVID and no pre-existing complex chronic conditions using electronic health record data from 38 institutions and used an unsupervised machine learning-based approach to identify subphenotypes. Our method, an extension of the Phe2Vec algorithm, uses tens of thousands of clinical concepts from multiple domains to represent patients’ clinical histories to then identify groups of patients with similar presentations. The results indicate that cardiorespiratory presentations are most common (present in 54% of patients) followed by subphenotypes marked (in decreasing order of frequency) by musculoskeletal pain, neuropsychiatric conditions, gastrointestinal symptoms, headache, and fatigue.
UR - https://www.scopus.com/pages/publications/105002470370
UR - https://www.scopus.com/pages/publications/105002470370#tab=citedBy
U2 - 10.1371/journal.pdig.0000747
DO - 10.1371/journal.pdig.0000747
M3 - Article
C2 - 40208885
AN - SCOPUS:105002470370
SN - 2767-3170
VL - 4
JO - PLOS Digital Health
JF - PLOS Digital Health
IS - 4 April
M1 - e0000747
ER -