TY - JOUR
T1 - Identification of microbial features in multivariate regression under false discovery rate control
AU - Srinivasan, Arun
AU - Xue, Lingzhou
AU - Zhan, Xiang
N1 - Publisher Copyright:
© 2022 Elsevier B.V.
PY - 2023/5
Y1 - 2023/5
N2 - In many microbiome studies, researchers often aim at detecting statistical associations between microbial taxa and multiple disease-related secondary phenotypes of interest, which are further investigated in downstream functional studies. Most existing approaches tackle this aim by analyzing one taxon at a time and then followed by multiple testing correction. However, the large number of microbial taxa poses a heavy multiple correction burden which often limits the power of discovery of the aforementioned individual taxon-based analyses. Moreover, complicated correlation structures among taxa poses grand challenges for multiple testing correction procedures to achieve a satisfactory performance (e.g., false discovery rate control). To address these potential limitations, a new approach is proposed to detect statistical associations between multiple responses and microbial features in a multivariate regression model, which models the correlations among responses to boost power of association discovery. By utilizing the knockoff filter technique, the proposed procedure also enjoys the property of finite-sample false discovery rate control. It is demonstrated through a comprehensive simulation study to show the validity and usefulness of our new method and apply the methodology to a data set collected from microbiome studies to gain additional biological insights.
AB - In many microbiome studies, researchers often aim at detecting statistical associations between microbial taxa and multiple disease-related secondary phenotypes of interest, which are further investigated in downstream functional studies. Most existing approaches tackle this aim by analyzing one taxon at a time and then followed by multiple testing correction. However, the large number of microbial taxa poses a heavy multiple correction burden which often limits the power of discovery of the aforementioned individual taxon-based analyses. Moreover, complicated correlation structures among taxa poses grand challenges for multiple testing correction procedures to achieve a satisfactory performance (e.g., false discovery rate control). To address these potential limitations, a new approach is proposed to detect statistical associations between multiple responses and microbial features in a multivariate regression model, which models the correlations among responses to boost power of association discovery. By utilizing the knockoff filter technique, the proposed procedure also enjoys the property of finite-sample false discovery rate control. It is demonstrated through a comprehensive simulation study to show the validity and usefulness of our new method and apply the methodology to a data set collected from microbiome studies to gain additional biological insights.
UR - http://www.scopus.com/inward/record.url?scp=85139060162&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85139060162&partnerID=8YFLogxK
U2 - 10.1016/j.csda.2022.107621
DO - 10.1016/j.csda.2022.107621
M3 - Article
AN - SCOPUS:85139060162
SN - 0167-9473
VL - 181
JO - Computational Statistics and Data Analysis
JF - Computational Statistics and Data Analysis
M1 - 107621
ER -