PLSDA-batch: a multivariate framework to correct for batch effects in microbiome data
Document type: Foreign-language journal article
First author: Wang, Yiwen
Authors: Wang, Yiwen; Cao, Kim-Anh Le
Author affiliations:
Keywords: microbiome data; multivariate; non-parametric; dimension reduction; batch effect correction
Journal: BRIEFINGS IN BIOINFORMATICS (Impact factor: 9.5; five-year impact factor: 10.6)
ISSN: 1467-5463
Year, volume, issue: 2023, Vol. 24, No. 2
Pages:
Indexed in: SCI
Abstract: Microbial communities are highly dynamic and sensitive to changes in the environment. Thus, microbiome data are highly susceptible to batch effects, defined as sources of unwanted variation that are not related to and obscure any factors of interest. Existing batch effect correction methods have been primarily developed for gene expression data. As such, they do not consider the inherent characteristics of microbiome data, including zero inflation, overdispersion and correlation between variables. We introduce new multivariate and non-parametric batch effect correction methods based on Partial Least Squares Discriminant Analysis (PLSDA). PLSDA-batch first estimates treatment and batch variation with latent components, then subtracts batch-associated components from the data. The resulting batch-effect-corrected data can then be input into any downstream statistical analysis. Two variants are proposed to handle unbalanced batch × treatment designs and to avoid overfitting when estimating the components via variable selection. We compare our approaches with popular methods for managing batch effects, namely, removeBatchEffect, ComBat and Surrogate Variable Analysis, in simulated data and three case studies using various visual and numerical assessments. We show that our three methods lead to competitive performance in removing batch variation while preserving treatment variation, especially for unbalanced batch × treatment designs. Our downstream analyses show selections of biologically relevant taxa. This work demonstrates that batch effect correction methods can improve microbiome research outputs. Reproducible code and vignettes are available on GitHub.
Classification number:
- Related literature
Other papers by the authors
-
Application of machine learning to explore the genomic prediction accuracy of fall dormancy in autotetraploid alfalfa
Authors: Zhang, Fan; Kang, Junmei; Long, Ruicai; Li, Mingna; He, Fei; Jiang, Xueqian; Yang, Changfu; Yang, Xijiang; Kong, Jie; Wang, Zhen; Yang, Qingchuan; Zhang, Zhiwu; Sun, Yan; Wang, Yiwen
Keywords:
-
Improving crop-livestock integration in China using numerical experiments at catchment and regional scales
Authors: Chen, Lei; Wang, Yiwen; Yang, Nian; Zhu, Kaihang; Yan, Xiaoman; Shen, Zhenyao; Bai, Zhaohai; Zhai, Limei
Keywords: Agriculture; Crop-livestock integration; Fertilizer; Crop yield; Nonpoint source pollution
-
Propionate poses antivirulence activity against Botrytis cinerea via regulating its metabolism, infection cushion development and overall pathogenic factors
Authors: Zhu, Chuanxi; Tang, Yan; Ren, Dandan; Ren, Weiheng; Xue, Yongjun; Wang, Yiwen; Xu, Ling; Zhu, Pinkuan; Suthaparan, Aruppillai; Li, Jufen
Keywords: Gray mold; Infection cushion; Methylcitrate cycle; Pathogenicity; Transcriptome
-
RPA-CRISPR/Cas12a Combined with Rolling Circle Amplification-Enriched DNAzyme: A Homogeneous Photothermal Sensing Strategy for Plant Pathogens
Authors: Liu, Yanlin; Ma, Lanrui; Xie, Longyingzi; Wu, Qi; Wang, Yiwen; Zhou, Yan; Zhang, Yaohai; Jiao, Bining; He, Yue; Liu, Wenjing
Keywords: recombinase polymerase amplification; CRISPR; Cas12a; rolling circle amplification; G-quadruplex; portable detection; plant pathogens
-
Ascorbic acid-mediated in situ growth of gold nanostars for photothermal immunoassay of ochratoxin A
Authors: Wang, Yiwen; Xie, Longyingzi; Ma, Lanrui; Wu, Qi; Liu, Yanlin; Zhao, Qiyang; Zhang, Yaohai; Jiao, Bining; He, Yue; Li, Zhixia
Keywords: Photothermal immunoassay; Gold nanostars; Tyramine signal amplification; Portable detection; Ochratoxin A
-
Effects of Aging on Labor-Intensive Crop Production from the Perspectives of Landform and Life Cycle Labor Supply: Evidence from Chinese Apple Growers
Authors: Fang, Pingping; Wang, Yiwen; Lin, Guanghua; Abler, David
Keywords: aging; factor substitution; labor-intensive; landform; mechanization
-
The complete reference genome for grapevine (Vitis vinifera L.) genetics and breeding
Authors: Shi, Xiaoya; Leng, Xiangpeng; Cao, Shuo; Wang, Xu; Huang, Siyang; Wang, Yue; Liu, Zhongjie; Liu, Wenwen; Peng, Yanling; Wang, Nan; Wang, Yiwen; Ma, Zhiyao; Xu, Xiaodong; Zhang, Fan; Xiao, Hua; Zhou, Yongfeng; Zhong, Haixia; Wu, Xinyu; Wang, Yi; Zhang, Kekun; Fang, Yuling; Velt, Amandine; Avia, Komlan; Rustenholz, Camille; Holtgraewe, Daniela; Grimplet, Jerome; Matus, Jose Tomas; Ware, Doreen; Wang, Haibo; Liu, Chonghuai; Cheng, Zongming
Keywords: