An expectation-maximization algorithm enables accurate ecological modeling using longitudinal microbiome sequencing data

Page view(s)
25
Checked on Apr 16, 2024
An expectation-maximization algorithm enables accurate ecological modeling using longitudinal microbiome sequencing data
Title:
An expectation-maximization algorithm enables accurate ecological modeling using longitudinal microbiome sequencing data
Journal Title:
Microbiome
Keywords:
Publication Date:
22 August 2019
Citation:
Li, C., Chng, K.R., Kwah, J.S. et al. An expectation-maximization algorithm enables accurate ecological modeling using longitudinal microbiome sequencing data. Microbiome 7, 118 (2019). https://doi.org/10.1186/s40168-019-0729-z
Abstract:
Background: The dynamics of microbial communities is driven by a range of interactions from symbiosis to predator-prey relationships, the majority of which are poorly understood. With the increasing availability of high-throughput microbiome taxonomic profiling data, it is now conceivable to directly learn the ecological models that explicitly define microbial interactions and explain community dynamics. The applicability of these approaches is severely limited by the lack of accurate absolute cell density measurements (biomass). Methods: We present a new computational approach that resolves this key limitation in the inference of generalized Lotka-Volterra models (gLVMs) by coupling biomass estimation and model inference with an expectation-maximization algorithm (BEEM). Results: BEEM outperforms the state-of-the-art methods for inferring gLVMs, while simultaneously eliminating the need for additional experimental biomass data as input. BEEM’s application to previously inaccessible public datasets (due to the lack of biomass data) allowed us to construct ecological models of microbial communities in the human gut on a per-individual basis, revealing personalized dynamics and keystone species. Conclusions: BEEM addresses a key bottleneck in “systems analysis” of microbiomes by enabling accurate inference of ecological models from high throughput sequencing data without the need for experimental biomass measurements.
License type:
http://creativecommons.org/licenses/by/4.0/
Funding Info:
This work was supported by funding to the Genome Institute of Singapore from the Agency for Science, Technology and Research (A*STAR), Singapore.
Description:
ISSN:
2049-2618
Files uploaded:

File Size Format Action
s40168-019-0729-z.pdf 2.51 MB PDF Open