Learning the mechanism of collective microbial function via random community-media pairing
Abstract
In microbial systems, many biochemical functions arise from pathways encoded and executed at the community level. The collective nature of these functions complicates bottom-up efforts to determine each species’ contribution. Previous work has shown that regression over randomly sampled datasets of collective functions succeeds at predicting those functions. Building on this top-down idea, this paper asks whether regression can also reveal mechanistic insight into a cross-feeding relationship. For this, we propose extending the random sampling method to vary the growth environment, cultivating each community in the spent medium of another randomly constructed community. With a model-based analysis, we show that the new protocol extracts more mechanistic information, enabling assignment of species to the correct cross-feeding pathway steps and identification of species essential to the collective function, both achieved with simple LASSO regressions. More generally, our work illustrates that the utility of machine learning-based approaches can be greatly enhanced by a synergistic experimental design.
Related articles
Related articles are currently not available for this article.