Advancement: Bayesian Modeling of Interaction between Features in Sparse Multivariate Count Data with Application to Microbiome Study

Shuangjie Zhang
Statistical Science PhD Student
Location: Virtual Event
Advisor: Juhee Lee

Join us on Zoom: https://ucsc.zoom.us/j/99362800152?pwd=a0cxdUhFVG1nc09vdGZ2UENyRmE4UT09 / Passcode: 627807

Description: Many statistical methods have been developed for the analysis of microbial community profiles, but because typical microbiome measurements are complex, inferring interactions between microbial features remains challenging. We develop a Bayesian zero-inflated rounded log-normal kernel method to model interactions between microbial features in a community using multivariate count data, in the presence of covariates and excess zeros. The model constructs the interaction structure by imposing joint sparsity on the covariance matrix of the kernel, and obtains a reliable estimate of the structure even with a small sample size. The model also includes zero inflation to account for the excess zeros observed in the data, and it infers differential abundance of microbial features associated with covariates through log-linear regression. We provide simulation studies and real data analysis examples to demonstrate the developed model. Comparisons with a simpler model and popular alternatives in simulation studies show that, in addition to providing important insight into feature interactions, the model yields superior parameter estimates and model fit in various settings.
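For readers unfamiliar with this kind of data, the following is a minimal illustrative sketch of how zero-inflated, rounded log-normal counts with correlated features might be simulated. It is not the presenter's implementation. The function name, the parameter values, the use of the floor function as the rounding rule, and the independent Bernoulli zero-inflation indicators are all assumptions made for illustration only.

```python
import numpy as np


def simulate_zi_rounded_lognormal(X, beta, Sigma, zero_prob, rng):
    """Simulate counts from an assumed zero-inflated rounded log-normal model.

    X         : (n, P) covariate matrix
    beta      : (P, J) log-linear regression coefficients (hypothetical values)
    Sigma     : (J, J) covariance matrix encoding feature interactions
    zero_prob : (J,) assumed probability of a structural zero per feature
    """
    n = X.shape[0]
    J = Sigma.shape[0]
    # Log-scale mean from the log-linear regression on covariates.
    log_mean = X @ beta
    # Correlated Gaussian noise on the log scale; zeros in Sigma mean no interaction.
    noise = rng.multivariate_normal(np.zeros(J), Sigma, size=n)
    # Rounding step (assumed here to be the floor of the latent log-normal value).
    counts = np.floor(np.exp(log_mean + noise)).astype(np.int64)
    # Zero inflation: independently force some entries to zero.
    structural_zero = rng.random((n, J)) < zero_prob
    counts[structural_zero] = 0
    return counts


rng = np.random.default_rng(0)
n, J = 200, 4
# Intercept plus one binary covariate (for example, a two-group comparison).
X = np.column_stack([np.ones(n), rng.integers(0, 2, size=n)])
beta = np.array([[2.0, 1.5, 1.0, 0.5],
                 [0.5, 0.0, -0.5, 0.0]])
# Sparse covariance: only features 0 and 1 interact.
Sigma = np.eye(J)
Sigma[0, 1] = Sigma[1, 0] = 0.6
zero_prob = np.array([0.1, 0.3, 0.5, 0.2])

Y = simulate_zi_rounded_lognormal(X, beta, Sigma, zero_prob, rng)
print(Y[:5])
print("fraction of zeros per feature:", (Y == 0).mean(axis=0))
```

In this sketch, zeros arise both from the structural-zero indicators and from small latent values that round down to zero, which is one reason excess zeros complicate inference about interactions.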