[PREPRINT] Metagenomic global survey and in-depth genomic analyses of Ruminococcus gnavus reveal differences across host lifestyle and health status

Nooij S, Plomp N, Sanders IMJG, Schout L, van der Meulen AE, Terveer EM, Norman JM, Karcher N, Larralde MF, Vossen RHAM, Kloet SL, Faber KN, Harmsen HJM, Zeller GF, Kuijper EJ, Smits WK, Ducarmon QR, biorXiv (2024).

Abstract

Ruminococcus gnavus is a highly prevalent gut bacterium (present in >90% of healthy individuals), of which increased abundance is associated with chronic inflammatory diseases, most notably Crohns disease. Nevertheless, its global distribution has not been investigated and little is known about intraspecies genomic variation. Through a large-scale survey of 12,791 gut metagenomes, we recapitulated known associations with metabolic diseases and inflammatory bowel disease. We uncover a higher prevalence and abundance of R. gnavus in Westernized populations and observe relative abundances of up to 83% in newborns and infants. Next, we built a collection of existing and newly cultured R. gnavus isolates (N = 45) from both healthy individuals and Crohns disease patients and subjected these to PacBio circular consensus sequencing to greatly expand the number of complete R. gnavus genomes. Analysis of these genomes as well as publicly available high quality draft genomes (total > 300 genomes) revealed multiple clades which separated Crohns-derived isolates from healthy-derived isolates. Functional analyses of genes predicted to constitute R. gnavus virulence factors could not explain this separation. Bacterial GWAS revealed that Crohns-derived isolates were enriched in genes related to mobile elements and putative mucin foraging. Together, we present one of the largest complete genome collections of any commensal gut microbe and provide novel biological insights into the global distribution and genomic variation of R. gnavus.