Metagenomic clustering links specific metabolic functions to globally relevant ecosystems

Zachary Flinkstrom*, Samuel Bryson, Pieter Candry, Mari Karoliina H. Winkler

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

Abstract

Metagenomic sequencing has advanced our understanding of biogeochemical processes by providing an unprecedented view into the microbial composition of different ecosystems. While the amount of metagenomic data has grown rapidly, simple-to-use methods to analyze and compare across studies have lagged behind. Thus, tools expressing the metabolic traits of a community are needed to broaden the utility of existing data. Gene abundance profiles are a relatively low-dimensional embedding of a metagenome’s functional potential and are, thus, tractable for comparison across many samples. Here, we compare the abundance of KEGG Ortholog Groups (KOs) from 6,539 metagenomes from the Joint Genome Institute’s Integrated Microbial Genomes and Metagenomes (JGI IMG/M) database. We find that samples cluster into terrestrial, aquatic, and anaerobic ecosystems with marker KOs reflecting adaptations to these environments. For instance, functional clusters were differentiated by the metabolism of antibiotics, photosynthesis, methanogenesis, and surprisingly GC content. Using this functional gene approach, we reveal the broad-scale patterns shaping microbial communities and demonstrate the utility of ortholog abundance profiles for representing a rapidly expanding body of metagenomic data.

Original languageEnglish
JournalmSystems
Volume9
Issue number8
DOIs
Publication statusPublished - 1 Aug 2024

Keywords

  • biogeochemistry
  • environmental microbiology
  • functional genes
  • GC content
  • metagenome clustering
  • metagenomics
  • microbial ecology

Fingerprint

Dive into the research topics of 'Metagenomic clustering links specific metabolic functions to globally relevant ecosystems'. Together they form a unique fingerprint.

Cite this