An integrated catalog of reference genes in the human gut microbiome

J. Li, H. Jia, X. Cai, H. Zhong, Q. Feng, S. Sunagawa, M. Arumugam, J.R. Kultima, E. Prifti, T. Nielsen, A.S. Juncker, C. Manichanh, B. Chen, W. Zhang, F. Levenez, X. Xu, L. Xiao, S. Liang, D. Zhang, Z. ZhangW. Chen, H. Zhao, J.Y. Al-Aama, S. Edris, H. Yang, H. Hansen, H.B. Nielsen, S. Brunak, K. Kristiansen, F. Guarner, O. Pedersen, J. Doré, S.D. Ehrlich, P. Bork, J. Wang, W.M. de Vos, S. Tims, E.G. Zoetendal, M. Kleerebezem

Research output: Contribution to journalArticleAcademicpeer-review

885 Citations (Scopus)


Many analyses of the human gut microbiome depend on a catalog of reference genes. Existing catalogs for the human gut microbiome are based on samples from single cohorts or on reference genomes or protein sequences, which limits coverage of global microbiome diversity. Here we combined 249 newly sequenced samples of the Metagenomics of the Human Intestinal Tract (MetaHit) project with 1,018 previously sequenced samples to create a cohort from three continents that is at least threefold larger than cohorts used for previous gene catalogs. From this we established the integrated gene catalog (IGC) comprising 9,879,896 genes. The catalog includes close-to-complete sets of genes for most gut microbes, which are also of considerably higher quality than in previous catalogs. Analyses of a group of samples from Chinese and Danish individuals using the catalog revealed country-specific gut microbial signatures. This expanded catalog should facilitate quantitative characterization of metagenomic, metatranscriptomic and metaproteomic data from the gut microbiome to understand its variation across populations in human health and disease.
Original languageEnglish
Pages (from-to)834-841
JournalNature Biotechnology
Issue number8
Publication statusPublished - 2014


  • eukaryotic diversity
  • fecal microbiota
  • population-size
  • metagenome
  • sequences
  • genomes
  • tool
  • alignment
  • impact
  • twins


Dive into the research topics of 'An integrated catalog of reference genes in the human gut microbiome'. Together they form a unique fingerprint.

Cite this