TY - JOUR
T1 - MIBiG 2.0: a repository for biosynthetic gene clusters of known functions
AU - Kautsar, S.A.
AU - Blin, Kai
AU - Shaw, Simon
AU - Navarro Munoz, J.C.
AU - Terlouw, Barbara
AU - van der Hooft, J.J.J.
AU - Van Santen, Jeffrey A.
AU - Tracanna, V.
AU - Suarez Duran, Hernando
AU - Pascal Andreu, V.
AU - Selem Mojica, Nelly
AU - Alanjary, Mohammad
AU - Robinson, Serina
AU - Lund, George
AU - Epstein, Samuel C.
AU - Sisto, Ashley C.
AU - Charkoudian, Louise K.
AU - Collemare, Jérôme
AU - Linington, Roger G.
AU - Weber, Tilmann
AU - Medema, M.H.
PY - 2020/1/8
Y1 - 2020/1/8
N2 - Fueled by the explosion of (meta)genomic data, genome mining of specialized metabolites has become a major technology for drug discovery and studying microbiome ecology. In these efforts, computational tools like antiSMASH have played a central role through the analysis of Biosynthetic Gene Clusters (BGCs). Thousands of candidate BGCs from microbial genomes have been identified and stored in public databases. Interpreting the function and novelty of these predicted BGCs requires comparison with a well-documented set of BGCs of known function. The MIBiG (Minimum Information about a Biosynthetic Gene Cluster) Data Standard and Repository was established in 2015 to enable curation and storage of known BGCs. Here, we present MIBiG 2.0, which encompasses major updates to the schema, the data, and the online repository itself. Over the past five years, 851 new BGCs have been added. Additionally, we performed extensive manual data curation of all entries to improve the annotation quality of our repository. We also redesigned the data schema to ensure the compliance of future annotations. Finally, we improved the user experience by adding new features such as query searches and a statistics page, and enabled direct link-outs to chemical structure databases. The repository is accessible online at https://mibig.secondarymetabolites.org/.
AB - Fueled by the explosion of (meta)genomic data, genome mining of specialized metabolites has become a major technology for drug discovery and studying microbiome ecology. In these efforts, computational tools like antiSMASH have played a central role through the analysis of Biosynthetic Gene Clusters (BGCs). Thousands of candidate BGCs from microbial genomes have been identified and stored in public databases. Interpreting the function and novelty of these predicted BGCs requires comparison with a well-documented set of BGCs of known function. The MIBiG (Minimum Information about a Biosynthetic Gene Cluster) Data Standard and Repository was established in 2015 to enable curation and storage of known BGCs. Here, we present MIBiG 2.0, which encompasses major updates to the schema, the data, and the online repository itself. Over the past five years, 851 new BGCs have been added. Additionally, we performed extensive manual data curation of all entries to improve the annotation quality of our repository. We also redesigned the data schema to ensure the compliance of future annotations. Finally, we improved the user experience by adding new features such as query searches and a statistics page, and enabled direct link-outs to chemical structure databases. The repository is accessible online at https://mibig.secondarymetabolites.org/.
U2 - 10.1093/nar/gkz882
DO - 10.1093/nar/gkz882
M3 - Article
SN - 0305-1048
VL - 48
SP - D454-D458
JO - Nucleic acids research
JF - Nucleic acids research
IS - D1
ER -