TY - JOUR
T1 - Reproducible molecular networking of untargeted mass spectrometry data using GNPS
AU - Aron, Allegra T.
AU - Gentry, Emily C.
AU - McPhail, Kerry L.
AU - Nothias, Louis Félix
AU - Nothias-Esposito, Mélissa
AU - Bouslimani, Amina
AU - Petras, Daniel
AU - Gauglitz, Julia M.
AU - Sikora, Nicole
AU - Vargas, Fernando
AU - van der Hooft, Justin J.J.
AU - Ernst, Madeleine
AU - Kang, Kyo Bin
AU - Aceves, Christine M.
AU - Caraballo-Rodríguez, Andrés Mauricio
AU - Koester, Irina
AU - Weldon, Kelly C.
AU - Bertrand, Samuel
AU - Roullier, Catherine
AU - Sun, Kunyang
AU - Tehan, Richard M.
AU - Boya P, Cristopher A.
AU - Christian, Martin H.
AU - Gutiérrez, Marcelino
AU - Ulloa, Aldo Moreno
AU - Tejeda Mora, Javier Andres
AU - Mojica-Flores, Randy
AU - Lakey-Beitia, Johant
AU - Vásquez-Chaves, Victor
AU - Zhang, Yilue
AU - Calderón, Angela I.
AU - Tayler, Nicole
AU - Keyzers, Robert A.
AU - Tugizimana, Fidele
AU - Ndlovu, Nombuso
AU - Aksenov, Alexander A.
AU - Jarmusch, Alan K.
AU - Schmid, Robin
AU - Truman, Andrew W.
AU - Bandeira, Nuno
AU - Wang, Mingxun
AU - Dorrestein, Pieter C.
PY - 2020
Y1 - 2020/06//
AB - Global Natural Product Social Molecular Networking (GNPS) is an interactive online small molecule–focused tandem mass spectrometry (MS2) data curation and analysis infrastructure. It is intended to provide as much chemical insight as possible into an untargeted MS2 dataset and to connect this chemical insight to the user’s underlying biological questions. This can be performed within one liquid chromatography (LC)-MS2 experiment or at the repository scale. GNPS-MassIVE is a public data repository for untargeted MS2 data with sample information (metadata) and annotated MS2 spectra. These publicly accessible data can be annotated and updated with the GNPS infrastructure keeping a continuous record of all changes. This knowledge is disseminated across all public data; it is a living dataset. Molecular networking—one of the main analysis tools used within the GNPS platform—creates a structured data table that reflects the molecular diversity captured in tandem mass spectrometry experiments by computing the relationships of the MS2 spectra as spectral similarity. This protocol provides step-by-step instructions for creating reproducible, high-quality molecular networks. For training purposes, the reader is led through a 90- to 120-min procedure that starts by recalling an example public dataset and its sample information and proceeds to creating and interpreting a molecular network. Each data analysis job can be shared or cloned to disseminate the knowledge gained, thus propagating information that can lead to the discovery of molecules, metabolic pathways, and ecosystem/community interactions.
U2 - 10.1038/s41596-020-0317-5
DO - 10.1038/s41596-020-0317-5
M3 - Article
AN - SCOPUS:85084452838
SN - 1754-2189
VL - 15
SP - 1954
EP - 1991
JO - Nature Protocols
JF - Nature Protocols
ER -