TY - JOUR
T1 - Chemically informed analyses of metabolomics mass spectrometry data with Qemistree
AU - Tripathi, Anupriya
AU - Vázquez-Baeza, Yoshiki
AU - Gauglitz, Julia M.
AU - Wang, Mingxun
AU - Dührkop, Kai
AU - Nothias-Esposito, Mélissa
AU - Acharya, Deepa D.
AU - Ernst, Madeleine
AU - van der Hooft, Justin J.J.
AU - Zhu, Qiyun
AU - McDonald, Daniel
AU - Brejnrod, Asker D.
AU - Gonzalez, Antonio
AU - Handelsman, Jo
AU - Fleischauer, Markus
AU - Ludwig, Marcus
AU - Böcker, Sebastian
AU - Nothias, Louis Félix
AU - Knight, Rob
AU - Dorrestein, Pieter C.
PY - 2021/2
Y1 - 2021/2
N2 - Untargeted mass spectrometry is employed to detect small molecules in complex biospecimens, generating data that are difficult to interpret. We developed Qemistree, a data exploration strategy based on the hierarchical organization of molecular fingerprints predicted from fragmentation spectra. Qemistree allows mass spectrometry data to be represented in the context of sample metadata and chemical ontologies. By expressing molecular relationships as a tree, we can apply ecological tools that are designed to analyze and visualize the relatedness of DNA sequences to metabolomics data. Here we demonstrate the use of tree-guided data exploration tools to compare metabolomics samples across different experimental conditions such as chromatographic shifts. Additionally, we leverage a tree representation to visualize chemical diversity in a heterogeneous collection of samples. The Qemistree software pipeline is freely available to the microbiome and metabolomics communities in the form of a QIIME2 plugin, and a global natural products social molecular networking workflow. [Figure not available: see fulltext.]
AB - Untargeted mass spectrometry is employed to detect small molecules in complex biospecimens, generating data that are difficult to interpret. We developed Qemistree, a data exploration strategy based on the hierarchical organization of molecular fingerprints predicted from fragmentation spectra. Qemistree allows mass spectrometry data to be represented in the context of sample metadata and chemical ontologies. By expressing molecular relationships as a tree, we can apply ecological tools that are designed to analyze and visualize the relatedness of DNA sequences to metabolomics data. Here we demonstrate the use of tree-guided data exploration tools to compare metabolomics samples across different experimental conditions such as chromatographic shifts. Additionally, we leverage a tree representation to visualize chemical diversity in a heterogeneous collection of samples. The Qemistree software pipeline is freely available to the microbiome and metabolomics communities in the form of a QIIME2 plugin, and a global natural products social molecular networking workflow. [Figure not available: see fulltext.]
U2 - 10.1038/s41589-020-00677-3
DO - 10.1038/s41589-020-00677-3
M3 - Article
AN - SCOPUS:85096069035
SN - 1552-4450
VL - 17
SP - 146
EP - 151
JO - Nature Chemical Biology
JF - Nature Chemical Biology
ER -