Introduction: A main goal of systems biology is to acquire knowledge of biological systems, and one of its challenges is inferring biochemical interactions among molecular entities such as metabolites. The metabolome holds a unique place in this area because it reflects “true exposure”: it is sensitive to variation arising from genetics, time, and environmental stimuli. Although metabolite levels are influenced by many different reactions, research interest often needs to focus on variation from one particular source, i.e. a certain covariable Xm.

Objective: Here, we use network analysis methods to recover a set of metabolite relationships by finding metabolites that share a similar relation to Xm. Metabolite values depend on individuals’ Xm status, which might interact with other covariables.

Methods: Instead of using the original metabolite values, we decompose the total information with a linear regression model and carry forward only the part relevant to Xm. Two network estimation methods are applied to two datasets. The first is weighted gene co-expression network analysis, which is based on correlation coefficients. The second is graphical LASSO, which is based on partial correlations.

Results: When the part related to the covariable of interest was used, the estimated networks displayed higher interconnectedness. Additionally, several groups of biologically associated metabolites (very large density lipoproteins, lipoproteins, etc.) were identified in the human data example.

Conclusions: This work demonstrates how information on the study design can be incorporated when estimating metabolite networks. As a result, sets of interconnected metabolites can be clustered together according to their relation to a covariable of interest.
- Incorporating relevant information
- Network reconstruction
- Study design
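
The decomposition described in the Methods can be illustrated with a minimal sketch on simulated data. It is not the authors' implementation. The data, the second covariable `z`, the interaction term, and the choice to define the Xm-related part as the fitted contribution of the Xm main effect plus its interaction are all assumptions made for illustration. The network step uses a simple soft-thresholded correlation adjacency in the spirit of weighted co-expression analysis, with a hypothetical power of 6. Graphical LASSO is not shown.

```python
import numpy as np

rng = np.random.default_rng(0)
n, p = 200, 6  # individuals and metabolites (toy sizes, assumed)

# Hypothetical covariables: xm is the covariable of interest, z is another one.
xm = rng.integers(0, 2, size=n).astype(float)
z = rng.normal(size=n)

# Simulated metabolite matrix: only the first three metabolites respond to xm.
effect = np.array([1.0, 0.8, 1.2, 0.0, 0.0, 0.0])
Y = np.outer(xm, effect) + 0.5 * np.outer(z, np.ones(p)) + rng.normal(size=(n, p))

# Linear model per metabolite: intercept, xm, z, and the xm-by-z interaction.
D = np.column_stack([np.ones(n), xm, z, xm * z])
B, *_ = np.linalg.lstsq(D, Y, rcond=None)  # coefficient matrix, shape (4, p)

# Decompose the fitted values (assumed definition of the "Xm-related part").
xm_cols, other_cols = [1, 3], [0, 2]
xm_part = D[:, xm_cols] @ B[xm_cols, :]          # contribution of xm terms
other_part = D[:, other_cols] @ B[other_cols, :]  # intercept and z contribution
residual = Y - D @ B                              # unexplained variation


def soft_threshold_adjacency(data, power=6):
    """Adjacency |cor|^power between columns (metabolites) of data."""
    corr = np.corrcoef(data, rowvar=False)
    return np.abs(corr) ** power


adj_original = soft_threshold_adjacency(Y)   # network from original values
adj_xm = soft_threshold_adjacency(xm_part)   # network from the Xm-related part


def mean_connectivity(adj):
    """Average off-diagonal adjacency, a crude interconnectedness summary."""
    off_diag = adj[~np.eye(adj.shape[0], dtype=bool)]
    return off_diag.mean()


print("original values:", mean_connectivity(adj_original))
print("Xm-related part:", mean_connectivity(adj_xm))
```

With only two Xm terms in the model, the Xm-related parts of all metabolites lie in a low-dimensional space, so their pairwise correlations tend to be strong. Richer designs with more covariables and interaction terms would give the Xm-related part more structure to distinguish metabolites.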