ChemRICH for Metabolon's data
This page provides the code to run a ChemRICH analysis on a data set generated by Metabolon Inc. The metabolomics data originate from this paper (LINK). Two formatted input files are provided: 1) Metabolon SubPathway Input and 2) MeSH ontology Input. Download both files and copy them into a new folder, then run the R scripts below from RStudio. The resulting ChemRICH impact plots are shown in the images below.
Both ontologies identified "Dipeptides" as the most significant chemical cluster in this study, but MeSH complemented Metabolon's pathway ontology by highlighting several additional chemical classes that might improve the biological interpretation. The study itself reports that dipeptides are associated with tumor aggressiveness and poor prognosis. Overall, we recommend running the ChemRICH analysis with both Metabolon's Sub-pathway ontology and the MeSH ontology, so that the non-overlapping pathways and chemical classes are both covered.
The latest Metabolon data sets can contain up to "1750 blood compounds spanning 20 super-pathways, subdivided into 113 sub-pathways" (LINK); see the supplementary data of that paper. Many of the named metabolites lack a PubChem CID or SMILES code, so we will probably need to combine the user-provided set definition approach with the chemical-class-based approach.
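To gauge how many metabolites fall into each case, a quick check of the input table can help. The following is a minimal sketch in Python, not part of ChemRICH itself. The column names (`compound_name`, `sub_pathway`, `pubchem_cid`, `smiles`) and the example rows are hypothetical placeholders, and it assumes that a row needs both a CID and a SMILES code to be usable for the chemical-class-based approach. Adjust these to match the actual input file.

```python
import csv
import io

# Hypothetical column names; change them to match the real input file.
CID_COL = "pubchem_cid"
SMILES_COL = "smiles"


def is_missing(value):
    """Treat None, empty strings, and common placeholders as missing."""
    return value is None or value.strip().lower() in {"", "na", "nan", "n/a"}


def split_by_identifiers(rows):
    """Partition rows into those with both a CID and a SMILES code, and the rest.

    Assumption: a row needs both identifiers for the chemical-class approach.
    """
    with_structure, without_structure = [], []
    for row in rows:
        if is_missing(row.get(CID_COL)) or is_missing(row.get(SMILES_COL)):
            without_structure.append(row)
        else:
            with_structure.append(row)
    return with_structure, without_structure


# Tiny made-up table for illustration only (not real study data).
EXAMPLE_CSV = """compound_name,sub_pathway,pubchem_cid,smiles
metabolite_a,pathway_1,1001,CCO
metabolite_b,pathway_1,,
metabolite_c,pathway_2,1003,NA
"""

rows = list(csv.DictReader(io.StringIO(EXAMPLE_CSV)))
with_structure, without_structure = split_by_identifiers(rows)

print(len(with_structure), "rows usable for the chemical-class approach")
print(len(without_structure), "rows needing a user-provided set definition")
```

Rows in the second group would rely on the user-provided set definition (for example, Metabolon's sub-pathway label), while rows in the first group can also be analyzed by chemical class. To apply the check to a real file, replace `io.StringIO(EXAMPLE_CSV)` with an opened CSV export of the input table.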