RAMclust/RAMsearch: efficient post-XCMS feature clustering and annotation of MS-based metabolomics datasets
Poster Dec 22, 2016
Corey D. Broeckling and Jessica E. Prenni
Introduction: Chromatographically coupled mass spectrometry is a powerful tool for profiling, semi-quantitatively or quantitatively, a breadth of small molecules with sensitivity and selectivity. The complexity of these datasets has driven the development of informatics approaches for feature finding, retention time alignment, feature grouping, and annotation. However, the complexity of signals derived from a single compound is generally underestimated, resulting in poor spectral reproducibility, misannotation, and misinterpretation of individual mass signals. This limitation has driven us to develop informatics tools to improve the quality of post-XCMS data processing.
Methods: RAMclustR is developed in R and is freely available. It is designed with memory constraints in mind, and operates on the scale of minutes, but can take an hour when peak shape similarity scoring is also used. The output is initially an R object containing a dataset of reduced dimensionality as compared to the input XCMS set, as well as spectra which are written to .msp format. These spectra can include MSE (indiscriminant MS/MS) spectra when available. This msp format is taken as input for RAMseach, a .NET-based GUI for performing batch spectral searching against NIST formatted spectral libraries. The results can be output in a format which can be reimported back into the ramclustR.
Preliminary Results: RAMclustR feature similarity scores are calculated for all feature pairs in the input XCMS R object, where feature similarity is the product of individual similarities in correlation in intensity across the dataset, feature retention time, and peak shape. The contribution of each score is tunable using sigmoid functions, enabling the evaluation of results and adjustment, when necessary. The output datasets demonstrate improved injection reproducibility as compared to individual features, reduce false discovery error rate burden, and improve annotation quality. Annotation efficiency is dramatically improved by utilizing the output spectra from RAMclustR as input for spectral searching using RAMsearch, a novel GUI for batch searching and manual validation of search results. The output from RAMsearch is imported into RAMclustR, enabling the storing, visualization, and sharing of the evidence for a given annotation. These output are suitable as supplementary material upon publication of the dataset, to ensure transparency in the annotation process. This workflow reduces annotation time several fold by automating routine manual tasks. Further, it is designed to streamline the efforts that go into reporting annotation confidence, which will enable more robust, transparent, and accessible reporting of metabolomics data.
Using Elemental Analysis For Discrimination Of Pinot Noir Wines From Six Different Districts In An AvaPoster
The determination of geographical origin of wine is gaining increased interest by researchers and federal agencies around the world, partially due to increased fraud with regards to place of origin labelling. For wine, multi-elemental profiling of macro, micro, and trace elements has been proposed for determination of authenticity. Commercial wines from different wineries in 5 different neighborhoods within one AVA show characteristic elemental fingerprints. Macro, micro and trace elements as well as elemental ratios contribute to the observed separation, indicating the involvement of multiple factors and underlying mechanisms, including location and soil composition, elemental uptake by vine and rootstock, viticulture and nutrient management, water sources, and small differences in the different wineries.READ MORE
Fast arsenic speciation analysis of wines and rice with LC-ICP-QQQPoster
This method was designed in response to recent and proposed food standards, both international and national, that limit inorganic arsenic rather than total, organic, or individual arsenic species such as arsenite (AsIII) and arsenate (AsV). Analysis time is 10x faster than the current FDA regulatory method, increasing sample throughput, avoided spectral interferences and dramatically increased sensitivity. Validation data from two laboratories demonstrate the method’s accuracy and reproducibility of both wine and rice matrices in a single analytical batch.READ MORE
Exploiting Polypharmacology in Precision Oncology: Identification of Differential Kinase Off-targets Among Clinical PARP InhibitorsPoster
Can we use computational methods to identify previously unknown off-targets of PARP inhibitors that can explain their observed differences?READ MORE