Loading…

Extending compound identification for molecular network using the LipidXplorer database independent method: A proof of concept using glycoalkaloids from Solanum pseudoquina A. St.‐Hil

Introduction Molecular networks are now established as the method of choice for tandem mass spectrometry dereplication and similarity‐based structure elucidation. Node identification can be used to start the propagation of the structure elucidation of unknown compounds progressively. Objective To de...

Full description

Saved in:
Bibliographic Details
Published in:Phytochemical analysis 2019-03, Vol.30 (2), p.132-138
Main Authors: Soares, Vitor, Taujale, Rahil, Garrett, Rafael, Silva, Antonio Jorge R., Borges, Ricardo M.
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Introduction Molecular networks are now established as the method of choice for tandem mass spectrometry dereplication and similarity‐based structure elucidation. Node identification can be used to start the propagation of the structure elucidation of unknown compounds progressively. Objective To demonstrate the capabilities of using the LipidXplorer data results along with molecular networking to identify nodes and aid sequential structure elucidation of unknown compounds. Material and Methods Molecular fragmentation query language (MFQL) files were written to identify glycoalkaloids based on known structures described for Solanum species. A dataset generated from liquid chromatography‐high resolution mass spectrometry (LC‐HRMS) analysis of Solanum pseudoquina sample were submitted to dereplication on both LipidXplorer software and Global Natural Products Social Molecular Network (GNPS) online system. The resulting attribute table from GNPS calculations was merged with the LipidXplorer results and this merged file was used for network visualisation in Cytoscape. Nodes in the molecular network were labelled using the LipidXplorer identifiers, thus assisting the structure elucidation of unidentified compounds. Results The combination of the LipidXplorer glycoalkaloids list and GNPS analysis was used in Cytoscape to label nodes in the molecular network. The analysis of the network using these labelled starting points triggered the structure elucidation of closely related nodes leading to the identification of 30 compounds using the LipidXplorer output and four purified and structure elucidated compounds, including a new glycoalkaloids identified as 3‐O‐(β‐d‐xylopyranosyl)‐(20R,25S)‐22,26‐epimino‐16‐acetyl‐cholesta‐5,22(N)‐diene. Conclusion A significant compound identification completely based on molecular formula and fragmentation queries was achieved. This new and effective approach could help researches to expand the identification rate of compounds in dereplication studies using molecular networks. To extend the compound identification rate for dereplication studies using molecular networking, we implemented the direct use of the data generated from the database independent method developed on LipidXplorer in the Global Natural Products Social Molecular Network (GNPS) table results. Features detected using the molecular formula restrictions translated into molecular fragmentation query language (MFQL) files were merged with the attribute table from GNPS
ISSN:0958-0344
1099-1565
DOI:10.1002/pca.2798