An in silico MS/MS library for automatic annotation of novel FAHFA lipids
© Ma et al. 2015
Received: 13 August 2015
Accepted: 3 November 2015
Published: 16 November 2015
A new lipid class named ‘fatty acid esters of hydroxyl fatty acids’ (FAHFA) was recently discovered in mammalian adipose tissue and in blood plasma and some FAHFAs were found to be associated with type 2 diabetes. To facilitate the automatic annotation of FAHFAs in biological specimens, a tandem mass spectra (MS/MS) library is needed. Due to the limitation of the commercial available standard compounds, we proposed building an in silico MS/MS library to extend the coverage of molecules.
We developed a computer-generated library with 3267 tandem mass spectra (MS/MS) for 1089 FAHFA species. FAHFA spectra were generated based on authentic standards with negative mode electrospray ionization and 10, 20, and 40 V collision induced dissociation at 4 spectra/s as used in in ultra-high performance liquid chromatography-QTOF mass spectrometry studies. However, positional information of the hydroxyl group is only obtained either at lower QTOF spectra acquisition rates of 1 spectrum/s or at the MS3 level in ion trap instruments. Therefore, an additional set of 4290 fragment-rich MS/MS spectra was created to enable distinguishing positional FAHFA isomers. The library was generated based on ion fragmentations and ion intensities of FAHFA external reference standards, developing a heuristic model for fragmentation rules and extending these rules to large swaths of computer-generated structures of FAHFAs with varying chain lengths, degrees of unsaturation and hydroxyl group positions. Subsequently, we validated the new in silico library by discovering several new FAHFA species in egg yolk, showing that this library enables high-throughput screening of FAHFA lipids in various biological matrices.
The developed library and templates are freely available for commercial or noncommercial use at http://fiehnlab.ucdavis.edu/staff/yanma/fahfa-lipid-library. This in silico MS/MS library allows users to annotate FAHFAs from accurate mass tandem mass spectra in an easy and fast manner with NIST MS Search or PepSearch software. The developing template is provided for advanced users to modify the parameters and export customized libraries according to their instrument features.
Recently, a novel lipid class named ‘fatty acid esters of hydroxyl fatty acids’ (FAHFA) was discovered in mice adipose tissues . Specifically, a FAHFA comprised of palmitic acid (16:0) esterified to a 9-hydroxyl stearic acid (9-O-18:0), abbreviated as 9-PAHSA, was discussed as promoting anti-diabetic and anti-inflammatory effects . PAHSA levels were shown to be highly correlated with insulin sensitivity in humans . However, it remained unclear if other FAHFAs might exert similar effects, and how many different FAHFAs in total might be present in mammalian tissues or biofluids.
Currently only 21 external reference standards are commercially available. In order to enable extensive profiling and automatic annotation of FAHFA species, an MS/MS library with more structural diversity is needed. Today, such mass spectral libraries can be created by applying rules of fragmentation patterns on large in silico structure list, as we have previously shown for over 200,000 mass spectra in LipidBlast  for twenty-six common lipid classes such as (lyso) phosphatidylcholines, monogalactosyldiacylglycerols or triacylglycerols. LipidBlast itself has been applied for the annotation of lipids in mouse liver , rat urine/serum  and in various algae species [5, 6], demonstrating that this strategy enables rapid annotation of many molecular species from mass spectra . LipidBlast templates use heuristic information of MS/MS fragmentation patterns to extend the range of in silico predicted mass spectra that can be used to discover species of novel lipid classes. An example has been shown for glucuronosyldiacylglycerol lipids in plants . Here we used the modified LipidBlast templates to build an in silico MS/MS library for 1089 species of the novel FAHFA lipid class and demonstrate the applicability of this new library.
Results and discussion
To expand the overall structure space of the FAHFA structures that can be annotated by mass spectrometry, we used 33 fatty acids commonly found in mammalian cells, varying from 14:0–24:6 (Additional file 2) for both the free fatty acid and the hydroxyl fatty acid moieties [9–11]. 1089 general FAHFA structures were defined and 3267 in silico spectra were modelled based on the fragmentation pattern of 9-PAHSA observed at three collision energies. Since the positional information of double-bonds and hydroxyl groups could not be reflected by the reference spectra acquired with the fast 4 spectra/s QTOF MS/MS profiling method, such detailed information was not specified in the structures of FAHFAs. Therefore, the structures in this FAHFA profiling library are general, semi-characterized structures. To characterize the position of the hydroxyl group, we built a more specific in silico library based on the fragment-rich spectra at 40 V modeled from 1 spectra/s QTOF MS/MS acquisition time. According to a patent on FAHFA lipids , the hydroxyl group may be positioned on all carbons except for the terminal carbon. Correspondingly, 4290 structures with saturated hydroxyl fatty acids and saturated or unsaturated fatty acid esters were defined, and their in silico spectra were created based on the reference spectra of 9-PAHSA acquired with 1 spectrum/s MS/MS method. Due to a lack of published spectra or commercially available standards, modeling for unsaturated hydroxyl fatty acid residues was excluded. To further verify the position of the hydroxyl group we used relative retention time information. For the elution order of three commercially available OAHSA isomer reference standards, we observed increasing retention times when the hydroxyl group was positioned closer to the carboxylic acid moiety, with 12-OAHSA eluting at 6.07 min, 9-OAHSA at 6.21 min and 5-OAHSA eluting at 6.45 min under the conditions described in the experimental section. For building a reliable retention time database or even predicting retention times for other FAHFAs, a larger range of authentic reference standards would be needed.
Information about FAHFAs found in egg yolk
We developed an in silico MS/MS library for FAHFA lipids with a total of 7557 QTOF spectra in negative ionization mode. The new library enables users to automatically annotate FAHFAs in LC–MS/MS lipidomics profiling and can be therefore applied to further studies of this novel lipid class. The batch annotation process is very easy and fast using NIST MS PepSearch. We also provided the Excel template for users to adapt this library to their own instrument features and parameters and export customized libraries. The developed library and templates are freely available for commercial or noncommercial use under creative commons-by attribution (CC-BY) license and can be downloaded from http://fiehnlab.ucdavis.edu/staff/yanma/fahfa-lipid-library. The subset of 4290 spectra with defined structures (provided as InChI codes) is also available in Massbank of North America (MoNA) at http://mona.fiehnlab.ucdavis.edu/.
Experimental measurements of standards
FAHFA standards, including 9-PAHSA [16:0-(9-O-18:0)], 5-OAHSA [18:1-(5-O-18:0)], 9-OAHSA [18:1-(9-O-18:0)], and 12-OAHSA [18:1-(12-O-18:0)], were purchased from Cayman Chemical (Ann Arbor, MI). Stock solution for each lipid standard was prepared in ethanol at 1 mg/mL, and then diluted in methanol to 50 ppm for injection. LC–MSMS acquisition was performed by an Agilent 1290 HPLC coupled to an Agilent 6530 quadrupole time of flight (QTOF) mass spectrometer. A Waters Acquity CSH C18 column (2.1 × 100 mm, 1.7 μM) was used for separation. Mobile phase A consisted of 60:40 acetonitrile:water while mobile phase B consisted of 90:10 isopropanol:acetonitrile, both with 9.2 mM ammonium acetate. Column temperature was set to 65 °C and the flow rate was 0.6 mL/min. The following gradient was applied: 0–2 min from 15 % B–30 % B, 2–2.5 min from 30 % B–48 % B, 2.5–11 min from 48 % B–82 % B, 11–11.5 min from 82–99 % B, 11.5–12 min remain 99 % B, 12–12.1 from 99 % B–15 % B, 12.1–15 min re-equilibrate at 15 % B. 3 µL of each standard as well as a mixture of the four standards were injected. MS and MS/MS data was collected in negative ionization mode, in profile and centroid mode with a scan rate of 4 spectra per second. Multiple collision energies were applied, including 10, 20 and 40 V. In addition, MS/MS data was acquired specifically for m/z 537.489 and m/z 563.504 at acquisition time of 1 spectrum per second. MS/MS spectra were exported from Agilent MassHunter software to Mascot Generic Format (MGF) format. Additional confirmation was performed by MS/MS and MS3 experiments by direct infusion of 10 ppm standard solutions in methanol into a ThermoScientific Linear Ion Trap (LTQ) mass spectrometer at 5 kV electrospray voltage and 30 ms collision activation time with an activation Q set at 0.25, isolation width of 3 Da and the normalized collision energy for collision induced dissociation was set to 20 % for the acquisition of MS3 level spectra.
In-silico development and validation
10, 20 and 40 V MS/MS spectra of 9-PAHSA were used for the development of the in silico library. Fragmentation patterns were manually investigated, and resulting m/z and abundance of all major peaks at each collision energy were included in the modified LipidBlast template. Molecular formula and accurate masses were calculated from chain lengths and degrees of unsaturation of the fatty acid and hydroxyl fatty acid residues, using the Exact Mass Calculator . To expand the structure space of the library, a series of common fatty acids found in mammalian systems were added to the template, as fatty acid moieties [9–11]. Spectra information was calculated according to the model compound. Fatty acid structures were downloaded from the lipid metabolites and pathways strategy (Lipid MAPS) database . Hydroxyl fatty acid and FAHFA structures were generated by ChemAxon Marvin 9.5.3 and JChem Reactor 9.5.3 . VBA code of LipidBlast was modified to fit the new template and export the spectra to NIST MSP format. MSP file was then converted to NIST library by Lib2NIST  software and was ready to be used with NIST MS Search or NIST MS PepSearch software . Other internal experimental MS/MS spectra as well as the external spectra from METLIN  online database were used for validation.
Application: lipidomics of egg yolk
With slight modifications, lipid extraction of egg yolk was performed according to a previously published method . Briefly, 300 µL egg yolk was added to 1200 µL citric acid buffer (100 mM sodium citrate, 1 M NaCl), followed by adding 1.5 mL of methanol and 3 mL of chloroform. The mixture was shaken by hand for 30 s, vortexed for 15 s, and centrifuged at 2200g, 4 °C for 6 min. The organic phase was dried under a gentle stream of nitrogen gas. The extracted lipids were reconstituted with 200 µL chloroform and loaded to a pre-conditioned SPE cartridge (500 mg silica, 6 mL, Thermo Scientific). Neutral lipids were eluted with 15 mL 5 % ethyl acetate in hexane followed by the elution of FAHFAs with 15 mL ethyl acetate. The FAHFA lipid fraction was dried under nitrogen gas and stored at −80 °C prior to LC/MS analysis. On the day of the experiment, the lipid extract was re-suspended with 30 µL of methanol and 5 µL was injected. The LC-MSMS method was similar to the method for the reference standard measurements, with a scan rate of 4 spectra per second and collision energy of 20 V. NIST MS PepSearch was used for the MS/MS search against the in silico FAHFA library.
YM, TK, OF designed the experiment. YM developed and validated the in silico library. YM, AV, IG and JFF performed experimental measurements. YM, TK and OF wrote the manuscript. All authors read and approved the final manuscript.
This study was funded by National Science Foundation grant MCB 1139644 and National Institutes of Health Grant U24 DK097154 and 1S10RR031630.
The authors declare that they have no competing interests.
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
- Yore MM, Syed I, Moraes-Vieira PM, Zhang T, Herman MA, Homan EA, Patel RT, Lee J, Chen S, Peroni OD (2014) Discovery of a class of endogenous mammalian lipids with anti-diabetic and anti-inflammatory effects. Cell 159(2):318–332View ArticleGoogle Scholar
- Kind T, Liu KH, Lee do Y, DeFelice B, Meissen JK, Fiehn O (2013) Lipidblast in silico tandem mass spectrometry database for lipid identification. Nat Methods 10(8):755–758View ArticleGoogle Scholar
- Park HM, Shon JC, Lee MY, Liu K-H, Kim JK, Lee SJ, Lee CH (2014) Mass spectrometry-based metabolite profiling in the mouse liver following exposure to ultraviolet B radiation. PLoS One 9(10):e109479View ArticleGoogle Scholar
- Kim HY, Lee MY, Park HM, Park YK, Shon JC, Liu KH, Lee CH (2015) Urine and serum metabolite profiling of rats fed a high-fat diet and the anti-obesity effects of caffeine consumption. Molecules 20(2):3107–3128View ArticleGoogle Scholar
- Ogawa T, Furuhashi T, Okazawa A, Nakai R, Nakazawa M, Kind T, Fiehn O, Kanaya S, Arita M, Ohta D (2014) Exploration of polar lipid accumulation profiles in Euglena gracilis using LipidBlast, an MS/MS spectral library constructed in silico. Biosci Biotechnol Biochem 78(1):14–18View ArticleGoogle Scholar
- Kind T, Meissen JK, Yang D, Nocito F, Vaniya A, Cheng YS, VanderGheynst JS, Fiehn O (2012) Qualitative analysis of algal secretions with multiple mass spectrometric platforms. J Chromatogr A 1244:139–147View ArticleGoogle Scholar
- Tsugawa H, Cajka T, Kind T, Ma Y, Higgins B, Ikeda K, Kanazawa M, VanderGheynst J, Fiehn O, Arita M (2015) MS-DIAL: data-independent MS/MS deconvolution for comprehensive metabolome analysis. Nat Methods 12(6):523–526View ArticleGoogle Scholar
- Kind T, Okazaki Y, Saito K, Fiehn O (2014) LipidBlast templates as flexible tools for creating new in-silico tandem mass spectral libraries. Anal Chem 86(22):11024–11027View ArticleGoogle Scholar
- McEvoy T, Coull G, Broadbent P, Hutchinson J, Speake B (2000) Fatty acid composition of lipids in immature cattle, pig and sheep oocytes with intact zona pellucida. J Reprod Fertil 118(1):163–170View ArticleGoogle Scholar
- Käkelä R, Hyvärinen H (1996) Site-specific fatty acid composition in adipose tissues of several northern aquatic and terrestrial mammals. Comp Biochem Physiol B: Biochem Mol Biol 115(4):501–514View ArticleGoogle Scholar
- Svennerholm L (1968) Distribution and fatty acid composition of phosphoglycerides in normal human brain. J Lipid Res 9(5):570–579Google Scholar
- Kahn B, Herman M, Saghatelian A, Homan E Lipids that increase insulin sensitivity and methods of using the same. Google Patents, 2013Google Scholar
- Smith CA, O’Maille G, Want EJ, Qin C, Trauger SA, Brandon TR, Custodio DE, Abagyan R, Siuzdak G (2005) METLIN: a metabolite mass spectral database. Ther Drug Monit 27(6):747–751View ArticleGoogle Scholar
- Exact Mass Calculator. http://www.sisweb.com/referenc/tools/exactmass.htm. Accessed 27 Feb 2015
- Fahy E, Sud M, Cotter D, Subramaniam S (2007) LIPID MAPS online tools for lipid research. Nucleic Acids Res 35(suppl 2):W606–W612View ArticleGoogle Scholar
- ChemAxon. http://www.chemaxon.com. Accessed 9 Mar 2015
- Lib2NIST. http://chemdata.nist.gov/. Accessed 3 Apr 2015
- Stein SE, Scott DR (1994) Optimization and testing of mass spectral library search algorithms for compound identification. J Am Soc Mass Spectrom 5(9):859–866View ArticleGoogle Scholar