1. Overview
Siriraj Metabolomics Data Warehouse for Ion Mobility-Mass Spectrometry (SiMD) is a four-dimensional (4D) library that encompasses mass-to-charge ratio (m/z), retention time (RT), fragment ions (MS/MS), and collision cross-section (CCS) values, all derived from reference standards characterized by ion mobility-mass spectrometry. The SiMD library contains both human metabolites and specialized metabolites from natural products. This advanced mass spectral database provides scoring algorithms for different matching parameters to facilitate various levels of metabolite identification for both traveling wave ion mobility-mass spectrometry (TWIMS) and cyclic ion mobility-mass spectrometry (cIMS).
2. Search
SiMD supports metabolite identification for both traveling wave ion mobility-mass spectrometry (TWIMS) and cyclic ion mobility-mass spectrometry (cIMS). It allows both single and batch (CSV) searches. For each platform, SiMD offers 11 search schemes (see Tables 1 and 2).
Table 1. Search schemes for TWIMS
No. | Search schemes for TWIMS | Description |
---|---|---|
1 | Name | Name searching e.g., Butyric acid |
2 | Xref | Cross reference searching e.g., HMDB0000039 |
3 | m/z | m/z searching |
4 | CCS | CCS searching |
5 | m/z + MS/MS | m/z and MS/MS searching |
6 | m/z + CCS | m/z and CCS searching |
7 | m/z + RT | m/z and RT searching |
8 | m/z + CCS + RT | m/z, CCS and RT searching |
9 | m/z + CCS + MS/MS | m/z, CCS and MS/MS searching |
10 | m/z + RT + MS/MS | m/z, RT and MS/MS searching |
11 | m/z + CCS +RT + MS/MS | m/z, CCS, RT and MS/MS searching |
Table 2. Search schemes for cIMS
No. | Search schemes for cIMS | Description |
---|---|---|
1 | Name | Name searching e.g., Butyric acid |
2 | Xref | Cross reference searching e.g., HMDB0000039 |
3 | m/z | m/z searching |
4 | CCS | CCS searching |
5 | m/z + MS/MS | m/z and MS/MS searching |
6 | m/z + CCS | m/z and CCS searching |
7 | m/z + tp | m/z and tp searching |
8 | m/z + tn | m/z and tn searching |
9 | m/z + CCS + MS/MS | m/z, CCS and MS/MS searching |
10 | m/z + tp + MS/MS | m/z, tp and MS/MS searching |
11 | m/z + tn + MS/MS | m/z, tn and MS/MS searching |
3. Search Criteria
Users can define the adduct (e.g., [M+H]+ and [M-H]-) as search criteria. They can also set tolerance for m/z, CCS, RT, MS/MS, tp and tn searching. The following equations are used for setting tolerance for each parameter.
(1)Δm/z = Parts per million (ppm) measures the approximation error between the experimental mass and the reference mass
(2)ΔCCS = Deviation of CCS (%)
(3)ΔRT = Deviation of RT (min)
(4)Δtₚ = Deviation of tₚ (%)
(5)Δtₙ = Deviation of tₙ (%)
4. Similarity Score
The following equations are used to calculate a scaled similarity score for m/z, CCS, RT, tp, tn. The similarity scores range between 0 and 100, with 100 signifying a perfect match. A scaling value (X) for each parameter can be specified by the user. The default values of Xmz = 2, XCCS = 10, XRT = 200, Xtp = 20, Xtn = 20 will result in a score of 0, if Δm/z = 50, ΔCCS = 10, Δtp = 5, Δtn = 5, respectively. A lower scaling value will result in a higher score. This helps determine the significance when searching using more than one parameter.
(6)m/z similarity score
mzsim = 100 - (Xmz × Δm/z) ; Xmz = 2
(7)CCS similarity score
CCSsim = 100 - (XCCS × ΔCCS) ; XCCS = 10
(8)RT similarity score
RTsim = 100 - (XRT × ΔRT) ; XRT = 200
(9)tp similarity score
(10)tn similarity score
(11)Average similarity score when searching more than one parameter
(12)MS/MS similarity score when peak intensity or peak height is not considered
(13)MS/MS similarity score when peak intensity or peak height is considered
SiMD uses a weighting function (W) where the m/z value is raised to the power of m, and the peak intensity is raised to the power of n. The values of m and n can be specified by the user to determine the significance of m/z and peak intensity.