1. Overview

Siriraj Metabolomics Data Warehouse for Ion Mobility-Mass Spectrometry (SiMD) is a four-dimensional (4D) library that encompasses mass-to-charge ratio (m/z), retention time (RT), fragment ions (MS/MS), and collision cross-section (CCS) values, all derived from reference standards characterized by ion mobility-mass spectrometry. The SiMD library contains both human metabolites and specialized metabolites from natural products. This advanced mass spectral database provides scoring algorithms for different matching parameters to facilitate various levels of metabolite identification for both traveling wave ion mobility-mass spectrometry (TWIMS) and cyclic ion mobility-mass spectrometry (cIMS).

3. Search Criteria

Users can define the adduct (e.g., [M+H]+ and [M-H]-) as search criteria. They can also set tolerance for m/z, CCS, RT, MS/MS, tp and tn searching. The following equations are used for setting tolerance for each parameter.

(1)Δm/z = Parts per million (ppm) measures the approximation error between the experimental mass and the reference mass

Δm/z = | mzinput mzref | mzref × 10 6

(2)ΔCCS = Deviation of CCS (%)

ΔCCS = | CCSinput CCSref | CCSref × 100

(3)ΔRT = Deviation of RT (min)

ΔRT = | RTinput RTref |

(4)Δtₚ = Deviation of tₚ (%)

Δtp = | tp(input) tp(ref) | tp(ref) × 100

(5)Δtₙ = Deviation of tₙ (%)

Δtn = | tn(input) tn(ref) | tn(ref) × 100

4. Similarity Score

The following equations are used to calculate a scaled similarity score for m/z, CCS, RT, tp, tn. The similarity scores range between 0 and 100, with 100 signifying a perfect match. A scaling value (X) for each parameter can be specified by the user. The default values of Xmz = 2, XCCS = 10, XRT = 200, Xtp = 20, Xtn = 20 will result in a score of 0, if Δm/z = 50, ΔCCS = 10, Δtp = 5, Δtn = 5, respectively. A lower scaling value will result in a higher score. This helps determine the significance when searching using more than one parameter.

(6)m/z similarity score

mzsim = 100 - (Xmz × Δm/z) ; Xmz = 2

(7)CCS similarity score

CCSsim = 100 - (XCCS × ΔCCS) ; XCCS = 10

(8)RT similarity score

RTsim = 100 - (XRT × ΔRT) ; XRT = 200

(9)tp similarity score

tpsim = 100 ( Xtp × Δ tp ) ; Xtp = 20

(10)tn similarity score

tnsim = 100 ( Xtn × Δ tn ) ; Xtn = 20

(11)Average similarity score when searching more than one parameter

Average Similarity Score = ( similarity scores ) N ; N number of search parameters

(12)MS/MS similarity score when peak intensity or peak height is not considered

MS / MS  Similarity Score1 = number of matched mz number of mz ref × 100

(13)MS/MS similarity score when peak intensity or peak height is considered

SiMD uses a weighting function (W) where the m/z value is raised to the power of m, and the peak intensity is raised to the power of n. The values of m and n can be specified by the user to determine the significance of m/z and peak intensity.

W = mz m × Intensity n ;   m = 1  and  n = 0.6
MS / MS  Similarity Score2 = ( Winput × Wref ) ( Winput2 ) × ( Wref2 )