File Formats
Input: MGF
msfiddle accepts tandem mass spectra in .mgf format. Four fields are
required per spectrum; all others are ignored.
Field |
Description |
|---|---|
|
Unique spectrum identifier, propagated to the |
|
Observed precursor m/z. |
|
Adduct type (e.g. |
|
Collision energy in eV. |
Example:
BEGIN IONS
TITLE=EMBL_MCF_2_0_HRMS_Library000529
PEPMASS=111.02016
CHARGE=1-
PRECURSOR_TYPE=[M-H]-
PRECURSOR_MZ=111.02016
COLLISION_ENERGY=50.0
41.0148 0.329893
68.0258 0.402906
111.0203 100.0
END IONS
Supported precursor types:
[M+H]+, [M+2H]2+, [M+Na]+, [M-H]-, [M+H-H2O]+,
[M-H2O+H]+, [2M+H]+, [2M-H]-, [M+H-2H2O]+, [M+H-NH3]+,
[M+H+NH3]+, [M+NH4]+, [M+H-CH2O2]+, [M+H-CH4O2]+,
[M-H-CO2]-, [M-CHO2]-, [M-H-H2O]-
Output: msfiddle CSV
One row is produced per input spectrum.
Column |
Description |
|---|---|
|
Spectrum identifier from the MGF |
|
Raw TCN output: semicolon-separated atom-count vector (C, H, O, N, F, S, Cl, P, B, Br, I, Na, K). |
|
Neutral monoisotopic mass derived from |
|
Top formula from the TCN model prior to refinement. |
|
Monoisotopic mass of |
|
Total atom count predicted by the model. |
|
H/C ratio predicted by the model. |
|
Wall time per spectrum in seconds (prediction + refinement). |
|
The k-th best refined formula (0-indexed), ranked by rescore score.
|
|
Monoisotopic mass of |
|
Rescore model confidence score for |
The number of ranked columns is set by top_k in the configuration file
(default: 5).
Input: BUDDY CSV (optional)
Required columns when using --buddy_path:
Column |
Description |
|---|---|
|
Spectrum identifier matching the MGF |
|
Precursor type string. |
|
Top-5 candidate formulas from BUDDY. |
|
Confidence scores; candidates below the configured threshold are excluded. |
Input: SIRIUS CSV (optional)
Required columns when using --sirius_path:
Column |
Description |
|---|---|
|
Spectrum identifier matching the MGF |
|
Top-5 candidate formulas from SIRIUS. |
|
Predicted adduct for each candidate. |
|
Log-likelihood scores; candidates below the configured threshold are excluded. |