MRMPROBS tutorial

Edited on 2016/11/16

Introduction

MRMPROBS is a universal program for targeted metabolomics that handles not only multiple reaction monitoring (MRM) or selected reaction monitoring (SRM) data but also SCAN and data-independent MS/MS acquisition (DIA) data. The original MRMPROBS program was developed to handle large-scale MRM assay data sets that monitor 500-1000 small molecules simultaneously in a single run. It provided 1) a user-friendly graphical user interface (GUI) for data curation and 2) an objective evaluation system for small-molecule identifications. It has now been extended to DIA-MS data (such as SWATH-MS) and to SCAN data (such as GC/MS and LC/MS).
The whole data-processing workflow, from data import to statistical analysis, is supported. This tutorial introduces the workflow for 1) MRM data, 2) SWATH-MS (DIA) data, and 3) GC/MS data for targeted metabolomics. Your feedback on the MRMPROBS project is appreciated and will help improve the identification and quantification systems as well as the user interface.

Hiroshi Tsugawa
RIKEN Center for Sustainable Resource Science
hiroshi.tsugawa@riken.jp



Table of contents

Section 1: Software environments
Section 2: Required software programs and files
Section 3: Project type and condition
Section 4: ABF file conversion
Section 4-1: Downloading the ABF converter
Section 4-2: Check the conditions for file conversion
Section 4-3: File conversion
Section 5: Reference file format
Section 5-1: Reference library for Project type 1: MRMPROBS key index = metabolite name (abf)
Section 5-2: Reference library for Project type 2: MRMPROBS key index = Function (mzML)
Section 5-3: Reference library for Project type 3: MRMPROBS key index = SCAN or DIA-MS (abf)
  Section 5-3-1: Reference format for DIA-MS data
  Section 5-3-2: Dictionary file for DIA-MS data processing
  Section 5-3-3: Reference format for GC/MS and LC/MS data
Section 6: Starting MRMPROBS
Section 6-1: Summary for MRM demonstration data sets
Section 6-2: Starting up your project
Section 6-3: Importing Abf files
Section 6-4: Parameter
Section 7: MRMPROBS viewer
Section 7-1: Mouse operation in the chromatogram viewer
Section 7-2: Library editor (optional)
Section 7-3: Tool button
Section 7-4: Tab
Section 7-5: Button
Section 7-6: List Box
Section 7-7: Details on the MRMPROBS function
  Section 7-7-1: File menu
  Section 7-7-2: Data reprocessing
  Section 7-7-3: Statistical analysis
  Section 7-7-4: Missing value methods
  Section 7-7-5: Normalization
  Section 7-7-6: Window menu
  Section 7-7-7: View menu
  Section 7-7-8: Option menu
  Section 7-7-9: Export menu
Appendix A: How to obtain appropriate file conversion of the Shimadzu .lcd file
Appendix B: Third option of MRMPROBS: via mzML file


Section 1

Software environments

Section 2

Required software programs and files

MRMPROBS imports data in the Analysis Base Framework (ABF) format. It extracts chromatogram data according to a reference library that lists, for each target metabolite, its name, retention time, amplitude information, precursor m/z, and product m/z. The vendor formats supported for ABF conversion are Shimadzu Inc. (.LCD), Agilent Technologies (.D), AB Sciex (.WIFF), Waters (.RAW), and Thermo Fisher Scientific (.RAW). MRMPROBS also accepts the common data format mzML, converted with the open-source file translator ProteoWizard; see Appendix B for details.

Section 3

Project type and condition


1. MRMPROBS key index = metabolite name (abf)

2. MRMPROBS key index = Function (mzML)

* The above two projects are for MRM data sets.

3. MRMPROBS key index = SCAN or DIA-MS (abf)

4. MRM-DIFF (abf, mzML)
See http://prime.psc.riken.jp/Metabolomics_Software/MRMPROBS/index.html.

Section 4

ABF file conversion

Section 4-1

Downloading the ABF converter

  1. Go to http://www.reifycs.com/AbfConverter/index.html.
  2. Check the requirements and license terms, and download the converter.

* The file converter is freely available.


Section 4-2

Check the conditions for file conversion

To convert files from some MS vendors, including Bruker, LECO, Shimadzu, Thermo, and Waters, the vendor-specific data access library must be installed on your PC.

Also see FAQ for ABF converter
http://prime.psc.riken.jp/Metabolomics_Software/MS-DIAL/index3.html

Summary of PC condition required for file conversion

Vendor                Format    Required
Agilent               .D        None, but files from ChemStation should be converted to netCDF
Bruker                .D        CompassXtract
LECO                  .PEG      All PEG files should first be converted to netCDF (AIA)
Sciex                 .WIFF     None
Shimadzu (GC/MS)      .QGD      GCMS solution
Shimadzu (LC/MS)      .LCD      LCMS solutions
Thermo                .RAW      MSFileReader
Waters                .RAW      MassLynx Raw Data Reader Interface Library
netCDF                .CDF      Microsoft Visual J# 2.0



Section 4-3

File conversion

  1. Start “AnalysisBaseFileConverter.exe”.
  2. Drag & drop MS vendor files into this program.
  3. Click “Convert”.
  4. The ABF files are generated in the same directory as the raw data files.


Section 5

Reference file format

Section 5-1

Reference library for Project type 1: MRMPROBS key index = metabolite name (abf)

Five items are required in tab-delimited format. The header names are flexible, but the column order must be kept.


Column 1.    Compound name (a)
Column 2.    Precursor m/z (accurate m/z values are rounded to nominal m/z)
Column 3.    Product m/z
Column 4.    Retention time [min]
Column 5.    Amplitude ratios [%] (b)

Notes
(a) When you choose project type 1 of MRMPROBS, the name must be identical to the compound name in the instrument setting window. The compound name MUST be written in half-width alphanumeric characters.

(b) About the amplitude ratio format

✓ Example: only one transition for one metabolite
  Thymine  125  42.05  5.58  100

✓ Example: multiple transitions for one metabolite
  G6P  258.9  97.05  9.21  100
  G6P  258.9  79.05  9.21  30.1
  G6P  258.9 199.15  9.21  5.5

Note 1: You can edit the reference library and update its information in MRMPROBS. However, empty values are not accepted when the library is imported. If you do not know a suitable retention time or amplitude ratio for a metabolite, enter arbitrary values.

Note 2: The library does not have to include every metabolite that you entered in the MS instrument method.

Note 3: A tab-delimited file exported from Microsoft Excel sometimes includes unexpected hidden trailing columns. MRMPROBS cannot handle such columns after the ‘Ratio’ column. You can inspect the exported file by selecting a few rows (see below). If the selection extends beyond the last column (Ratio), delete the extra columns in Excel and export the file again.

Good example (no unexpected column)

Bad example (there are unexpected columns)
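
The checks described in Notes 1 and 3 can also be run on the exported text before importing it. The following is a minimal sketch in Python, not part of MRMPROBS; the function name `check_library` and the assumption that the first row is a header are illustrative choices. It reports rows with the wrong number of columns (for example hidden trailing columns from Excel), empty values, and non-numeric values in the four numeric columns.

```python
import csv
import io

EXPECTED_COLUMNS = 5  # name, precursor m/z, product m/z, RT [min], ratio [%]


def check_library(text, expected_columns=EXPECTED_COLUMNS):
    """Return a list of (row_number, message) problems found in a library text."""
    problems = []
    reader = csv.reader(io.StringIO(text), delimiter="\t")
    for row_number, row in enumerate(reader, start=1):
        if not row:
            continue  # skip completely blank lines
        if len(row) != expected_columns:
            problems.append(
                (row_number, f"expected {expected_columns} columns, found {len(row)}")
            )
            continue
        if any(cell.strip() == "" for cell in row):
            problems.append((row_number, "empty value"))
            continue
        if row_number == 1:
            continue  # assumed header row: names are flexible, only the count is checked
        for cell in row[1:]:
            try:
                float(cell)
            except ValueError:
                problems.append((row_number, f"non-numeric value: {cell!r}"))
                break
    return problems
```

To use it, read the exported file as text and pass the contents to `check_library`; an empty list means that none of these three problems was found.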

Section 5-2

Reference library for Project type 2: MRMPROBS key index = Function (mzML)

Six items are required in tab-delimited text format. The header names are flexible, but the column order must be kept. (For readability, the library is shown here in Microsoft Excel.)


Column 1.    Compound name (a)
Column 2.    Function ID (b)
Column 3.    Precursor m/z (accurate m/z values are rounded to nominal m/z)
Column 4.    Product m/z
Column 5.    Retention time [min]
Column 6.    Amplitude ratios [%] (c)

Notes
(a) When you use project type 2 of MRMPROBS, the name does not have to be identical to the compound name in the instrument setting. The compound name MUST be written in half-width alphanumeric characters.

(b) The function ID is the most important key for this option. In mzML data, each MRM chromatogram carries a ‘Function ID’ markup, an unambiguous key that identifies the chromatogram for a given retention time range, precursor ion, and product ion. To see the relationship between the function IDs and the MRM information, use the SeeMS program, which can be downloaded from the ProteoWizard web page: http://proteowizard.sourceforge.net/.

  1. Open SeeMS.
  2. Select an mzML file.


To find the matching function ID in your data, use the Microsoft Excel sorting function together with your experimental condition file. In most cases, ProteoWizard sorts the functions in the order of 1. precursor ion, 2. product ion, 3. retention time starting point.

(c) About the amplitude ratio format
See Section 5-1, Reference library for Project type 1: MRMPROBS key index = metabolite name (abf).
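
The sorting rule above can be reproduced outside Excel. The sketch below is only an illustration in Python: the dictionary keys, the function name `assign_function_ids`, and the assumption that numbering starts at 1 are hypothetical, so always confirm the resulting IDs against what SeeMS shows for your own file.

```python
def assign_function_ids(transitions, first_id=1):
    """Order transitions by precursor, product, then RT start, and number them.

    `first_id` is an assumption; check the first function ID shown in SeeMS.
    """
    ordered = sorted(
        transitions,
        key=lambda t: (t["precursor"], t["product"], t["rt_start"]),
    )
    # Return copies so that the caller's list is left untouched.
    return [dict(t, function_id=first_id + i) for i, t in enumerate(ordered)]


# Hypothetical transitions; the RT start values are made up for illustration.
transitions = [
    {"name": "G6P", "precursor": 258.9, "product": 97.05, "rt_start": 8.2},
    {"name": "Thymine", "precursor": 125.0, "product": 42.05, "rt_start": 4.6},
    {"name": "G6P", "precursor": 258.9, "product": 79.05, "rt_start": 8.2},
]
numbered = assign_function_ids(transitions)
```

Because the text says this ordering holds "in most cases", treat the output as a starting point for the Function ID column rather than a guaranteed mapping.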

Section 5-3

Reference library for Project type 3: MRMPROBS key index = SCAN or DIA-MS (abf)

Section 5-3-1

Reference format for DIA-MS data

MRMPROBS can be used for scan-type data such as GC/MS, LC/MS, and LC data-independent MS/MS (DIA-MS). The figure below shows the reference library for DIA-MS data. Here, the objective is to use DIA-MS data as if it were MRM data (what we call DIA-MRM; for example, SWATH-MRM on SCIEX instruments). This library can be easily exported from the MS-DIAL software: http://prime.psc.riken.jp/Metabolomics_Software/MS-DIAL/.


Column 1.    Compound name
Column 2.    Precursor m/z
Column 3.    Product m/z
Column 4.    Retention time [min]
Column 5.    Amplitude ratios [%]
Column 6.    RT begin: start time for drawing the chromatogram
Column 7.    RT end: end time for drawing the chromatogram
Column 8.    MS1 tolerance: mass accuracy for survey scan MS data
Column 9.    MS2 tolerance: mass accuracy for MS/MS spectra
Column 10.    MS level: enter 1 for survey scan MS data (MS1) and 2 for MS/MS
Column 11.    Class: used by the MRMPROBS viewer to filter the chromatograms. Enter ‘NA’ or any placeholder if you do not need it.

Below is the description of the ‘bridge’ from MS-DIAL to MRMPROBS

Section 5-3-2

Dictionary file for DIA-MS data processing

The dictionary file should contain MS1 scan range and precursor window in combination with its experimental ID.

For SWATH data-independent analysis, the experiment file can be created in PeakView (Show -> Sample information). Do not change the column order, and keep the word “SCAN”.

Section 5-3-3

Reference format for GC/MS and LC/MS data

MRMPROBS has been extended to handle single-MS data such as GC/MS and LC/MS; the figure below shows the reference library for GC/MS data. The trick for importing single-MS data sets is 1) to assign the same values to product m/z and MS2 tolerance as to precursor m/z and MS1 tolerance, respectively, and 2) to assign ‘1’ as the MS level for all queries.
This library can be easily exported by MS-DIAL software: http://prime.psc.riken.jp/Metabolomics_Software/MS-DIAL/.

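
If you build such a library by hand rather than exporting it from MS-DIAL, the two rules above can be applied mechanically. The sketch below is a hypothetical Python helper, assuming that the single-MS library uses the same 11-column order as the DIA-MS format in Section 5-3-1; the function names and the example values are illustrative, not taken from the demonstration data.

```python
def single_ms_row(name, mz, rt, ratio, rt_begin, rt_end, tolerance, compound_class="NA"):
    """Build one 11-column reference row for single-MS (GC/MS or LC/MS) data."""
    return [
        name,
        mz,         # precursor m/z
        mz,         # product m/z: same value as the precursor m/z
        rt,
        ratio,
        rt_begin,
        rt_end,
        tolerance,  # MS1 tolerance
        tolerance,  # MS2 tolerance: same value as the MS1 tolerance
        1,          # MS level is 1 for every query
        compound_class,
    ]


def to_tab_delimited(rows):
    """Join rows into tab-delimited text, one row per line."""
    return "\n".join("\t".join(str(value) for value in row) for row in rows)


# Hypothetical example values.
row = single_ms_row("Alanine", 116.0, 6.5, 100, 6.0, 7.0, 0.5)
library_text = to_tab_delimited([row])
```

A header line still has to be added before import; its names are flexible as long as the column order is kept.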

Section 6

Starting MRMPROBS

Section 6-1

Summary for MRM demonstration data sets

  1. Starting up your project
  2. Importing Abf files
  3. Setting parameters
  4. Running the software (1-2 min / sample)

* The tutorial uses 40 demonstration files and the reference library which are downloadable from the above link. The common measurement conditions of the demonstration files were as follows.

Liquid chromatography: 25 min total run time per sample with a CERI L-column2 ODS column (150 mm × 2.1 mm, 3 μm).
Mass spectrometer: MRM method in negative ion mode.
Target metabolite number: 60
Total transitions: 166

The detailed experimental conditions can be downloaded from the MRM Database section (Ion-pair LC-QqQ/MS).

http://prime.psc.riken.jp/Metabolomics_Software/MrmDatabase/index.html

Section 6-2

Starting up your project

  1. File → New project.

  2. Choose a project type (select the top one for this demonstration).


Section 6-3

Importing Abf files


Section 6-4

Parameter


Select ‘ExampleLibrary.txt’ and set the above parameters for this demonstration.

Note:

[Recommended]
Peak detection
Smoothing method: linear weighted moving average.
Smoothing level: 1-2
Minimum peak width: 3-5
Minimum peak height: 50-100

Peak identification
Retention time tolerance: For reversed-phase or hydrophilic interaction liquid chromatography, 0.1-0.2 min is recommended.
Amplitude tolerance: 15
Minimum posterior: The minimum probability required for peak identification. MRMPROBS calculates a probability for each peak, i.e. the “probability of true target metabolite given the calculated scores”. A detected peak whose probability is below this threshold is regarded as a false peak. The recommended value is 50-70.

Note: The first data processing including file import, peak detection, and peak identification requires 5-20 seconds (depending on machine specifications) per file.
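
To give an idea of what the recommended smoothing method does, the sketch below implements a generic linear weighted moving average in Python. It is not the MRMPROBS source code: the mapping of "smoothing level" to the number of neighbouring points on each side, and the handling of the chromatogram edges by renormalizing the remaining weights, are assumptions made for this illustration.

```python
def lwma_smooth(intensities, level=1):
    """Smooth a chromatogram with a linear weighted moving average.

    `level` is taken as the number of neighbouring points on each side
    (an assumption). Weights rise linearly to the centre, e.g. 1, 2, 1
    for level 1 and 1, 2, 3, 2, 1 for level 2.
    """
    offsets = range(-level, level + 1)
    weights = [level + 1 - abs(k) for k in offsets]
    n = len(intensities)
    smoothed = []
    for i in range(n):
        total = 0.0
        weight_sum = 0
        for k, w in zip(offsets, weights):
            j = i + k
            if 0 <= j < n:  # at the edges, use only the points that exist
                total += w * intensities[j]
                weight_sum += w
        smoothed.append(total / weight_sum)
    return smoothed
```

With level 1, a single-point spike is spread over its two neighbours while a flat baseline is left unchanged, which is why low levels (1-2) preserve narrow MRM peaks.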

Section 7

MRMPROBS viewer

Section 7-1

Mouse operation in the chromatogram viewer

Main window


View mode

  1. Chromatogram window: drag while holding the left button → scroll the chromatogram; drag while holding the right button → zoom the chromatogram.
  2. Detected window: left double-click the inverted triangle → change the true peak; right double-click anywhere → uncheck the detected peaks.
  3. Retention time window: drag while holding the right button → rescale the retention time range.
  4. Intensity window: drag while holding the right button → rescale the intensity range.

Edit mode

  1. Left click and drag on the peak edge [red square] → change the location of the peak edge.
  2. Right click and drag → detect new peak.

Section 7-2

Library editor (optional)


Note: The details and the operation method for chromatogram viewer are described later.

Section 7-3

Tool button


Section 7-4

Tab


Raw data matrix

Section 7-5

Button


Section 7-6

List Box


If you double-click a metabolite name or a file name, the chromatograms are generated in the chromatogram viewer.

Section 7-7

Details on the MRMPROBS function

Section 7-7-1

File menu

Section 7-7-2

Data reprocessing

In this option, data can be re-processed with newly optimized parameters. Re-processing can be performed per metabolite or per file, with the parameters set for each, and the target MRM can also be changed. Re-processing is fast because the file import has already been performed.


Section 7-7-3

Statistical analysis

The current program offers two types of missing value approaches and can normalize quantification values by an internal standard or by loess/cubic spline using the analytical order information. To use an internal standard, you must first set it in the “Option menu”. The program can also perform principal component analysis.


Section 7-7-4

Missing value methods

  1. The process is performed per column, i.e. per metabolite.
  2. If the value of a metabolite is “N.D.” in all files, a zero (0) value is assigned.
  3. Otherwise, the retention times from the files in which the metabolite was detected (not “N.D.”) are collected and their average is calculated.
  4. For each “N.D.” file, the intensity of the data point corresponding to the average retention time in the processed (smoothed) EIC chromatogram is assigned as the quantification value.
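
The four steps above can be sketched for a single metabolite as follows. This is an illustrative Python reading of the procedure, not the MRMPROBS implementation: `None` stands for “N.D.”, the data structures and the name `impute_missing` are hypothetical, and "the data point corresponding to the average retention time" is interpreted here as the nearest data point.

```python
def impute_missing(quant_values, retention_times, chromatograms):
    """Fill "N.D." entries for one metabolite (one column of the data matrix).

    quant_values:    per file, a number or None for "N.D."
    retention_times: per file, the detected peak RT or None for "N.D."
    chromatograms:   per file, a list of (rt, intensity) points after smoothing
    """
    detected = [rt for rt in retention_times if rt is not None]
    if not detected:
        # Step 2: "N.D." in all files, so zero is assigned everywhere.
        return [0.0] * len(quant_values)
    mean_rt = sum(detected) / len(detected)  # step 3
    filled = []
    for value, points in zip(quant_values, chromatograms):
        if value is not None:
            filled.append(value)
            continue
        # Step 4: intensity at the data point closest to the average RT.
        nearest = min(points, key=lambda p: abs(p[0] - mean_rt))
        filled.append(nearest[1])
    return filled
```

Chromatograms of files with a detected peak are not read, so only the “N.D.” files need their smoothed data points in memory.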

Section 7-7-5

Normalization

After clicking the “Done” button, the “Statistical analysis setting” button is activated.

You can perform principal component analysis. Enter the number of principal components to calculate and choose the scaling and transformation methods.


Zooming in and out can be done with the mouse wheel. Each principal component is shown by selecting the X axis or the Y axis combobox.

Section 7-7-6

Window menu

Tile layouts are available to suit your computer’s screen resolution. Select the one you prefer.


Section 7-7-7

View menu

In this menu, the chromatograms in the chromatogram viewer can be sorted by file ID, analytical order, class ID, or file type.

Section 7-7-8

Option menu

Here you can set the properties of metabolites and files. In particular, this option menu is used to create a data matrix for statistical analysis.
In the file properties, you can reset the file type, class ID, and analytical order; the file name cannot be changed. If you clear the “included” check box of a file, it is no longer included in the processed data matrix.
In the metabolite properties, you can set the internal standard independently for each metabolite. Make sure that the name entered in the “internal standard” column is exactly identical to the metabolite name of the internal standard; we therefore recommend using copy and paste. In this window, copy and paste works only from the keyboard, but you can paste into multiple rows at once. For example, copy a metabolite name with Ctrl + C, select the rows you want to fill in the internal standard column by dragging, and paste the clipboard contents with Ctrl + V.
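
The reason the names must match exactly becomes clear when the normalization is written out. The sketch below is a hypothetical Python illustration, assuming that internal standard normalization divides each metabolite value by the value of its assigned standard in the same file; the data structures and function name are not from MRMPROBS.

```python
def normalize_by_internal_standard(matrix, internal_standards):
    """Divide each metabolite by its internal standard, file by file.

    matrix:             {metabolite name: [quantification value per file]}
    internal_standards: {metabolite name: name of its internal standard}
    Metabolites without an assigned standard are copied unchanged.
    A zero standard value raises ZeroDivisionError rather than being hidden.
    """
    normalized = {}
    for metabolite, values in matrix.items():
        standard = internal_standards.get(metabolite)
        if standard is None:
            normalized[metabolite] = list(values)
            continue
        if standard not in matrix:
            # The lookup is an exact string match, so a typo fails here.
            raise KeyError(f"internal standard {standard!r} not found for {metabolite!r}")
        normalized[metabolite] = [v / s for v, s in zip(values, matrix[standard])]
    return normalized
```

In this sketch a name that differs only in letter case or by one space is treated as missing, which is the kind of mismatch that copy and paste avoids.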


Section 7-7-9

Export menu

Tab-delimited text files can be exported for the raw data matrix, the processed data matrix, the updated library, the detected peak details, and the PCA results. The PCA result can also be exported in several image formats.


Appendix A

How to obtain appropriate file conversion of the Shimadzu .lcd file.

Although the contents of an .lcd file can be changed after LC-QqQ/MS (MRM) analysis, it is much easier to construct a suitable method file (.lcm format) beforehand so that the files convert successfully for MRMPROBS.

1. Event name and channel (MRM transitions) rule.


2. Update compound table
After constructing the MRM transitions in the method, update the m/z values in the compound table from the MRM events. If the samples are analyzed with the updated method file, no further steps are needed for reliable file conversion.


You can check the updated table by Method->Data Processing Parameters->Compound tab.


3. If your data (.lcd) were not collected with a suitable method as described above, you can fix the .lcd files by applying a method file modified in the above way. After constructing the modified method file, open “Postrun Analysis” in LabSolutions.


After selecting the analysis files (.lcd) push the “Apply to Method” button.


Select the modified method file to update your .lcd file, including the compound table m/z. The .lcd file can then be converted successfully by the Reifycs Inc. software.


4. File conversion
Conditions: Converting .lcd files to .abf files requires LabSolutions to be installed on your computer. “TTFLDataExportVer5.dll” from LabSolutions ver. 5.53 SP4 or later is required. Check the file properties of “TTFLDataExportVer5.dll” (Program Files (or Program Files (x86)) > LabSolutions). If the file size is less than 577,536 bytes, contact Shimadzu Inc. to obtain a replacement file.
After opening “AnalysisBaseFileConverter.exe”, drag and drop the .lcd files onto the converter.


Push the “Convert” button. The ABF format files will be generated in the same folder as the .lcd files.
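
The file size condition in step 4 can also be checked with a short script instead of the file property dialog. The Python sketch below only compares the size against the 577,536-byte threshold stated above; the function name is illustrative and the path in the comment is an assumed install location that you should adjust to your own system.

```python
import os

MIN_DLL_SIZE = 577_536  # bytes; smaller files need to be replaced (see step 4)


def dll_is_large_enough(path, minimum=MIN_DLL_SIZE):
    """Return True if the file at `path` is at least `minimum` bytes."""
    return os.path.getsize(path) >= minimum


# Example call (assumed path, adjust to your LabSolutions installation):
# dll_is_large_enough(r"C:\Program Files\LabSolutions\TTFLDataExportVer5.dll")
```

A result of `False` corresponds to the case in which the text advises contacting Shimadzu Inc.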

Appendix B

Third option of MRMPROBS: via mzML file.

Required software and file

MRMPROBS can import mzML files. In this option, the “function ID” is used to extract the chromatogram data. Users must add the “function ID” to the reference library in addition to the normal library columns.

Download ProteoWizard

  1. Select the download type: the Windows installer (includes vendor reader support) is recommended.
  2. Read the license agreements and download ProteoWizard.


(http://proteowizard.sourceforge.net/downloads.shtml)

Setup ProteoWizard

  1. Follow the installation wizard. (The steps are straightforward.)
  2. “SeeMS” should also be installed.

Convert the vendor’s MS file to mzML via ProteoWizard

  1. Open MSConvertGUI.exe.
  2. Select “List of Files”.
  3. Select the vendor’s file via the “Browse” button.
  4. In “Options”, do not check any additional compression, including “Use numpress linear compression”, “Use numpress short logged float compression”, and “Use numpress short positive integer compression”. Either binary encoding precision can be used.
  5. Click the “Start” button.


Note! ProteoWizard does not support the Shimadzu MS format. To use Shimadzu files, please use the ABF converter.
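
For many files, the same conversion can be scripted with the command-line `msconvert` tool that is installed together with MSConvertGUI. The Python sketch below is an assumption-laden illustration rather than part of this tutorial's workflow: it assumes `msconvert` is on your PATH, uses only the `--mzML` and `-o` (output directory) options, and deliberately adds no numpress or other compression options, matching step 4 of the conversion list. The function names are hypothetical.

```python
import subprocess


def build_msconvert_command(vendor_file, output_dir):
    """Return the msconvert argument list for a plain mzML conversion."""
    # No compression options are added, in line with step 4 of the GUI procedure.
    return ["msconvert", vendor_file, "--mzML", "-o", output_dir]


def convert(vendor_file, output_dir):
    """Run msconvert; raises CalledProcessError if the conversion fails."""
    subprocess.run(build_msconvert_command(vendor_file, output_dir), check=True)
```

After conversion, open the resulting mzML file in SeeMS to confirm that the function IDs match the reference library.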