Issue 115, 2016, Issue in Progress

Feature extraction from resolution perspective for gas chromatography-mass spectrometry datasets

Abstract

Automatic feature extraction from large-scale datasets is one of the major challenges when analyzing complex samples with gas chromatography-mass spectrometry (GC-MS). The classic processing pipeline basically consists of noise filtering, baseline correction, peak detection, alignment, normalization and identification. The long pipeline makes the extracted features inconsistent with different methods and values of parameters. In this study, MS-Assisted Resolution of Signals (MARS) has been proposed to extract features automatically from resolution perspective for large-scale GC-MS datasets. Firstly, it divides complex data into small segments and searches the target zone by moving sub-window factor analysis (MSWFA). Then, improved iterative target transformation factor analysis (ITTFA) has been developed to extract features of the compound from complex datasets. MARS was systematically tested on a simulated dataset (5 samples), peppermint dataset (2 samples), red wine dataset (24 samples) and human plasma dataset (131 samples). The results show that MARS can extract features accurately, automatically, objectively and swiftly from these complex datasets at 2–3 minutes/chromatogram speed. The extracted features of overlapped peaks are comparable to the features resolved by MCR-ALS or PARAFAC2, and significantly better than XCMS. Furthermore, PLS-DA models of the human plasma dataset indicated that features extracted automatically by MARS are comparable or better than features extracted manually by experts with a GC-MS workstation. It has been implemented and open-sourced at https://github.com/zmzhang/MARS.

Graphical abstract: Feature extraction from resolution perspective for gas chromatography-mass spectrometry datasets

Supplementary files

Article information

Article type
Paper
Submitted
13 Jul 2016
Accepted
25 Nov 2016
First published
01 Dec 2016

RSC Adv., 2016,6, 113997-114004

Feature extraction from resolution perspective for gas chromatography-mass spectrometry datasets

P. Ma, Z. Zhang, X. Zhou, Y. Yun, Y. Liang and H. Lu, RSC Adv., 2016, 6, 113997 DOI: 10.1039/C6RA17864B

To request permission to reproduce material from this article, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements