Feature extraction from resolution perspective for gas chromatography-mass spectrometry datasets

Pan Ma; Zhimin Zhang; Xinyi Zhou; Yonghuan Yun; Yizeng Liang; Hongmei Lu

doi:10.1039/C6RA17864B

Pan Ma,^a Zhimin Zhang,^a Xinyi Zhou,^a Yonghuan Yun,^a Yizeng Liang^a and Hongmei Lu*^a

* Corresponding authors

^a College of Chemistry and Chemical Engineering, Central South University, Changsha 410083, PR China
E-mail: hongmeilu@csu.edu.cn
Tel: +86-731-88830831

Abstract

Automatic feature extraction from large-scale datasets is one of the major challenges in analyzing complex samples with gas chromatography-mass spectrometry (GC-MS). The classic processing pipeline consists of noise filtering, baseline correction, peak detection, alignment, normalization and identification. Because the pipeline is long, the extracted features vary with the methods and parameter values chosen at each step. In this study, MS-Assisted Resolution of Signals (MARS) is proposed to extract features automatically from large-scale GC-MS datasets from a resolution perspective. First, it divides complex data into small segments and searches for the target zone by moving sub-window factor analysis (MSWFA). Then, an improved iterative target transformation factor analysis (ITTFA) is used to extract the features of the compound from the complex datasets. MARS was systematically tested on a simulated dataset (5 samples), a peppermint dataset (2 samples), a red wine dataset (24 samples) and a human plasma dataset (131 samples). The results show that MARS can extract features accurately, automatically, objectively and swiftly from these complex datasets, at a speed of 2–3 minutes per chromatogram. The extracted features of overlapped peaks are comparable to those resolved by MCR-ALS or PARAFAC2, and significantly better than those extracted by XCMS. Furthermore, PLS-DA models of the human plasma dataset indicated that features extracted automatically by MARS are comparable to or better than features extracted manually by experts with a GC-MS workstation. MARS has been implemented and open-sourced at https://github.com/zmzhang/MARS.
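To make the ITTFA step named in the abstract more concrete, the following is a minimal sketch of the generic, textbook form of iterative target transformation factor analysis. It is not the improved variant developed in the paper, and it does not cover the MSWFA search. The function name `ittfa_profile`, its parameters, the needle-shaped starting vector, the non-negativity constraint and the simulated two-component segment are all assumptions chosen for illustration.

```python
import numpy as np

def ittfa_profile(X, n_components, start_index, max_iter=500, tol=1e-10):
    """Generic ITTFA sketch (hypothetical helper, not the MARS code).

    X is one data segment with rows = scans and columns = m/z channels.
    Returns a unit-norm, non-negative estimate of one elution profile.
    """
    # Orthonormal basis of the chromatographic (row) space from a truncated SVD.
    U, _, _ = np.linalg.svd(X, full_matrices=False)
    U = U[:, :n_components]

    # Assumed starting guess: a "needle" at the suspected peak apex.
    target = np.zeros(X.shape[0])
    target[start_index] = 1.0

    for _ in range(max_iter):
        # Target transformation: project the test vector onto the factor space.
        projected = U @ (U.T @ target)
        # Constraint step: an elution profile cannot be negative.
        refined = np.clip(projected, 0.0, None)
        norm = np.linalg.norm(refined)
        if norm == 0.0:
            break
        refined = refined / norm
        converged = np.linalg.norm(refined - target) < tol
        target = refined
        if converged:
            break
    return target

# Simulated segment (assumed data): two overlapping Gaussian peaks, random spectra.
rng = np.random.default_rng(0)
scans = np.arange(100)
true_c1 = np.exp(-((scans - 40) ** 2) / (2 * 5.0 ** 2))
true_c2 = np.exp(-((scans - 60) ** 2) / (2 * 5.0 ** 2))
spectra = rng.random((2, 30))                  # 2 components x 30 m/z channels
X = np.column_stack([true_c1, true_c2]) @ spectra

profile = ittfa_profile(X, n_components=2, start_index=40)
correlation = np.corrcoef(profile, true_c1)[0, 1]
print("correlation with the simulated first profile:", correlation)
```

In this sketch the only constraint is non-negativity, so the result is one feasible profile rather than a guaranteed unique solution; practical implementations usually add further constraints such as unimodality.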

Article information

DOI: https://doi.org/10.1039/C6RA17864B
Article type: Paper
Submitted: 13 Jul 2016
Accepted: 25 Nov 2016
First published: 01 Dec 2016

RSC Adv., 2016, 6, 113997-114004

P. Ma, Z. Zhang, X. Zhou, Y. Yun, Y. Liang and H. Lu, RSC Adv., 2016, 6, 113997 DOI: 10.1039/C6RA17864B
