Issue 10, 2007

Composite sequence proteomic analysis of proteinbiomarkers of Campylobacter coli, C. lari and C. concisus for bacterial identification

Abstract

Protein biomarkers observed in the matrix-assisted laser desorption/ionization time-of-flight mass spectra (MALDI-TOF-MS) of cell lysates of three strains of Campylobacter coli, two strains of C. lari and one strain of C. concisus have been identified by ‘bottom-up’ proteomic techniques. The significant findings are as follows. First, the proteinbiomarkers identified were: PhnA-related protein, 4-oxalocrotonate tautomerase (DmpI)-related protein, NifU-like protein, cytochrome c, DNA-bindingprotein HU, 10 kDa chaperonin, thioredoxin, as well as several conserved hypothetical and ribosomal proteins. Second, variations in the biomarker ion m/z in MALDI-TOF-MS spectra across species and strains are the result of variations in the amino acid sequence of the protein due to non-synonymous mutations of the biomarker gene. Third, the most common post-translational modifications (PTMs) were the removal of the N-terminal methionine and N-terminal signal peptides. However, in the case of the NifU protein (an iron–sulfur cluster transport protein), post-translational cleavage occurred from the C-terminus. Fourth, only the genomes of the C. coli strain RM2228 and C. lari strain RM2100 have been sequenced; thus, proteomic identification of the proteins of the other strains in this study relied upon sequence homology to the genomic sequence of these strains as well as the genomes of sequences of other Campylobacter strains. In some cases, the determination of the full amino acid sequence of a proteinbiomarker from a genomically non-sequenced strain was accomplished by combining non-overlapping partial sequences from proteomic identifications of genomically-sequenced strains that were of the same species (or of a different species) to that of the non-sequenced strain. The accuracy of this composite sequence was confirmed by both MS and MS/MS. It was necessary, in some cases, to perform de novo sequencing on ‘gaps’ in the composite sequence that were not homologous to any genomically-sequenced strain. In order to validate the composite sequence approach, composite sequences were further confirmed by subsequent DNA sequencing of the biomarker gene. Thus, using the composite sequence approach, it was possible to determine the full amino acid sequence of an unknown protein from a genomically non-sequenced bacterial strain without the necessity of either sequencing the biomarker gene or performing full de novoMS/MS sequencing. The sequence obtained could then be used as a strain-specific biomarker for analysis by ‘top-down’ proteomics techniques.

Graphical abstract: Composite sequence proteomic analysis of proteinbiomarkers of Campylobacter coli, C. lari and C. concisus for bacterial identification

Supplementary files

Article information

Article type
Paper
Submitted
26 Feb 2007
Accepted
22 Jun 2007
First published
20 Jul 2007

Analyst, 2007,132, 1010-1023

Composite sequence proteomic analysis of proteinbiomarkers of Campylobacter coli, C. lari and C. concisus for bacterial identification

C. K. Fagerquist, E. Yee and W. G. Miller, Analyst, 2007, 132, 1010 DOI: 10.1039/B702859H

To request permission to reproduce material from this article, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements