Issue 5, 2014

The relationship between classification of multi-domain proteins using an alignment-free approach and their functions: a case study with immunoglobulins

Abstract

Establishing functional relationships between multi-domain protein sequences is a non-trivial task. Traditionally, delineating the functional assignment and relationships of proteins requires domain assignment as a prerequisite. This process is sensitive to alignment quality and domain definitions, and in multi-domain proteins the quality of alignments is often poor for multiple reasons. We report the correspondence between the classification of proteins represented as full-length gene products and their functions. Our approach differs fundamentally from traditional methods in that the classification is not performed at the level of domains. The method is based on an alignment-free computation of local matching scores (LMS) at the amino-acid sequence level, followed by hierarchical clustering. As there are no gold standards for full-length protein sequence classification, we used Gene Ontology and domain-architecture-based similarity measures to assess our classification. The final clusters obtained using LMS show high functional and domain-architectural similarity. Comparison of the current method with alignment-based approaches, at both the domain and the full-length protein level, showed that LMS scores perform better. Using this method, we have recreated objective relationships among different protein kinase sub-families and have also classified immunoglobulin-containing proteins, for which sub-family definitions do not currently exist. The method can be applied to any set of protein sequences and hence will be instrumental in the analysis of large numbers of full-length protein sequences.
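The two-step idea described in the abstract (an alignment-free pairwise score followed by hierarchical clustering) can be illustrated with a minimal sketch. This is not the authors' LMS computation, which the abstract does not specify. As a stand-in, the sketch assumes a Jaccard distance over shared 3-residue words and average-linkage agglomerative clustering. The function names, the word length `k`, the cut-off `threshold`, and the toy sequences are all hypothetical and chosen only for illustration.

```python
from itertools import combinations


def kmers(seq, k=3):
    """Return the set of overlapping k-residue words in a sequence."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def kmer_distance(a, b, k=3):
    """Jaccard distance between word sets (a stand-in score, NOT the paper's LMS).

    0.0 means identical word content; 1.0 means no shared words.
    """
    ka, kb = kmers(a, k), kmers(b, k)
    union = ka | kb
    if not union:
        return 0.0
    return 1.0 - len(ka & kb) / len(union)


def average_linkage(seqs, threshold, k=3):
    """Merge the closest pair of clusters repeatedly, stopping once the
    smallest average inter-cluster distance exceeds `threshold`."""
    names = list(seqs)
    # Pre-compute all pairwise distances; no alignment is performed.
    dist = {frozenset((p, q)): kmer_distance(seqs[p], seqs[q], k)
            for p, q in combinations(names, 2)}
    clusters = [[n] for n in names]

    def linkage(c1, c2):
        total = sum(dist[frozenset((p, q))] for p in c1 for q in c2)
        return total / (len(c1) * len(c2))

    while len(clusters) > 1:
        (i, j), d = min(
            (((i, j), linkage(clusters[i], clusters[j]))
             for i, j in combinations(range(len(clusters)), 2)),
            key=lambda item: item[1])
        if d > threshold:
            break
        # i < j is guaranteed by combinations, so deleting j keeps i valid.
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]
    return [sorted(c) for c in clusters]


# Hypothetical toy sequences: two near-identical pairs with no words in common.
toy = {
    "a1": "MKVLAAGIVGLLLAQ",
    "a2": "MKVLAAGIVGLLLSQ",
    "b1": "GSSPEDRTWYHCNQF",
    "b2": "GSSPEDRTWYHCNQY",
}
print(average_linkage(toy, threshold=0.5))
```

Because the distance is computed on whole sequences, no domain boundaries or alignments are needed, which is the property the abstract emphasises. For real data sets, a library implementation of hierarchical clustering would likely be preferable to this quadratic-memory loop.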

Article information

Article type: Paper
Submitted: 03 Oct 2013
Accepted: 14 Jan 2014
First published: 14 Jan 2014

Mol. BioSyst., 2014, 10, 1082-1093

R. M. Bhaskara, P. Mehrotra, R. Rakshambikai, M. Gnanavel, J. Martin and N. Srinivasan, Mol. BioSyst., 2014, 10, 1082 DOI: 10.1039/C3MB70443B
