Issue 4, 2018

Genome-wide survey of remote homologues for protein domain superfamilies of known structure reveals unequal distribution across structural classes

Abstract

Domains are the basic building blocks of proteins which can combine to give rise to different domain architectures. Annotation of domains in a sequence is the first step towards understanding the biological function. Since there are a limited number of folds and evolutionarily related proteins have a similar structure, function can be inferred through remote homology. Computational sequence searches were performed for remote homologues on genomes of around ∼160 000 different organisms, starting from nearly 11 000 superfamily queries of known structure. Case studies revealed that most of the associated domains are involved in the same biological process. Using all the proteins predicted to have at least one structural domain, a coverage of 61% of Pfam families was achieved which is higher than the existing methods (43.36% by SIFTS). Taxonomic analysis of the proteins revealed 493 superfamilies in all the major kingdoms of life and a few lateral gene transfers between viruses and cellular organisms. The distribution of remote homologues across different classes, folds and superfamilies was studied and reveals that sequences are unequally distributed across structural classes. Finally, domain architectures were computed for the homologues and these data were compiled for each superfamily and organism.

Graphical abstract: Genome-wide survey of remote homologues for protein domain superfamilies of known structure reveals unequal distribution across structural classes

Supplementary files

Article information

Article type
Research Article
Submitted
09 Jan 2018
Accepted
22 Jun 2018
First published
27 Jun 2018

Mol. Omics, 2018,14, 266-280

Genome-wide survey of remote homologues for protein domain superfamilies of known structure reveals unequal distribution across structural classes

M. S. Iyer, A. G. Joshi and R. Sowdhamini, Mol. Omics, 2018, 14, 266 DOI: 10.1039/C8MO00008E

To request permission to reproduce material from this article, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements