Jump to main content
Jump to site search

Issue 8, 2012
Previous Article Next Article

Cascaded walks in protein sequence space: use of artificial sequences in remote homology detection between natural proteins

Author affiliations

Abstract

Over the past two decades, many ingenious efforts have been made in protein remote homology detection. Because homologous proteins often diversify extensively in sequence, it is challenging to demonstrate such relatedness through entirely sequence-driven searches. Here, we describe a computational method for the generation of ‘protein-like’ sequences that serves to bridge gaps in protein sequence space. Sequence profile information, as embodied in a position-specific scoring matrix of multiply aligned sequences of bona fide family members, serves as the starting point in this algorithm. The observed amino acid propensity and the selection of a random number dictate the selection of a residue for each position in the sequence. In a systematic manner, and by applying a ‘roulette-wheel’ selection approach at each position, we generate parent family-like sequences and thus facilitate an enlargement of sequence space around the family. When generated for a large number of families, we demonstrate that they expand the utility of natural intermediately related sequences in linking distant proteins. In 91% of the assessed examples, inclusion of designed sequences improved fold coverage by 5–10% over searches made in their absence. Furthermore, with several examples from proteins adopting folds such as TIM, globin, lipocalin and others, we demonstrate that the success of including designed sequences in a database positively sensitized methods such as PSI-BLAST and Cascade PSI-BLAST and is a promising opportunity for enormously improved remote homology recognition using sequence information alone.

Graphical abstract: Cascaded walks in protein sequence space: use of artificial sequences in remote homology detection between natural proteins

Back to tab navigation

Supplementary files

Publication details

The article was received on 22 Mar 2012, accepted on 15 May 2012 and first published on 13 Jun 2012


Article type: Paper
DOI: 10.1039/C2MB25113B
Mol. BioSyst., 2012,8, 2076-2084

  •   Request permissions

    Cascaded walks in protein sequence space: use of artificial sequences in remote homology detection between natural proteins

    S. Sandhya, R. Mudgal, C. Jayadev, K. R. Abhinandan, R. Sowdhamini and N. Srinivasan, Mol. BioSyst., 2012, 8, 2076
    DOI: 10.1039/C2MB25113B

Search articles by author

Spotlight

Advertisements