On the necessity and biological significance of threshold-free regulon prediction outputs

Sébastien Rigali; Renaud Nivelle; Pierre Tocquin

doi:10.1039/C4MB00485J

On the necessity and biological significance of threshold-free regulon prediction outputs

Sébastien Rigali,*^a Renaud Nivelle^ab and Pierre Tocquin^b

* Corresponding authors

^a Centre for Protein Engineering, University of Liège, Institut de Chimie B6a, B-4000 Liège, Belgium
E-mail: srigali@ulg.ac.be

^b Laboratory of Plant Physiology, PhytoSYSTEMS, University of Liège, B-4000 Liège, Belgium

Abstract

The in silico prediction of cis-acting elements in a genome is an efficient way to quickly obtain an overview of the biological processes controlled by a trans-acting factor, and connections between regulatory networks. Several regulon prediction web tools are available, designed to identify DNA motifs predicted to be bound by transcription factors using position weight matrix-based algorithms. In this paper we expose and discuss the conflicting objectives of software creators (bioinformaticians) and software users (biologists), who aim for reliable and exhaustive prediction outputs, respectively. Software makers, concerned with providing tools that minimise the number of false positive hits, often impose a stringent threshold score for a sequence to be included in the list of the putative cis-acting sites. This rigidity eventually results in the identification of strongly reliable but largely straightforward sites, i.e. those associated with genes already anticipated to be targeted by the studied transcription factor. Importantly, this biased identification of strongly bound sequences contrasts with the biological reality where, in many circumstances, a weak DNA–protein interaction is required for the appropriate gene's expression. We show here a series of transcriptionally controlled systems involving weakly bound cis-acting elements that could never have been discovered because of the policy of preventing software users from modifying the screening parameters. Proposing only trustworthy prediction outputs thus prevents biologists from fully utilising their knowledge background and deciding to analyse statistically irrelevant hits that could nonetheless be potentially involved in subtle, unexpected, though essential cis–trans relationships.

Article information

https://doi.org/10.1039/C4MB00485J

Article type

Opinion

Submitted

13 Aug 2014

Accepted

06 Nov 2014

First published

06 Nov 2014

Download Citation

Mol. BioSyst., 2015,11, 333-337

Permissions

Request permissions

On the necessity and biological significance of threshold-free regulon prediction outputs

S. Rigali, R. Nivelle and P. Tocquin, Mol. BioSyst., 2015, 11, 333 DOI: 10.1039/C4MB00485J

To request permission to reproduce material from this article, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Molecular BioSystems

On the necessity and biological significance of threshold-free regulon prediction outputs

Abstract

Article information

Download Citation

Author version available

Permissions

On the necessity and biological significance of threshold-free regulon prediction outputs

Search articles by author

Spotlight

Advertisements