Jump to main content
Jump to site search


An efficient method to transcription factor binding sites imputation via simultaneous completion of multiple matrices with positional consistency

Author affiliations

Abstract

Transcription factors (TFs) are DNA-binding proteins that have a central role in regulating gene expression. Identification of DNA-binding sites of TFs is a key task in understanding transcriptional regulation, cellular processes and disease. Chromatin immunoprecipitation followed by high-throughput sequencing (ChIP-seq) enables genome-wide identification of in vivo TF binding sites. However, it is still difficult to map every TF in every cell line owing to cost and biological material availability, which poses an enormous obstacle for integrated analysis of gene regulation. To address this problem, we propose a novel computational approach, TFBSImpute, for predicting additional TF binding profiles by leveraging information from available ChIP-seq TF binding data. TFBSImpute fuses the dataset to a 3-mode tensor and imputes missing TF binding signals via simultaneous completion of multiple TF binding matrices with positional consistency. We show that signals predicted by our method achieve overall similarity with experimental data and that TFBSImpute significantly outperforms baseline approaches, by assessing the performance of imputation methods against observed ChIP-seq TF binding profiles. Besides, motif analysis shows that TFBSImpute preforms better in capturing binding motifs enriched in observed data compared with baselines, indicating that the higher performance of TFBSImpute is not simply due to averaging related samples. We anticipate that our approach will constitute a useful complement to experimental mapping of TF binding, which is beneficial for further study of regulation mechanisms and disease.

Graphical abstract: An efficient method to transcription factor binding sites imputation via simultaneous completion of multiple matrices with positional consistency

Back to tab navigation

Supplementary files

Publication details

The article was received on 15 Mar 2017, accepted on 06 Jul 2017 and first published on 06 Jul 2017


Article type: Paper
DOI: 10.1039/C7MB00155J
Citation: Mol. BioSyst., 2017, Advance Article
  •   Request permissions

    An efficient method to transcription factor binding sites imputation via simultaneous completion of multiple matrices with positional consistency

    W. Guo and D. Huang, Mol. BioSyst., 2017, Advance Article , DOI: 10.1039/C7MB00155J

Search articles by author

Spotlight

Advertisements