Issue 5, 2020

Enhancer recognition and prediction during spermatogenesis based on deep convolutional neural networks

Abstract

Motivation: enhancers play an important role in the regulation of gene expression during spermatogenesis. The development of ChIP-Chip and ChIP-Seq sequencing technology has enabled researchers to focus on the relationship between enhancers and DNA sequences and histone protein modifications. However, the prediction of enhancers based on the locally conserved DNA sequence and similar histone modification features is still unknown. Here, the present study proposed a convolutional neural network (CNN) model to predict enhancers that can regulate gene expression during spermatogenesis. Results: we have obtained a positive set of enhancers using the P300 locus, verified by experiments, while a negative set was constructed using the promoter as a non-enhancer locus. The model was trained on all types of specific cells during spermatogenesis independently, and the transfer learning strategy was used to fine-tune the model based on which the model can be trained and adapted to other cells quickly. We visualized the convolution layer of the trained model and aligned the predicted enhancer with the JASPAR database. The results showed that the model was highly matched with some important transcription factors during spermatogenesis, signifying the reliability of the model. Finally, we compared the CNN algorithm with the gkmSVM algorithm (Support Vector Machine). It is well known that CNN has better performance than the gkmSVM algorithm, especially in the generalization ability. Our work demonstrated their strong learning ability and the low CPU requirements for the experiment, with a small number of convolution layers and simple network structure, while avoiding overfitting the training data. At the end of the experiment, we used the trained model to build an enhancer recognition website for further research and communication.

Graphical abstract: Enhancer recognition and prediction during spermatogenesis based on deep convolutional neural networks

Article information

Article type
Research Article
Submitted
13 Mar 2020
Accepted
28 May 2020
First published
29 May 2020

Mol. Omics, 2020,16, 455-464

Enhancer recognition and prediction during spermatogenesis based on deep convolutional neural networks

C. Sun, N. Zhang, P. Yu, X. Wu, Q. Li, T. Li, H. Li, X. Xiao, A. Shalmani, L. Li, D. Che, X. Wang, P. Zhang, Z. Chen, T. Liu, J. Zhao, J. Hua and M. Liao, Mol. Omics, 2020, 16, 455 DOI: 10.1039/D0MO00031K

To request permission to reproduce material from this article, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements