High-Order Convolutional Neural Network Architecture for Predicting DNA-Protein Binding Sites

被引:82
作者
Zhang, Qinhu [1 ]
Zhu, Lin [1 ]
Huang, De-Shuang [1 ]
机构
[1] Tongji Univ, Sch Elect & Informat Engn, Inst Machine Learning & Syst Biol, 4800 Caoan Rd, Shanghai 201804, Peoples R China
基金
中国博士后科学基金; 美国国家科学基金会;
关键词
High-order; multi-scale convolutional layer; transcription factor; binding specificity; TRANSCRIPTION FACTOR; CLASSIFICATION; REPRESENTATION; METHODOLOGY; SEQUENCES;
D O I
10.1109/TCBB.2018.2819660
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Although Deep learning algorithms have outperformed conventional methods in predicting the sequence specificities of DNA-protein binding, they lack to consider the dependencies among nucleotides and the diverse binding lengths for different transcription factors (TFs). To address the above two limitations simultaneously, in this paper, we propose a high-order convolutional neural network architecture (HOCNN), which employs a high-order encoding method to build high-order dependencies among nucleotides, and a multi-scale convolutional layer to capture the motif features of different length. The experimental results on real ChIP-seq datasets show that the proposed method outperforms the state-of-the-art deep learning method (DeepBind) in the motif discovery task. In addition, we provide further insights about the importance of introducing additional convolutional kernels and the degeneration problem of importing high-order in the motif discovery task.
引用
收藏
页码:1184 / 1192
页数:9
相关论文
共 64 条
[1]   Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning [J].
Alipanahi, Babak ;
Delong, Andrew ;
Weirauch, Matthew T. ;
Frey, Brendan J. .
NATURE BIOTECHNOLOGY, 2015, 33 (08) :831-+
[2]  
[Anonymous], 2017, COMMUN ACM, DOI DOI 10.1145/3065386
[3]  
[Anonymous], PATTERN RECOGN LETT
[4]  
[Anonymous], 2012, MACH LEARN
[5]  
Bailey T L, 1995, Proc Int Conf Intell Syst Mol Biol, V3, P21
[6]   CIPPN: computational identification of protein pupylation sites by using neural network [J].
Bao, Wenzheng ;
You, Zhu-Hong ;
Huang, De-Shuang .
ONCOTARGET, 2017, 8 (65) :108867-108879
[7]   Novel human microbe-disease association prediction using network consistency projection [J].
Bao, Wenzheng ;
Jiang, Zhichao ;
Huang, De-Shuang .
BMC BIOINFORMATICS, 2017, 18
[8]   Classification of Protein Structure Classes on Flexible Neutral Tree [J].
Bao, Wenzheng ;
Wang, Dong ;
Chen, Yuehui .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2017, 14 (05) :1122-1133
[9]   Pupylation sites prediction with ensemble classification model [J].
Bao, Wenzheng ;
Huang, Zhenhua ;
Yuan, Chang-An ;
Huang, De-Shuang .
INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2017, 18 (02) :91-104
[10]   Prediction of protein structure classes with flexible neural tree [J].
Bao, Wenzheng ;
Chen, Yuehui ;
Wang, Dong .
BIO-MEDICAL MATERIALS AND ENGINEERING, 2014, 24 (06) :3797-3806