Batch Mode TD(λ) for Controlling Partially Observable Gene Regulatory Networks

被引:5
作者
Sirin, Utku [1 ]
Polat, Faruk [2 ]
Alhajj, Reda [3 ]
机构
[1] Ecole Polytech Fed Lausanne, Sch Comp & Commun Sci, CH-1015 Lausanne, Switzerland
[2] Middle East Tech Univ, Dept Comp Engn, TR-06800 Ankara, Turkey
[3] Univ Calgary, Dept Comp Sci, Calgary, AB T2N 1N4, Canada
关键词
Batch mode reinforcement learning; temporal-difference learning; gene regulatory networks; gene expression; gene regulation; PROBABILISTIC BOOLEAN NETWORKS; EXTERNAL CONTROL;
D O I
10.1109/TCBB.2016.2595577
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
External control of gene regulatory networks (GRNs) has received much attention in recent years. The aim is to find a series of actions to apply to a gene regulation system making it avoid its diseased states. In this work, we propose a novel method for controlling partially observable GRNs combining batch mode reinforcement learning (Batch RL) and TD(lambda) algorithms. Unlike the existing studies inferring a computational model from gene expression data, and obtaining a control policy over the constructed model, our idea is to interpret the time series gene expression data as a sequence of observations that the system produced, and obtain an approximate stochastic policy directly from the gene expression data without estimation of the internal states of the partially observable environment. Thereby, we get rid of the most time consuming phases of the existing studies, inferring a model and running the model for the control. Results show that our method is able to provide control solutions for regulation systems of several thousands of genes only in seconds, whereas existing studies cannot solve control problems of even a few dozens of genes. Results also show that our approximate stochastic policies are almost as good as the policies generated by the existing studies.
引用
收藏
页码:1214 / 1227
页数:14
相关论文
共 39 条
[1]  
[Anonymous], 1989, LEARNING DELAYED REW
[2]  
[Anonymous], 1994, Machine Learning, DOI DOI 10.1016/C2009-0-27542-8
[3]  
Bellman R. E., 1957, Dynamic programming. Princeton landmarks in mathematics
[4]  
Bernard A, 2005, PACIFIC SYMPOSIUM ON BIOCOMPUTING 2005, P459
[5]  
Bertsekas D. P., 1976, DYNAMIC PROGRAMMING
[6]   Molecular classification of cutaneous malignant melanoma by gene expression profiling [J].
Bittner, M ;
Meitzer, P ;
Chen, Y ;
Jiang, Y ;
Seftor, E ;
Hendrix, M ;
Radmacher, M ;
Simon, R ;
Yakhini, Z ;
Ben-Dor, A ;
Sampas, N ;
Dougherty, E ;
Wang, E ;
Marincola, F ;
Gooden, C ;
Lueders, J ;
Glatfelter, A ;
Pollock, P ;
Carpten, J ;
Gillanders, E ;
Leja, D ;
Dietrich, K ;
Beaudry, C ;
Berens, M ;
Alberts, D ;
Sondak, V ;
Hayward, N ;
Trent, J .
NATURE, 2000, 406 (6795) :536-540
[7]   Planning Interventions in Biological Networks [J].
Bryce, Daniel ;
Verdicchio, Michael ;
Kim, Seungchan .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2010, 1 (02)
[8]  
Busoniu L, 2010, AUTOM CONTROL ENG SE, P1, DOI 10.1201/9781439821091-f
[9]   External control in Markovian genetic regulatory networks: the imperfect information case [J].
Datta, A ;
Choudhary, A ;
Bittner, ML ;
Dougherty, ER .
BIOINFORMATICS, 2004, 20 (06) :924-930
[10]   External control in Markovian Genetic Regulatory Networks [J].
Datta, A ;
Choudhary, A ;
Bittner, ML ;
Dougherty, ER .
MACHINE LEARNING, 2003, 52 (1-2) :169-191