R/DWD: distance-weighted discrimination for classification, visualization and batch adjustment

Cited by: 33
Authors
Huang, Hanwen [1, 3, 4]
Lu, Xiaosun [1, 3]
Liu, Yufeng [1, 2]
Haaland, Perry [3]
Marron, J. S. [1]
Affiliations
[1] Univ N Carolina, Dept Stat & Operat Res, Chapel Hill, NC 27599 USA
[2] Univ N Carolina, Carolina Ctr Genome Sci, Chapel Hill, NC 27599 USA
[3] BD Technol, Res Triangle Pk, NC 27709 USA
[4] Univ Texas Hlth Sci Ctr, Ctr Clin & Translat Sci, Houston, TX 77030 USA
DOI
10.1093/bioinformatics/bts096
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
R/DWD is an extensible R package for classification, built on distance-weighted discrimination (DWD), a recently developed classification method. DWD is related to the support vector machine and has been shown to outperform it in settings that are fundamental to bioinformatics, such as very high-dimensional data. DWD has proven useful for several core bioinformatics tasks, including classification, data visualization and the removal of biases such as batch effects. Earlier DWD implementations, however, relied on Matlab, which requires a paid license. The major contribution of R/DWD is an implementation written entirely in R, which can be used without any licensing or software purchase. R/DWD also provides efficient solvers for second-order cone programming and quadratic programming.
Pages: 1182-1183
Page count: 2