Hybrid Method Based on Information Gain and Support Vector Machine for Gene Selection in Cancer Classification
Cited by: 81
Authors:
Gao, Lingyun [1];
Ye, Mingquan [1];
Lu, Xiaojie [1];
Huang, Daobin [1]
Affiliations:
[1] Wannan Med Coll, Sch Med Informat, Wuhu 241002, Peoples R China
Funding:
National Natural Science Foundation of China;
Keywords:
Gene selection;
Cancer classification;
Information gain;
Support vector machine;
Small sample size with high dimension;
CONGENITAL MUSCULAR-DYSTROPHY;
HEPSIN GENE;
EXPRESSION;
OPTIMIZATION;
MUTATIONS;
ALGORITHM;
VARIANTS;
INPP5K;
DOI:
10.1016/j.gpb.2017.08.002
Chinese Library Classification:
Q3 [Genetics];
Discipline classification codes:
071007 ;
090102 ;
Abstract:
It remains a great challenge to achieve sufficient cancer classification accuracy with the entire set of genes, due to the high dimensions, small sample size, and big noise of gene expression data. We thus proposed a hybrid gene selection method, Information Gain-Support Vector Machine (IG-SVM) in this study. IG was initially employed to filter irrelevant and redundant genes. Then, further removal of redundant genes was performed using SVM to eliminate the noise in the datasets more effectively. Finally, the informative genes selected by IG-SVM served as the input for the LIBSVM classifier. Compared to other related algorithms, IG-SVM showed the highest classification accuracy and superior performance as evaluated using five cancer gene expression datasets based on a few selected genes. As an example, IG-SVM achieved a classification accuracy of 90.32% for colon cancer, which is difficult to be accurately classified, only based on three genes including CSRP1, MYL9, and GUCA2B.
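The first stage described in the abstract, ranking genes by information gain and keeping the top ones, can be illustrated with a minimal sketch. The abstract does not say how expression values are discretized or how many genes pass the filter, so the median split, the function names, and the toy data below are all assumptions made for illustration. The SVM-based second stage and the LIBSVM classifier are not sketched here.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total) for n in Counter(labels).values())


def information_gain(values, labels):
    """IG of one gene after a median split (the split rule is an assumption)."""
    ordered = sorted(values)
    threshold = ordered[len(ordered) // 2]
    low = [y for x, y in zip(values, labels) if x < threshold]
    high = [y for x, y in zip(values, labels) if x >= threshold]
    # Weighted entropy of the two halves; empty halves are skipped.
    conditional = sum(len(part) / len(labels) * entropy(part) for part in (low, high) if part)
    return entropy(labels) - conditional


def rank_genes(expression, labels, top_k):
    """Return the names of the top_k genes ranked by information gain."""
    scores = {gene: information_gain(vals, labels) for gene, vals in expression.items()}
    return sorted(scores, key=scores.get, reverse=True)[:top_k]


# Toy data (hypothetical): 8 samples, 4 tumor followed by 4 normal.
labels = ["tumor"] * 4 + ["normal"] * 4
expression = {
    "gene_a": [9.1, 8.7, 9.4, 8.9, 2.0, 2.3, 1.8, 2.1],  # separates the classes
    "gene_b": [5.0, 1.0, 5.2, 1.1, 5.1, 0.9, 5.3, 1.2],  # unrelated to the class
    "gene_c": [7.0, 7.5, 7.2, 3.0, 3.1, 2.9, 3.2, 6.8],  # partially informative
}
selected = rank_genes(expression, labels, top_k=2)
```

In this toy example the filter keeps `gene_a` and `gene_c` and drops `gene_b`, whose median split carries no class information. A real pipeline would then pass the retained genes to a second, classifier-driven selection step.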
Pages: 389-395
Number of pages: 7
Related papers
44 items in total
[1] The guanylate cyclase-C signaling pathway is down-regulated in inflammatory bowel disease [J]. Brenna, Oystein; Bruland, Torunn; Furnes, Marianne W.; Granlund, Atle van Beelen; Drozdov, Ignat; Emgard, Johanna; Bronstad, Gunnar; Kidd, Mark; Sandvik, Arne K.; Gustafsson, Bjorn I. SCANDINAVIAN JOURNAL OF GASTROENTEROLOGY, 2015, 50 (10): 1241-1252

[2] A survey on feature selection methods [J]. Chandrashekar, Girish; Sahin, Ferat. COMPUTERS & ELECTRICAL ENGINEERING, 2014, 40 (01): 16-28

[3] Gene selection for cancer identification: a decision tree model empowered by particle swarm optimization algorithm [J]. Chen, Kun-Huang; Wang, Kung-Jeng; Tsai, Min-Lung; Wang, Kung-Min; Adrian, Angelia Melani; Cheng, Wei-Chung; Yang, Tzu-Sen; Teng, Nai-Chia; Tan, Kuo-Pin; Chang, Ku-Shang. BMC BIOINFORMATICS, 2014, 15

[4] The mitochondrial ADP/ATP carrier (SLC25 family): Pathological implications of its dysfunction [J]. Clemencon, Benjamin; Babot, Marion; Trezeguet, Veronique. MOLECULAR ASPECTS OF MEDICINE, 2013, 34 (2-3): 485-493

[5] SUPPORT-VECTOR NETWORKS [J]. CORTES, C; VAPNIK, V. MACHINE LEARNING, 1995, 20 (03): 273-297

[6] Analysis of genomic variation in lung adenocarcinoma patients revealed the critical role of PI3K complex [J]. Deng, Zhao min; Liu, Lin; Qiu, Wen hai; Zhang, Yong qun; Zhong, Hong yan; Liao, Ping; Wu, Yun hong. PEERJ, 2017, 5

[7] A two-stage gene selection scheme utilizing MRMR filter and GA wrapper [J]. El Akadi, Ali; Amine, Aouatif; El Ouardighi, Abdeljalil; Aboutajdine, Driss. KNOWLEDGE AND INFORMATION SYSTEMS, 2011, 26 (03): 487-500

[8] Development of a two-stage gene selection method that incorporates a novel hybrid approach using the cuckoo optimization algorithm and harmony search for cancer classification [J]. Elyasigomari, V.; Lee, D. A.; Screen, H. R. C.; Shaheed, M. H. JOURNAL OF BIOMEDICAL INFORMATICS, 2017, 67: 11-20

[9] Support vector machine classification and validation of cancer tissue samples using microarray expression data [J]. Furey, TS; Cristianini, N; Duffy, N; Bednarski, DW; Schummer, M; Haussler, D. BIOINFORMATICS, 2000, 16 (10): 906-914

[10] Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring [J]. Golub, TR; Slonim, DK; Tamayo, P; Huard, C; Gaasenbeek, M; Mesirov, JP; Coller, H; Loh, ML; Downing, JR; Caligiuri, MA; Bloomfield, CD; Lander, ES. SCIENCE, 1999, 286 (5439): 531-537
