Small sample learning algorithm based on novel hybrid class-labeling technique

被引:0
|
作者
Li M.-D. [1 ]
Shen Y. [1 ]
Zhang D.-P. [1 ]
Yin H.-B. [1 ]
机构
[1] Department of Signal and Information Processing, China Jiliang University, Hangzhou
来源
Shen, Ye (shenye1978@vip.sina.com) | 1600年 / Zhejiang University卷 / 50期
关键词
Computer-aided diagnosis (CAD); Hybrid class-labeling; Membership; Small sample learning;
D O I
10.3785/j.issn.1008-973X.2016.01.020
中图分类号
学科分类号
摘要
A small sample learning algorithm based on a novel hybrid class-labeling technique (HCLT) was proposed in order to address the learning problem resulting from the underrepresented labeled training set in computer-aided diagnosis(CAD). The abundant unlabeled samples were labeled by HCLT with three diverse class labeling schemes respectively from the view point of geometric similarity, probabilistic distribution and semantic concept. Only those unlabeled samples which get the unanimous labeling results from three different labeling schemes were added to the training set in order to enlarge the labeled training set. The memberships of pseudo-labeled samples were introduced to fuzzy support vector machine (FSVM) in order to reduce the adverse effects for learning performance resulting from the still existing labeling mistakes. The contributions of pseudo-labeled samples to learning task were determined by their memberships. Classification experiment results based on datasets in UCI show that the proposed algorithm can deal with the small sample learning problem. The algorithm has less mistakes and better classification performance compared with the other algorithms which adopt the single labeling scheme. © 2016, Zhejiang University. All right reserved.
引用
收藏
页码:137 / 143
页数:6
相关论文
共 20 条
  • [1] Shen Y., Li M.-D., Xia S.-R., Learning algorithm with non-balanced data for computer-aided diagnosis of breast cancer, Journal of Zhejiang University: Engineering Science, 47, 1, pp. 1-7, (2013)
  • [2] Gorgel P., Sertbas A., Ucan O.N., Computer-aided classification of breast masses in mammogram images based on spherical wavelet transform and support vector machines, Expert Systems, 32, 1, pp. 155-164, (2015)
  • [3] Dheeba J., Selvi S.T., Classification of malignant and benign microcalcification using SVM, Proceedings of ICETECT, pp. 686-690, (2011)
  • [4] Jeyakumar V., Kanagaraj B.R., A framework for medical image retrieval system using ant colony optimization and weighted relevance feedback, Journal of Medical Imaging and Health Informatics, 5, 7, pp. 1383-1389, (2015)
  • [5] Shen Y., Xia S.-R., Li M.-D., A survey on relevance feedback techniques in content-based medical image retrieval, Chinese Journal of Biomedical Engineering, 28, 1, pp. 128-136, (2009)
  • [6] Wu K., Yap K.H., Fuzzy SVM for content-based image retrieval: a pseudo-label support vector machine framework, IEEE Computational Intelligence Magazine, 1, 2, pp. 10-16, (2006)
  • [7] Zhou D., Bousquet O., Lal T.N., Et al., Learning with local and global consistency, Proceedings of NIPS, pp. 321-328, (2003)
  • [8] Wang F., Zhang C.-S., Label propagation through linear neighborhoods, IEEE Transactions on Knowledge and Data Engineering, 20, 1, pp. 55-66, (2008)
  • [9] Tu E., Yang J., Kasabov N., Et al., Posterior distribution learning (PDL): a novel supervised learning framework using unlabeled samples to improve classification performance, Neurocomputing, 157, pp. 173-186, (2015)
  • [10] Zhou Z.-H., Li M., Tri-training: exploiting unlabeled data using three classifier, IEEE Transactions on Knowledge and Data Engineering, 117, 11, pp. 1529-1541, (2005)