Concept drift and the importance of examples

被引:0
作者
Klinkenberg, R [1 ]
Rüping, S [1 ]
机构
[1] Univ Dortmund, Dept Comp Sci, Artificial Intelligence Unit LS VIII, D-44221 Dortmund, Germany
来源
TEXT MINING: THEORETICAL ASPECTS AND APPLICATIONS | 2003年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For many learning tasks where data is collected over an extended period of time, its underlying distribution is likely to change. A typical example is information filtering, i.e. the adaptive classification of documents with respect to a particular user interest. Both the interest of the user and the document content change over time. A filtering system should be able to adapt to such concept changes. Examples may be important for different reasons. In case of a drifting concept, the importance of an example obviously depends on its age. If a user is interested in several topics, these may be of different importance to her/him. Hence the importance of an example is influenced by the topic it belongs to. Of course these two effects may cumulate. In this paper we model the importance of an example by weighting its importance for the final decision function. This paper investigates how to handle these two effects with support vector machines extending the approach of [12], which showed that drifting concepts can be learned effectively and efficiently with little parameterization. Several approaches addressing the different effects are compared in experiments on real-world text data.
引用
收藏
页码:55 / 77
页数:23
相关论文
共 29 条
[1]  
ALLAN J, 1996, P 19 ANN INT ACM SIG, P270
[2]  
[Anonymous], MYSVM MANUAL
[3]  
[Anonymous], 1999, P 16 INT C MACH LEAR
[4]  
[Anonymous], 2005, EUR C MACH LEARN
[5]  
Balabanovic M., 1997, Proceedings of the First International Conference on Autonomous Agents, P378, DOI 10.1145/267658.267744
[6]  
Blum A., 1998, Proceedings of the Eleventh Annual Conference on Computational Learning Theory, P92, DOI 10.1145/279943.279962
[7]  
Fischer S., 2002, CI13602 U DORTM COLL
[8]  
HELMBOLD DP, 1994, MACH LEARN, V14, P27, DOI 10.1007/BF00993161
[9]  
JAOCHIMS T, 2001, THESIS U DORTMUND GE
[10]  
JOACHIMS T, 2000, P 17 INT C MACH LEAR, P431