Automatic discovery of similarity relationships through Web mining

被引:42
作者
Roussinov, D
Zhao, JL
机构
[1] Arizona State Univ, Coll Business, SAIM, Sch Accountancy & Informat Management, Tempe, AZ 85287 USA
[2] Univ Arizona, Sch Business & Publ Adm, Dept MIS, Tucson, AZ 85721 USA
关键词
data mining; context sensitive similarity discovery; empirical study; group decision support systems; Internet; machine learning; organizational concept space; text clustering; Web mining;
D O I
10.1016/S0167-9236(02)00102-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This work demonstrates how the World Wide Web can be mined in a fully automated manner for discovering the semantic similarity relationships among the concepts surfaced during an electronic brainstorming session, and thus improving the accuracy of automated clustering meeting messages. Our novel Context Sensitive Similarity Discovery (CSSD) method takes advantage of the meeting context when selecting a subset of Web pages for data mining, and then conducts regular concept co-occurrence analysis within that subset. Our results have implications on reducing information overload in applications of text technologies such as email filtering, document retrieval, text summarization, and knowledge management. (C) 2002 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:149 / 166
页数:18
相关论文
共 32 条
[1]  
[Anonymous], 1979, Monte Carlo Methods, DOI DOI 10.1007/978-94-009-5819-7
[2]   AUTOMATIC CONCEPT CLASSIFICATION OF TEXT FROM ELECTRONIC MEETINGS [J].
CHEN, H ;
HSU, P ;
ORWIG, R ;
HOOPES, L ;
NUNAMAKER, JF .
COMMUNICATIONS OF THE ACM, 1994, 37 (10) :56-73
[3]   AUTOMATIC CONSTRUCTION OF NETWORKS OF CONCEPTS CHARACTERIZING DOCUMENT DATABASES [J].
CHEN, HC ;
LYNCH, KJ .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1992, 22 (05) :885-902
[4]   Internet categorization and search: A self-organizing approach [J].
Chen, HC ;
Schuffels, C ;
Orwig, R .
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 1996, 7 (01) :88-102
[5]  
Chen HC, 1998, J AM SOC INFORM SCI, V49, P206, DOI 10.1002/(SICI)1097-4571(199803)49:3<206::AID-ASI3>3.0.CO
[6]  
2-K
[7]   A parallel computing approach to creating engineering concept spaces for semantic retrieval: The Illinois Digital Library Initiative project [J].
Chen, HC ;
Schatz, B ;
Ng, T ;
Martinez, J ;
Kirchhoff, A ;
Lin, CT .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1996, 18 (08) :771-782
[8]   EFFECTS OF ANONYMITY AND EVALUATIVE TONE ON IDEA GENERATION IN COMPUTER-MEDIATED GROUPS [J].
CONNOLLY, T ;
JESSUP, LM ;
VALACICH, JS .
MANAGEMENT SCIENCE, 1990, 36 (06) :689-703
[9]  
Cooley R., 1997, P 9 IEEE INT C TOOLS
[10]   AN APPROACH TO THE AUTOMATIC CONSTRUCTION OF GLOBAL THESAURI [J].
CROUCH, CJ .
INFORMATION PROCESSING & MANAGEMENT, 1990, 26 (05) :629-640