Text mining in different languages

Authors
Lebart, L [1]
Affiliations
[1] Ecole Natl Super Telecommun, CNRS, F-75013 Paris, France
Source
APPLIED STOCHASTIC MODELS AND DATA ANALYSIS | 1998 / Vol. 14 / No. 04
Keywords
Text Mining; text categorization; language independent methods; discriminant analysis
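DOI
Not available
Chinese Library Classification (CLC) number
C93 [Management Science]; O22 [Operations Research]
Discipline classification code
070105; 12; 1201; 1202; 120202
Abstract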
The purpose of Text Mining is to describe and explore textual data, to uncover structural traits, and proceed to predictions. The field of application concerns Information Retrieval, processing responses to open-ended questions in sample surveys as well as processing textual corpora of a more general nature. At the intersection of Corpora Linguistics and Exploratory Statistical Analysis, a series of language independent tools and methods can perform most of the previously mentioned tasks, including the assessment and validation of the obtained results, be it visualization or categorization. Multiple confusion matrices calculated on test-samples characterize the quality of the prediction as well as the structure of errors of prediction. In the case of multinational surveys and corpora, they allow us to proceed to comparisons among several countries, in spite of the very heterogeneous character of the basic information (texts in different languages). Copyright (C) 1998 John Wiley & Sons, Ltd.
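As an illustration of the evaluation step the abstract mentions, the sketch below builds one confusion matrix per test sample and derives an accuracy figure from each. It is a minimal sketch, not the author's procedure: the function names, the country keys, and the category labels are hypothetical, and the data are made up purely to show the mechanics.

```python
from collections import Counter


def confusion_matrix(true_labels, predicted_labels):
    """Count (true, predicted) label pairs over one test sample.

    Returns the sorted label list and a square matrix where
    rows are true categories and columns are predicted categories.
    """
    if len(true_labels) != len(predicted_labels):
        raise ValueError("label sequences must have the same length")
    labels = sorted(set(true_labels) | set(predicted_labels))
    counts = Counter(zip(true_labels, predicted_labels))
    matrix = [[counts[(t, p)] for p in labels] for t in labels]
    return labels, matrix


def accuracy(matrix):
    """Share of test items on the diagonal (correctly categorized)."""
    total = sum(sum(row) for row in matrix)
    correct = sum(matrix[i][i] for i in range(len(matrix)))
    return correct / total if total else 0.0


# Hypothetical test samples, one per country: (true labels, predicted labels).
test_samples = {
    "country_A": (["pos", "neg", "pos", "neg"], ["pos", "neg", "neg", "neg"]),
    "country_B": (["pos", "pos", "neg", "neg"], ["pos", "pos", "neg", "pos"]),
}

results = {}
for country, (truth, predicted) in test_samples.items():
    labels, matrix = confusion_matrix(truth, predicted)
    results[country] = (labels, matrix, accuracy(matrix))
```

Because each matrix is built from category labels only, the same comparison can be made across samples whose underlying texts are in different languages; the off-diagonal cells show which categories are confused with which.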
Pages: 323-334
Page count: 12