Text mining in different languages

被引:0
|
作者
Lebart, L [1 ]
机构
[1] Ecole Natl Super Telecommun, CNRS, F-75013 Paris, France
来源
APPLIED STOCHASTIC MODELS AND DATA ANALYSIS | 1998年 / 14卷 / 04期
关键词
Text Mining; text categorization; language independent methods; discriminant analysis;
D O I
暂无
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
The purpose of Text Mining is to describe and explore textual data, to uncover structural traits, and proceed to predictions. The field of application concerns Information Retrieval, processing responses to open-ended questions in sample surveys as well as processing textual corpora of a more general nature. At the intersection of Corpora Linguistics and Exploratory Statistical Analysis, a series of language independent tools and methods can perform most of the previously mentioned tasks, including the assessment and validation of the obtained results, be it visualization or categorization. Multiple confusion matrices calculated on test-samples characterize the quality of the prediction as well as the structure of errors of prediction. In the case of multinational surveys and corpora, they allow us to proceed to comparisons among several countries, in spite of the very heterogeneous character of the basic information (texts in different languages). Copyright (C) 1998 John Wiley & Sons, Ltd.
引用
收藏
页码:323 / 334
页数:12
相关论文
共 50 条
  • [41] Computational Topology in Text Mining
    Wagner, Hubert
    Dlotko, Pawel
    Mrozek, Marian
    COMPUTATIONAL TOPOLOGY IN IMAGE CONTEXT (CTIC), 2012, 7309 : 68 - 78
  • [42] Text mining in a digital library
    Witten I.H.
    Don K.J.
    Dewsnip M.
    Tablan V.
    International Journal on Digital Libraries, 2004, 4 (1) : 56 - 59
  • [43] Text Mining of Highly Cited Publications in Data Mining
    Jayasekara, P. K.
    Abu, K. S.
    IEEE 5TH INTERNATIONAL SYMPOSIUM ON EMERGING TRENDS AND TECHNOLOGIES IN LIBRARIES AND INFORMATION SERVICES (ETTLIS 2018), 2018, : 128 - 130
  • [44] Legal aspects of text mining
    Truyens, Maarten
    Van Eecke, Patrick
    COMPUTER LAW & SECURITY REVIEW, 2014, 30 (02) : 153 - 170
  • [45] Extending Web Mining to Digital Forensics Text Mining
    Hicks, Chelsea
    Beebe, Nicole Lang
    Haliscak, Brandi
    AMCIS 2016 PROCEEDINGS, 2016,
  • [46] Text mining of bilingual parallel corpora with a measure of semantic similarity
    Lee, CH
    Yang, HC
    2001 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5: E-SYSTEMS AND E-MAN FOR CYBERNETICS IN CYBERSPACE, 2002, : 470 - 475
  • [47] Text Preprocessing for Text Mining in Organizational Research: Review and Recommendations
    Hickman, Louis
    Thapa, Stuti
    Tay, Louis
    Cao, Mengyang
    Srinivasan, Padmini
    ORGANIZATIONAL RESEARCH METHODS, 2022, 25 (01) : 114 - 146
  • [48] Technical evolution and prediction of blockchain based on different evolution patterns by text mining and bibliometric methods
    Zhang, Huiying
    Zhao, Runbo
    Yang, Zuguo
    INTERNATIONAL JOURNAL OF TECHNOLOGY MANAGEMENT, 2023, 93 (3-4) : 345 - 374
  • [49] Application of Text Mining in Detecting Evidence of Fraud in Text Documents
    Silva, Elcelina
    2017 12TH IBERIAN CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES (CISTI), 2017,
  • [50] One reservoir and different views on local development: text mining analyses for the Mucharz Reservoir, Poland
    Pirog, Danuta
    Wiejaczka, Lukasz
    Fidelus-Orzechowska, Joanna
    MISCELLANEA GEOGRAPHICA, 2024,