REDISCOVERING 50 YEARS OF DISCOVERIES IN SPEECH AND LANGUAGE PROCESSING: A SURVEY.

被引:0
|
作者
Mariani, Joseph [1 ]
Francopoulo, Gil [2 ]
Paroubek, Patrick [1 ]
Vernier, Frederic [1 ]
机构
[1] CNRS, LIMSI, Paris, France
[2] Tagmatica, Paris, France
来源
2017 20TH CONFERENCE OF THE ORIENTAL CHAPTER OF THE INTERNATIONAL COORDINATING COMMITTEE ON SPEECH DATABASES AND SPEECH I/O SYSTEMS AND ASSESSMENT (O-COCOSDA) | 2017年
关键词
Speech Processing; Natural Language Processing; Text Analytics; Bibliometrics; Scientometrics; Informetrics;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
We have created the NLP4NLP corpus to study the content of scientific publications in the field of speech and natural language processing. It contains articles published in 34 major conferences and journals in that field over a period of 50 years (1965-2015). comprising 65.000 documents. gathering 50.000 authors. including 325.000 references and representing approximately 270 million words. Most of these publications are in English. some are in French. German or Russian. Some are open access. others have been provided by the publishers. In order to constitute and analyze this corpus several tools have been used or developed. Some of them use Natural Language Processing methods that have been published in the corpus. hence its name. Numerous manual corrections were necessary. which demonstrated the importance of establishing standards for uniquely identifying authors. publications or resources. We have conducted various studies: evolution over time of the number of articles and authors. collaborations between authors. citations between papers and authors. evolution of research themes and identification of the authors who introduced them. measure of innovation and detection of epistemological ruptures. use of language resources. reuse of articles and plagiarism in the context of a global or comparative analysis between sources.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] Rediscovering 25 Years of Discoveries in Spoken Language Processing: A preliminary ISCA Archive Analysis.
    Mariani, J.
    Paroubek, P.
    Francopoulo, G.
    Delaborde, M.
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3370 - 3370
  • [2] Rediscovering 15 + 2 years of discoveries in language resources and evaluation
    Joseph Mariani
    Patrick Paroubek
    Gil Francopoulo
    Olivier Hamon
    Language Resources and Evaluation, 2016, 50 : 165 - 220
  • [3] Rediscovering 15+2 years of discoveries in language resources and evaluation
    Mariani, Joseph
    Paroubek, Patrick
    Francopoulo, Gil
    Hamon, Olivier
    LANGUAGE RESOURCES AND EVALUATION, 2016, 50 (02) : 165 - 220
  • [4] Rediscovering 15 Years of Discoveries in Language Resources and Evaluation: The LREC Anthology Analysis.
    Mariani, Joseph
    Paroubek, Patrick
    Francopoulo, Gil
    Hamon, Olivier
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 4632 - 4669
  • [5] Measuring Innovation in Speech and Language Processing Publications
    Mariani, Joseph
    Francopoulo, Gil
    Paroubek, Patrick
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 1890 - 1895
  • [6] Twenty-Five Years of Evolution in Speech and Language Processing
    Yu, Dong
    Gong, Yifan
    Picheny, Michael Alan
    Ramabhadran, Bhuvana
    Hakkani-Tur, Dilek
    Prasad, Rohit
    Zen, Heiga
    Skoglund, Jan
    Cernocky, Jan Honza
    Burget, Lukas
    Mohamed, Abdelrahman
    IEEE SIGNAL PROCESSING MAGAZINE, 2023, 40 (05) : 27 - 39
  • [7] Reuse and plagiarism in Speech and Natural Language Processing publications
    Mariani, Joseph
    Francopoulo, Gil
    Paroubek, Patrick
    INTERNATIONAL JOURNAL ON DIGITAL LIBRARIES, 2018, 19 (2-3) : 113 - 126
  • [8] A Critical Survey on the use of Fuzzy Sets in Speech and Natural Language Processing
    Carvalho, Joao P.
    Batista, Fernando
    Coheur, Luisa
    2012 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2012,
  • [9] Optimization Algorithms and Applications for Speech and Language Processing
    Wright, Stephen J.
    Kanevsky, Dimitri
    Deng, Li
    He, Xiaodong
    Heigold, Georg
    Li, Haizhou
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (11): : 2231 - 2243
  • [10] Speech and Language processing as assistive technologies
    McCoy, Kathleen F.
    Arnott, John L.
    Ferres, Leo
    Fried-Oken, Melanie
    Roark, Brian
    COMPUTER SPEECH AND LANGUAGE, 2013, 27 (06) : 1143 - 1146