REDISCOVERING 50 YEARS OF DISCOVERIES IN SPEECH AND LANGUAGE PROCESSING: A SURVEY.

被引:0
|
作者
Mariani, Joseph [1 ]
Francopoulo, Gil [2 ]
Paroubek, Patrick [1 ]
Vernier, Frederic [1 ]
机构
[1] CNRS, LIMSI, Paris, France
[2] Tagmatica, Paris, France
来源
2017 20TH CONFERENCE OF THE ORIENTAL CHAPTER OF THE INTERNATIONAL COORDINATING COMMITTEE ON SPEECH DATABASES AND SPEECH I/O SYSTEMS AND ASSESSMENT (O-COCOSDA) | 2017年
关键词
Speech Processing; Natural Language Processing; Text Analytics; Bibliometrics; Scientometrics; Informetrics;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
We have created the NLP4NLP corpus to study the content of scientific publications in the field of speech and natural language processing. It contains articles published in 34 major conferences and journals in that field over a period of 50 years (1965-2015). comprising 65.000 documents. gathering 50.000 authors. including 325.000 references and representing approximately 270 million words. Most of these publications are in English. some are in French. German or Russian. Some are open access. others have been provided by the publishers. In order to constitute and analyze this corpus several tools have been used or developed. Some of them use Natural Language Processing methods that have been published in the corpus. hence its name. Numerous manual corrections were necessary. which demonstrated the importance of establishing standards for uniquely identifying authors. publications or resources. We have conducted various studies: evolution over time of the number of articles and authors. collaborations between authors. citations between papers and authors. evolution of research themes and identification of the authors who introduced them. measure of innovation and detection of epistemological ruptures. use of language resources. reuse of articles and plagiarism in the context of a global or comparative analysis between sources.
引用
收藏
页数:23
相关论文
共 50 条
  • [21] State of the art in statistical methods for language and speech processing
    Bellegarda, Jerome R.
    Monz, Christof
    COMPUTER SPEECH AND LANGUAGE, 2016, 35 : 163 - 184
  • [22] Hate speech detection in the Bengali language: a comprehensive survey
    Al Maruf, Abdullah
    Abidin, Ahmad Jainul
    Haque, Md. Mahmudul
    Jiyad, Zakaria Masud
    Golder, Aditi
    Alubady, Raaid
    Aung, Zeyar
    JOURNAL OF BIG DATA, 2024, 11 (01)
  • [23] The promise of NLP and speech processing technologies in language assessment
    Chapelle, Carol A.
    Chung, Yoo-Ree
    LANGUAGE TESTING, 2010, 27 (03) : 301 - 315
  • [24] The effect of second language immersion and musical experiences on second language speech processing and general auditory processing
    Wang, Cuicui
    Flemming, Krystal
    Wang, Yanpei
    Putkinen, Vesa
    Tervaniemi, Mari
    Lammert, Jessica
    Tao, Sha
    Joanisse, Marc F.
    JOURNAL OF NEUROLINGUISTICS, 2023, 68
  • [25] Natural language processing in the patent domain: a survey
    Jiang, Lekang
    Goetz, Stephan M.
    Artificial Intelligence Review, 2025, 58 (07)
  • [26] Quantum Natural Language Processing: A Comprehensive Survey
    Varmantchaonala, Charles M.
    Fendji, Jean Louis K. E.
    Schoning, Julius
    Atemkeng, Marcellin
    IEEE ACCESS, 2024, 12 : 99578 - 99598
  • [27] i-Vectors in speech processing applications: a survey
    Verma, Pulkit
    Das, Pradip
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2015, 18 (04) : 529 - 546
  • [28] Survey on Spell Checker for Tamil Language Using Natural Language Processing
    Selvaraj, P. A.
    Jagadeesan, M.
    Harikrishnan, M.
    Vijayapriya, R.
    Jayasudha, K.
    JOURNAL OF PHARMACEUTICAL NEGATIVE RESULTS, 2022, 13 : 170 - 174
  • [29] Automated Handwriting Recognition and Speech Synthesizer for Indigenous Language Processing
    Alqaralleh, Bassam A. Y.
    Aldhaban, Fahad
    A-Matarneh, Feras Mohammed
    AlQaralleh, Esam A.
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 72 (02): : 3913 - 3927
  • [30] FarSpeech: Arabic Natural Language Processing for Live Arabic Speech
    Eldesouki, Mohamed
    Gopee, Naassih
    Ali, Ahmed
    Darwish, Kareem
    INTERSPEECH 2019, 2019, : 2372 - 2373