Data science in light of natural language processing: An overview

Cited by: 13
Authors
Zeroual, Imad [1 ]
Lakhouaja, Abdelhak [1 ]
Affiliations
[1] Mohamed First Univ, Fac Sci, Av Med 6 BP 717, Oujda 60000, Morocco
Source
PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING IN DATA SCIENCES (ICDS2017) | 2018 / Vol. 127
Keywords
Data science; Natural language processing; Data-driven approaches; Corpora; Machine learning;
DOI
10.1016/j.procs.2018.01.101
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The work of data scientists is essentially divided into three areas: collecting data, analyzing data, and inferring information from data. Each of these tasks requires specialized personnel, takes time, and costs money. The next, more demanding step is turning data into products. This field has therefore attracted the attention of many research groups in academia as well as industry. In recent decades, data-driven approaches have emerged and gained popularity because they require much less human effort. Natural Language Processing (NLP) is among the fields most strongly influenced by data. The growth of data is behind the performance improvement of most NLP applications, such as machine translation and automatic speech recognition. Consequently, many NLP applications are moving from rule-based systems and knowledge-based methods to data-driven approaches. However, data collected under undefined design criteria or in technically unsuitable forms will be useless. They will also be neglected if their size is insufficient to perform the required analysis and infer accurate information. The chief purpose of this overview is to shed some light on the vital role of data in various fields and to give a better understanding of data in light of NLP. Specifically, it describes what happens to data during its life cycle: the building, processing, analyzing, and exploring phases. (C) 2018 The Authors. Published by Elsevier B.V.
Pages: 82 - 91
Page count: 10
Related papers
50 in total
  • [1] Natural language processing in medicine: An overview
    Spyns, P
    METHODS OF INFORMATION IN MEDICINE, 1996, 35 (4-5) : 285 - 301
  • [2] Effectiveness of Recent Research Approaches in Natural Language Processing on Data Science-An Insight
    Shruthi, J.
    Swamy, Suma
    COMPUTATIONAL AND STATISTICAL METHODS IN INTELLIGENT SYSTEMS, 2019, 859 : 172 - 182
  • [3] AN HISTORICAL OVERVIEW OF NATURAL-LANGUAGE PROCESSING SYSTEMS THAT LEARN
    COLLIER, R
    ARTIFICIAL INTELLIGENCE REVIEW, 1994, 8 (01) : 17 - 54
  • [4] Data augmentation techniques in natural language processing
    Pellicer, Lucas Francisco Amaral Orosco
    Ferreira, Taynan Maier
    Costa, Anna Helena Reali
    APPLIED SOFT COMPUTING, 2023, 132
  • [5] An Overview of Natural Language Processing for Indonesian and Malay
    Jiang S.
    Li S.
    Fu S.
    Lin N.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2020, 33 (06) : 530 - 541
  • [6] Data science through natural language with ChatGPT's Code Interpreter
    Ahn, Sangzin
    TRANSLATIONAL AND CLINICAL PHARMACOLOGY, 2024, 32 (02) : 73 - 82
  • [7] Natural language processing data services for healthcare providers
    Yeung, Joshua Au
    Shek, Anthony
    Searle, Thomas
    Kraljevic, Zeljko
    Dinu, Vlad
    Ratas, Mart
    Al-Agil, Mohammad
    Foy, Aleksandra
    Rafferty, Barbara
    Oliynyk, Vitaliy
    Teo, James T.
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2024, 24 (01)
  • [8] Data augmentation approaches in natural language processing: A survey
    Li, Bohan
    Hou, Yutai
    Che, Wanxiang
    AI OPEN, 2022, 3 : 71 - 90
  • [9] Measuring Memberships in Collectives in Light of Developments in Cognitive Science and Natural-Language Processing
    Hannan, Michael T.
    SOCIOLOGICAL SCIENCE, 2022, 9 : 473 - 492
  • [10] Natural Language Processing in Game Studies Research: An Overview
    Zagal, Jose P.
    Tomuro, Noriko
    Shepitsen, Andriy
    SIMULATION & GAMING, 2012, 43 (03) : 356 - 373