Corpus of digital interactions: systematization of techniques to collect data on WhatsApp

被引:3
|
作者
Cantamutto, Lucia [1 ]
Delfa, Cristina Vela [2 ]
机构
[1] Univ Nacl Rio Negro, CONICET, CIEDIS, Viedma, Argentina
[2] Univ Valladolid, Segovia, Spain
关键词
digital discourse; corpus linguistics; instant messaging; digital interaction;
D O I
10.7764/cdi.54.53165
中图分类号
G2 [信息与知识传播];
学科分类号
05 ; 0503 ;
摘要
The collection of datasets from real interactions is an unavoidable step in many research works aiming to understand language use. In the field of digital discourse analysis, data collection is complex due to the fast-paced changes in the applications and the ethical decisions involved. This work has two goals. First, we seek to show an overview of the literature on datasets of digital exchanges by WhatsApp. Then, we aim to systematize different sampling techniques used in previous research. We thus proceeded by applying content analysis to 100 research articles and theses retrieved from open access portals. We conducted a descriptive analysis that included the amount of data collected, the technique employed in the collection of the data, the method used to contact participants, and the online access to the linguistic corpora, among other variables. The results show the existence of some corpora annotated and available in languages other than Spanish. In addition, most of the literature shows a combination of different techniques to collect a wide set of linguistic and multimodal data. Then, we systematize the main methodological alternatives for data collection from digital interactions by WhatsApp, with the participant observation method standing out.
引用
收藏
页码:117 / 139
页数:23
相关论文
共 50 条
  • [1] ChatDashboard: A Framework to collect, link, and process donated WhatsApp Chat Log Data
    Kohne, Julian
    Montag, Christian
    BEHAVIOR RESEARCH METHODS, 2024, 56 (04) : 3658 - 3684
  • [2] From the conversation to the corpus, or how to collect and archive spoken language data
    Gocol, Damian
    Zasko-Zielinska, Monika
    Majewska-Tworek, Anna
    Sleziak, Marta
    Tworek, Artur
    WROCLAWSKI ROCZNIK HISTORII MOWIONEJ, 2024, 14 : 306 - 311
  • [3] Using personal digital assistants to collect survey data
    Nusser, SM
    Thompson, DM
    DeLozier, GS
    AMERICAN STATISTICAL ASSOCIATION - 1996 PROCEEDINGS OF THE SECTION ON SURVEY RESEARCH METHODS, VOLS I AND II, 1996, : 780 - 785
  • [4] Using Digital Workbooks to Collect Design Process Data
    Carberry, Adam R.
    Hynes, Morgan M.
    Danahy, Ethan E.
    2013 ASEE ANNUAL CONFERENCE, 2013,
  • [5] Sociolinguistic Corpus of WhatsApp Chats in Spanish among College Students - Data Paper
    Dorantes, Alejandro
    Sierra, Gerardo
    Donohue Perez, Yamin
    Bel-Enguix, Gemma
    Jasso Rosales, Monica
    NATURAL LANGUAGE PROCESSING FOR SOCIAL MEDIA (AFNLP SIG SOCIALNLP), 2018, : 1 - 6
  • [6] Web Scraping Techniques to Collect Weather Data in South Sumatera
    Fatmasari
    Kunang, Yesi Novaria
    Purnamasari, Susan Dian
    2018 INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND COMPUTER SCIENCE (ICECOS), 2018, : 385 - 389
  • [7] Using personal digital assistants to collect wildlife field data
    Waddle, JH
    Rice, KG
    Percival, HF
    WILDLIFE SOCIETY BULLETIN, 2003, 31 (01) : 306 - 308
  • [8] The use of digital technologies to collect patient data in outcomes research
    Byrom, Bill
    Row, Bill
    JOURNAL OF COMPARATIVE EFFECTIVENESS RESEARCH, 2017, 6 (04) : 275 - 277
  • [9] Involving Elderly Users in Design: Techniques to Collect Preferences for Interactive Digital Television
    Spagnolli, Anna
    Gamberini, Luciano
    Ibanez, Francisco
    Fabregat, Maria Elena
    Debelic, Tijana
    Orso, Valeria
    ANNUAL REVIEW OF CYBERTHERAPY AND TELEMEDICINE, 2012, 10 : 233 - 237
  • [10] Computer Vision Techniques to Collect Helmet-Wearing Data on Cyclists
    Li, Jinling
    Hajimirsadeghi, Hossein
    Zaki, Mohamed H.
    Mori, Greg
    Sayed, Tarek
    TRANSPORTATION RESEARCH RECORD, 2014, (2468) : 1 - 10