Introduction to the Special Issue on Language in Social Media: Exploiting Discourse and Other Contextual Information

Cited by: 10
Authors
Benamara, Farah [1]
Inkpen, Diana [2]
Taboada, Maite [3]
Affiliations
[1] Paul Sabatier Univ, IRIT, Univ Toulouse, Toulouse, France
[2] Univ Ottawa, Sch Elect Engn & Comp Sci, Ottawa, ON, Canada
[3] Simon Fraser Univ, Dept Linguist, Burnaby, BC, Canada
Keywords
LOCAL COHERENCE; FRAMEWORK; MODELS;
DOI
10.1162/coli_a_00333
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Social media content is changing the way people interact with each other and share information, personal messages, and opinions about situations, objects, and past experiences. Most social media texts are short online conversational posts or comments that do not contain enough information for natural language processing (NLP) tools, as they are often accompanied by non-linguistic contextual information, including meta-data (e.g., the user's profile, the social network of the user, and their interactions with other users). Exploiting such different types of context and their interactions makes the automatic processing of social media texts a challenging research task. Indeed, simply applying traditional text mining tools is clearly sub-optimal, as, typically, these tools take into account neither the interactive dimension nor the particular nature of this data, which shares properties with both spoken and written language. This special issue contributes to a deeper understanding of the role of these interactions to process social media data from a new perspective in discourse interpretation. This introduction first provides the necessary background to understand what context is from both the linguistic and computational linguistic perspectives, then presents the most recent context-based approaches to NLP for social media. We conclude with an overview of the papers accepted in this special issue, highlighting what we believe are the future directions in processing social media texts.
Pages: 663-681
Number of pages: 19