Detection of Sociolinguistic Features in Digital Social Networks for the Detection of Communities

被引：0

作者：

Edwin Puertas

Luis Gabriel Moreno-Sandoval

Javier Redondo

Jorge Andres Alvarado-Valencia

Alexandra Pomares-Quimbaya

机构：

[1] Universidad Tecnologica de Bolivar,Faculty of Engineering, Department of Engineering

[2] Pontificia Universidad Javeriana,Faculty of Engineering, Engineering School

[3] Pontificia Universidad Javeriana,Department of Communication and Language

来源：

Cognitive Computation | 2021年 / 13卷

关键词：

Sociolinguistic; Community discovery; Natural language processing; Social networks; Community detection.;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

The emergence of digital social networks has transformed society, social groups, and institutions in terms of the communication and expression of their opinions. Determining how language variations allow the detection of communities, together with the relevance of specific vocabulary (proposed by the National Council of Accreditation of Colombia (Consejo Nacional de Acreditación - CNA) to determine the quality evaluation parameters for universities in Colombia) in digital assemblages could lead to a better understanding of their dynamics and social foundations, thus resulting in better communication policies and intervention where necessary. The approach presented in this paper intends to determine what are the semantic spaces (sociolinguistic features) shared by social groups in digital social networks. It includes five layers based on Design Science Research, which are integrated with Natural Language Processing techniques (NLP), Computational Linguistics (CL), and Artificial Intelligence (AI). The approach is validated through a case study wherein the semantic values of a series of “Twitter” institutional accounts belonging to Colombian Universities are analyzed in terms of the 12 quality factors established by CNA. In addition, the topics and the sociolect used by different actors in the university communities are also analyzed. The current approach allows determining the sociolinguistic features of social groups in digital social networks. Its application allows detecting the words or concepts to which each actor of a social group (university) gives more importance in terms of vocabulary.

引用

页码：518 / 537

页数：19

共 55 条

[1]

Dumbill E(2013)A revolution that will transform how we live, work, and think: An interview with the authors of big data Big data. 1 73-7

[2]

Scott J(2011)Social network analysis: developments, advances, and prospects Social network analysis and mining. 1 21-6

[3]

Zeinab K(2016)An introduction to sociolinguistics International Journal of Society, Culture & Language. 4 134-40

[4]

Khalil M(2016)A Seza Doğruöz, Carolyn P Rosé, and Franciska de Jong Computational sociolinguistics: A survey. Computational linguistics. 42 537-93

[5]

Nguyen D(2019)An analytical study of information extraction from unstructured and multidimensional big data Journal of Big Data. 6 91-221

[6]

K Adnan(2004)Semantic variation in idiolect and sociolect: Corpus linguistic evidence from literary texts Computers and the Humanities. 38 207-417

[7]

R Akbar(2012)Detection of groups in non-structured data Procedia Computer Science. 12 412-733

[8]

Louwerse MM(2018)Semi-supervised learning for big social data analysis Neurocomputing. 275 1662-6105

[9]

Paradis RD(2019)An enhanced graph-based semi-supervised learning algorithm to detect fake users on twitter The Journal of Supercomputing. 75 6085-50

[10]

Davenport D(2019)Embedding both finite and infinite communities on graphs [application notes] IEEE Computational Intelligence Magazine. 14 39-72

← 1 2 3 4 5 6 →