Disclosure Standards for Social Media and Generative Artificial Intelligence Research: Toward Transparency and Replicability

被引:3
作者
Kostygina, Ganna [1 ,2 ]
Kim, Yoonsang [1 ]
Seeskin, Zachary [1 ]
Leclere, Felicia [1 ]
Emery, Sherry [1 ]
机构
[1] NORC Univ Chicago, Chicago, IL USA
[2] NORC Univ Chicago, Social Data Collaboratory, 55 East Monroe St,3165, Chicago, IL 60603 USA
来源
SOCIAL MEDIA + SOCIETY | 2023年 / 9卷 / 04期
基金
美国国家卫生研究院;
关键词
social data quality; reproducibility; reporting standards; scientific transparency; disclosure; BIG DATA; SCIENCE;
D O I
10.1177/20563051231216947
中图分类号
G2 [信息与知识传播];
学科分类号
05 ; 0503 ;
摘要
Social media dominate today's information ecosystem and provide valuable information for social research. Market researchers, social scientists, policymakers, government entities, public health researchers, and practitioners recognize the potential for social data to inspire innovation, support products and services, characterize public opinion, and guide decisions. The appeal of mining these rich datasets is clear. However, there is potential risk of data misuse, underscoring an equally huge and fundamental flaw in the research: there are no procedural standards and little transparency. Transparency across the processes of collecting and analyzing social media data is often limited due to proprietary algorithms. Spurious findings and biases introduced by artificial intelligence (AI) demonstrate the challenges this lack of transparency poses for research. Social media research remains a virtual "wild west," with no clear standards for reporting regarding data retrieval, preprocessing steps, analytic methods, or interpretation. Use of emerging generative AI technologies to augment social media analytics can undermine validity and replicability of findings, potentially turning this research into a "black box" enterprise. Clear guidance for social media analyses and reporting is needed to assure the quality of the resulting research. In this article, we propose criteria for evaluating the quality of studies using social media data, grounded in established scientific practice. We offer clear documentation guidelines to ensure that social data are used properly and transparently in research and applications. A checklist of disclosure elements to meet minimal reporting standards is proposed. These criteria will make it possible for scholars and practitioners to assess the quality, credibility, and comparability of research findings using digital data.
引用
收藏
页数:12
相关论文
共 65 条
[21]   "So what if ChatGPT wrote it?" Multidisciplinary perspectives on opportunities, challenges and implications of generative conversational AI for research, practice and policy [J].
Dwivedi, Yogesh K. ;
Kshetri, Nir ;
Hughes, Laurie ;
Slade, Emma Louise ;
Jeyaraj, Anand ;
Kar, Arpan Kumar ;
Baabdullah, Abdullah M. ;
Koohang, Alex ;
Raghavan, Vishnupriya ;
Ahuja, Manju ;
Albanna, Hanaa ;
Albashrawi, Mousa Ahmad ;
Al-Busaidi, Adil S. ;
Balakrishnan, Janarthanan ;
Barlette, Yves ;
Basu, Sriparna ;
Bose, Indranil ;
Brooks, Laurence ;
Buhalis, Dimitrios ;
Carter, Lemuria ;
Chowdhury, Soumyadeb ;
Crick, Tom ;
Cunningham, Scott W. ;
Davies, Gareth H. ;
Davison, Robert M. ;
De, Rahul ;
Dennehy, Denis ;
Duan, Yanqing ;
Dubey, Rameshwar ;
Dwivedi, Rohita ;
Edwards, John S. ;
Flavian, Carlos ;
Gauld, Robin ;
Grover, Varun ;
Hu, Mei-Chih ;
Janssen, Marijn ;
Jones, Paul ;
Junglas, Iris ;
Khorana, Sangeeta ;
Kraus, Sascha ;
Larsen, Kai R. ;
Latreille, Paul ;
Laumer, Sven ;
Malik, F. Tegwen ;
Mardani, Abbas ;
Mariani, Marcello ;
Mithas, Sunil ;
Mogaji, Emmanuel ;
Nord, Jeretta Horn ;
O'Connor, Siobhan .
INTERNATIONAL JOURNAL OF INFORMATION MANAGEMENT, 2023, 71
[22]  
Elmas T, 2023, Arxiv, DOI [arXiv:2304.03434, DOI 10.48550/ARXIV.2304.03434]
[23]   Are You Scared Yet? Evaluating Fear Appeal Messages in Tweets About the Tips Campaign [J].
Emery, Sherry L. ;
Szczypka, Glen ;
Abril, Eulalia P. ;
Kim, Yoonsang ;
Vera, Lisa .
JOURNAL OF COMMUNICATION, 2014, 64 (02) :278-295
[24]  
fatml.org, Fairness, Accountability, and Transparency in Machine Learning
[25]  
Fineberg H, 2020, Review: RBI's Report on Municipal Finances Highlights the Inadequate Revenues of ULBs, V2, DOI DOI 10.1162/99608F92.CB310198
[26]   Sustained Online Amplification of COVID-19 Elites in the United States [J].
Gallagher, Ryan J. ;
Doroshenko, Larissa ;
Shugars, Sarah ;
Lazer, David ;
Welles, Brooke Foucault .
SOCIAL MEDIA + SOCIETY, 2021, 7 (02)
[27]   Assessing the bias in samples of large online networks [J].
Gonzalez-Bailon, Sandra ;
Wang, Ning ;
Rivero, Alejandro ;
Borge-Holthoefer, Javier ;
Moreno, Yamir .
SOCIAL NETWORKS, 2014, 38 :16-27
[28]   Balancing privacy versus accuracy in research protocols [J].
Goroff, Daniel L. .
SCIENCE, 2015, 347 (6221) :479-480
[29]  
Graham M., 2013, Proceedings of ICWSM, P518
[30]   Artificial Intelligence and Ten Societal Megatrends: An Exploratory Study Using GPT-3 [J].
Haluza, Daniela ;
Jungwirth, David .
SYSTEMS, 2023, 11 (03)