Using #ActuallyAutistic on Twitter for Precision Diagnosis of Autism Spectrum Disorder: Machine Learning Study

Cited by: 2
Authors
Jaiswal, Aditi [1,2]
Washington, Peter [1]
Affiliations
[1] Univ Hawaii Manoa, Dept Informat & Comp Sci, Honolulu, HI 96822 USA
[2] Univ Hawaii Manoa, Dept Informat & Comp Sci, Room 312C,1680 East West Rd, Honolulu, HI 96822 USA
Funding
US National Science Foundation;
Keywords
ASD; autism spectrum disorder; machine learning; natural language processing; public health; sentiment analysis; social media analysis; Twitter;
DOI
10.2196/52660
Chinese Library Classification number
R19 [Health care organizations and services (health services administration)];
Abstract
Background: The increasing use of social media platforms has given rise to an unprecedented surge in user-generated content, with millions of individuals publicly sharing their thoughts, experiences, and health-related information. Social media can serve as a useful means to study and understand public health. Twitter (subsequently rebranded as "X") is one such social media platform that has proven to be a valuable source of rich information for both the general public and health officials. We conducted the first study applying Twitter data mining to autism screening.
Objective: This study used Twitter as the primary source of data to study the behavioral characteristics and real-time emotional projections of individuals identifying with autism spectrum disorder (ASD). We aimed to improve the rigor of ASD analytics research by using the digital footprint of an individual to study the linguistic patterns of individuals with ASD.
Methods: We developed a machine learning model to distinguish individuals with autism from their neurotypical peers based on the textual patterns in their public communications on Twitter. We collected 6,515,470 tweets from users who self-identified with autism using "#ActuallyAutistic" and from a separate control group to identify linguistic markers associated with ASD traits. To construct the data set, we targeted English-language tweets using the search query "#ActuallyAutistic" posted from January 1, 2014, to December 31, 2022. From these tweets, we identified unique users who used keywords such as "autism" OR "autistic" OR "neurodiverse" in their profile description and collected all the tweets from their timeline. To build the control group data set, we formulated a search query excluding the hashtag, "-#ActuallyAutistic," and collected 1000 tweets per day during the same time period. We trained a word2vec model and an attention-based, bidirectional long short-term memory model to validate the performance of per-tweet and per-profile classification models. We also illustrate the utility of the data set through common natural language processing tasks such as sentiment analysis and topic modeling.
Results: Our tweet classifier reached an accuracy of 73%, an area under the receiver operating characteristic curve score of 0.728, and an F1-score of 0.71 using word2vec representations fed into a logistic regression model, while the user profile classifier achieved an area under the receiver operating characteristic curve score of 0.78 and an F1-score of 0.805 using an attention-based, bidirectional long short-term memory model. This is a promising start, demonstrating the potential for effective digital phenotyping studies and large-scale intervention using text data mined from social media.
Conclusions: Textual differences in social media communications can help researchers and clinicians conduct symptomatology studies in natural settings.
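The per-tweet setup named in the abstract (word embeddings fed into a logistic regression model, scored by accuracy, F1-score, and area under the receiver operating characteristic curve) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes mean-pooling of token vectors as the tweet representation, which the abstract does not specify, and it uses a hypothetical two-dimensional `TOY_VECTORS` table in place of a trained word2vec model. The tweets, labels, and all function names are invented for illustration.

```python
import math

# Hypothetical 2-dimensional word vectors standing in for a trained word2vec model.
TOY_VECTORS = {
    "routine": [0.9, 0.1], "sensory": [0.8, 0.2], "overload": [0.7, 0.3],
    "game": [0.1, 0.9], "weather": [0.2, 0.8], "traffic": [0.3, 0.7],
}

def embed(tweet, vectors=TOY_VECTORS, dim=2):
    """Mean-pool the vectors of known tokens (an assumed pooling choice)."""
    known = [vectors[t] for t in tweet.lower().split() if t in vectors]
    if not known:
        return [0.0] * dim
    return [sum(v[i] for v in known) / len(known) for i in range(dim)]

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def train_logreg(X, y, lr=0.5, epochs=500):
    """Fit logistic regression with plain batch gradient descent."""
    w, b = [0.0] * len(X[0]), 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * len(w), 0.0
        for xi, yi in zip(X, y):
            err = sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + b) - yi
            grad_w = [g + err * xj for g, xj in zip(grad_w, xi)]
            grad_b += err
        w = [wj - lr * g / len(X) for wj, g in zip(w, grad_w)]
        b -= lr * grad_b / len(X)
    return w, b

def predict_proba(X, w, b):
    return [sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + b) for xi in X]

def accuracy(y_true, y_pred):
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

def f1_score(y_true, y_pred):
    tp = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))
    fp = sum(t == 0 and p == 1 for t, p in zip(y_true, y_pred))
    fn = sum(t == 1 and p == 0 for t, p in zip(y_true, y_pred))
    return 2 * tp / (2 * tp + fp + fn) if tp else 0.0

def auroc(y_true, scores):
    """Chance that a random positive outscores a random negative (ties count half)."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

# Invented toy data: 1 = hashtag group, 0 = control group.
tweets = ["sensory overload routine", "routine sensory",
          "weather traffic", "game weather"]
labels = [1, 1, 0, 0]

X = [embed(t) for t in tweets]
w, b = train_logreg(X, labels)
probs = predict_proba(X, w, b)
preds = [int(p >= 0.5) for p in probs]

# Scored on the training tweets only to keep the sketch short;
# a real evaluation would use held-out data.
print(accuracy(labels, preds), f1_score(labels, preds), auroc(labels, probs))
```

In practice the embedding table would come from a word2vec model trained on the tweet corpus, and a library classifier would usually replace the hand-written gradient descent. The metric functions are written out only to make explicit what the reported scores measure.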
Pages: 13