A LDA-Based Social Media Data Mining Framework for Plastic Circular Economy

被引:10
作者
Xue, Yangyimin [1 ]
Kambhampati, Chandrasekhar [1 ]
Cheng, Yongqiang [1 ]
Mishra, Nishikant [2 ]
Wulandhari, Nur [2 ]
Deutz, Pauline [3 ]
机构
[1] Univ Hull, Dept Comp Sci, Kingston Upon Hull HU6 7RX, England
[2] Univ Hull, Sch Business, Kingston Upon Hull HU6 7RX, England
[3] Univ Hull, Dept Geog Geol & Environm, Kingston Upon Hull HU6 7RX, England
关键词
LDA; Model visualisation; Sentiment analysis; Comments' classification;
D O I
10.1007/s44196-023-00375-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The mass production of plastic waste has caused an urgent worldwide public health crisis. Although government policies and industrial innovation are the driving forces to meet this challenge, trying to understand public attitudes may improve the efficiency of this process. Social media has become the main ways for the public to obtain information and express opinions and feelings. This motivated us to mine the perceptions and behavioral responses towards plastic usage using social media data. In this paper, we proposed a framework for data collection and analysis based on mainstream media in the UK to obtain public opinions on plastics. An unsupervised machine learning model based on Latent Dirichlet Allocation (LDA) has been employed to analyse and cluster the topics to deal with the lack of annotation of the data contents. An additional dictionary method was then proposed to evaluate the sentiment of the comments. The framework also provides tools to visualise the model and results to stimulate insightful understandings. We validated the framework's effectiveness by applying it to analyse three mainstream social media, where 6 first-level topic categories and 13 second-level topic categories from the comment texts related to plastics have been identified. The results show that public sentiment towards plastic products is generally stable. The spatiotemporal distribution of each topic's sentiment is highly correlated with the number of occurrences.
引用
收藏
页数:14
相关论文
共 27 条
[1]  
Bakshi RK, 2016, PROCEEDINGS OF THE 10TH INDIACOM - 2016 3RD INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT, P452
[2]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[3]   Positive Returns and Equilibrium: Simultaneous Feedback Between Public Opinion and Social Policy [J].
Breznau, Nate .
POLICY STUDIES JOURNAL, 2017, 45 (04) :583-612
[4]  
Chowdhary KR1442, 2020, Natural language processing. Fundamentals of artificial intelligence, P603, DOI DOI 10.1007/978-81-322-3972-7_19
[5]   Public attitudes towards plastics [J].
Dilkes-Hoffman, Leela Sarena ;
Pratt, Steven ;
Laycock, Bronwyn ;
Ashworth, Peta ;
Lant, Paul Andrew .
RESOURCES CONSERVATION AND RECYCLING, 2019, 147 :227-235
[6]   Deep Learning for Aspect-Based Sentiment Analysis: A Comparative Review [J].
Do, Hai Ha ;
Prasad, P. W. C. ;
Maag, Angelika ;
Alsadoon, Abeer .
EXPERT SYSTEMS WITH APPLICATIONS, 2019, 118 :272-299
[7]  
Ellen MacArthur Foundation, 2015, GROWTH CIRC EC VIS C
[8]   Social interaction via new social media: (How) can interactions on Twitter affect effectual thinking and behavior? [J].
Fischer, Eileen ;
Reuber, A. Rebecca .
JOURNAL OF BUSINESS VENTURING, 2011, 26 (01) :1-18
[9]   A survey on classification techniques for opinion mining and sentiment analysis [J].
Hemmatian, Fatemeh ;
Sohrabi, Mohammad Karim .
ARTIFICIAL INTELLIGENCE REVIEW, 2019, 52 (03) :1495-1545
[10]  
Hidayatullah Ahmad Fathan, 2019, IOP Conference Series: Materials Science and Engineering, V482, DOI 10.1088/1757-899X/482/1/012033