Enhancing the government accounting information systems using social media information: An application of text mining and machine learning

被引:24
作者
Duan, Huijue Kelly [1 ]
Vasarhelyi, Miklos A. [2 ]
Codesso, Mauricio [3 ]
Alzamil, Zamil [4 ]
机构
[1] Sacred Heart Univ, Jack Welch Coll Business & Technol, 3135 Easton Turnpike, Fairfield, CT 06825 USA
[2] Rutgers State Univ, Rutgers Business Sch, 1 Washington Pk, Newark, NJ 07102 USA
[3] Northeastern Univ, DAmore McKim Sch Business, 319J Hayden Hall, Boston, MA 02115 USA
[4] Majmaah Univ, Comp Sci Dept, Al Majmaah 11952, Saudi Arabia
关键词
Social media; Text mining; Machine learning; Sentiment analysis; BALANCED SCORECARD; PERFORMANCE-MEASUREMENT; LOCAL-GOVERNMENTS; DECISION-MAKING; TWITTER; SENTIMENT; CITIZENS; COMMUNICATION; PREDICTION; INNOVATION;
D O I
10.1016/j.accinf.2022.100600
中图分类号
F [经济];
学科分类号
02 ;
摘要
This study demonstrates a way of bringing an innovative data source, social media information, to the government accounting information systems to support accountability to stakeholders and managerial decision-making. Future accounting and auditing processes will heavily rely on multiple forms of exogenous data. As an example of the techniques that could be used to generate this needed information, the study applies text mining techniques and machine learning algo-rithms to Twitter data. The information is developed as an alternative performance measure for NYC street cleanliness. It utilizes Naive Bayes, Random Forest, and XGBoost to classify the tweets, illustrates how to use the sampling method to solve the imbalanced class distribution issue, and uses VADER sentiment to derive the public opinion about street cleanliness. This study also ex-tends the research to another social media platform, Facebook, and finds that the incremental value is different between the two social media platforms. This data can then be linked to gov-ernment accounting information systems to evaluate costs and provide a better understanding of the efficiency and effectiveness of operations.
引用
收藏
页数:19
相关论文
共 99 条
[1]  
Alom Z, 2018, 2018 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM), P1191, DOI 10.1109/ASONAM.2018.8508495
[2]  
Alpaydin E, 2014, ADAPT COMPUT MACH LE, P115
[3]  
[Anonymous], 2018, SEKE
[4]  
Arschin D, 2022, FOX 5 NEW YORK
[5]  
Asur S., 2010, Proceedings 2010 IEEE/ACM International Conference on Web Intelligence-Intelligent Agent Technology (WI-IAT), P492, DOI 10.1109/WI-IAT.2010.63
[6]   Hybrid N-gram model using Naive Bayes for classification of political sentiments on Twitter [J].
Awwalu, Jamilu ;
Abu Bakar, Azuraliza ;
Yaakub, Mohd Ridzwan .
NEURAL COMPUTING & APPLICATIONS, 2019, 31 (12) :9207-9220
[7]   A hybrid classification method for Twitter spam detection based on differential evolution and random forest [J].
Bazzaz Abkenar, Sepideh ;
Mahdipour, Ebrahim ;
Jameii, Seyed Mahdi ;
Haghi Kashani, Mostafa .
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (21)
[8]  
Berner M., 2004, State and Local Government Review, V36, P140, DOI [DOI 10.1177/0160323X0403600205, 10.1177/0160323X0403600205]
[9]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[10]   Twitter mood predicts the stock market [J].
Bollen, Johan ;
Mao, Huina ;
Zeng, Xiaojun .
JOURNAL OF COMPUTATIONAL SCIENCE, 2011, 2 (01) :1-8