ICON: Instagram Profile Classification Using Image and Natural Language Processing Methods

被引:3
作者
Guven, Ebu Yusuf [1 ]
Boyaci, Ali [2 ]
Saritemur, Fatma Nur [1 ]
Turk, Zehra [1 ]
Sutcu, Gizem [1 ]
Turna, Ozgur Can [1 ]
机构
[1] Istanbul Univ Cerrahpasa, Dept Comp Engn, TR-34320 Istanbul, Turkiye
[2] Istanbul Ticaret Univ, Dept Comp Engn, TR-34378 Istanbul, Turkiye
关键词
Image processing; natural language processing (NLP); profile and comment analysis; social media analytics;
D O I
10.1109/TCSS.2023.3275428
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The use of social media has grown significantly, and businesses are now using these platforms to promote their products and services. To do this, companies have created business accounts on social networks. However, social media platforms can also be a breeding ground for unwanted behaviors such as cyberbullying, sexual content, and promotional comments. To address this issue, a study was conducted to create a system that could classify public accounts on Instagram by analyzing comments, profile pictures, bios, and posts shared by users with business accounts. First, a crawler was developed, and data were collected using this crawler and then anonymized. Next, the collected data were processed using natural language processing (NLP) techniques for text and image processing methods for images to extract features and create a dataset. Nearly 10 000 profiles and 30 000 comments from public accounts were manually tagged to create the classification model. The final model had an accuracy rate of 95% on the dataset, allowing for the effective identification of different types of business accounts on Instagram.
引用
收藏
页码:2776 / 2783
页数:8
相关论文
共 21 条
[1]   Instagram Fake and Automated Account Detection [J].
Akyon, Fatih Cagatay ;
Kalfaoglu, M. Esat .
2019 INNOVATIONS IN INTELLIGENT SYSTEMS AND APPLICATIONS CONFERENCE (ASYU), 2019, :519-525
[2]   Fake news detection in multiple platforms and languages [J].
Arruda Faustini, Pedro Henrique ;
Covoes, Thiago Ferreira .
EXPERT SYSTEMS WITH APPLICATIONS, 2020, 158
[3]  
Curry D., 2021, INSTAGRAM REACHES 2
[4]   Polarization and Fake News: Early Warning of Potential Misinformation Targets [J].
Del Vicario, Michela ;
Quattrociocchi, Walter ;
Scala, Antonio ;
Zollo, Fabiana .
ACM TRANSACTIONS ON THE WEB, 2019, 13 (02)
[5]  
Dey Ananya, 2019, International Journal of Computer Science & Information Technology, V11, P80, DOI 10.5121/ijcsit.2019.11507
[6]  
Dong G., 2018, Feature Engineering for Machine Learning and Data Analytics
[7]  
eMarketer, 2020, SOC NETW SEE BOOSTS
[8]  
Eurostat, 2021, E COMM STAT IND
[9]   An ensemble machine learning approach through effective feature extraction to classify fake news [J].
Hakak, Saqib ;
Alazab, Mamoun ;
Khan, Suleman ;
Gadekallu, Thippa Reddy ;
Maddikunta, Praveen Kumar Reddy ;
Khan, Wazir Zada .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 117 :47-58
[10]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778