Twitter Pornography Multilingual Content Identification Based on Machine Learning

被引:5
|
作者
Barfian, Edo [1 ]
Iswanto, Bambang Heru [2 ]
Isa, Sani Muhamad [1 ]
机构
[1] Bina Nusantara Univ, Jl KH Syahdan 9 Kemanggisan, Jakarta 11480, Indonesia
[2] Jakarta State Univ, Jl Rawamangun Muka,RT-11-RW-14, Rawamangun 13220, Jakarta Timur, Indonesia
关键词
Pornography; Twitter; Machine Learning;
D O I
10.1016/j.procs.2017.10.024
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Pornography on social media raises a lot of negative impact and affect the moral of children and teenagers. Social media used to spread pornography can have a negative impact. Thus, the spread of pornography on social media must be prevented. One of the social media which is often used as a medium pornography is Twitter. Pornography used on Twitter in the form of text and image. Among the two types of media, the text is very interesting to study because of the use of a variety of languages. In this study, the classification process will be conducted in Indonesian and English tweet and a combination of both languages. This classification uses three methods of machine learning, Decision Tree, Naive Bayes and Support Vector Machines for the purpose of comparing which method is the best in the classification process. In this study also conducted additional experiment was carried out with the aim of improving the performance in classification. The results showed that the level of accuracy is quite high. However, different grammar is a constraint that affects the accuracy of the results in the classification. (C) 2017 The Authors. Published by Elsevier B.V.
引用
收藏
页码:129 / 136
页数:8
相关论文
共 50 条
  • [1] Machine Learning-Based Identification of Collusive Users in Twitter Stream
    Solanki, Aastik
    Jaiswal, Adesh Kumar
    Mishra, Aditi
    Mishra, Anant Prakash
    Sabherwal, Suruchi
    2021 INTERNATIONAL CONFERENCE ON COMPUTATIONAL PERFORMANCE EVALUATION (COMPE-2021), 2021, : 797 - 802
  • [2] Sarcasm Identification on Twitter: A Machine Learning Approach
    Onan, Aytug
    ARTIFICIAL INTELLIGENCE TRENDS IN INTELLIGENT SYSTEMS, CSOC2017, VOL 1, 2017, 573 : 374 - 383
  • [3] Sentiment identification on Twitter using machine learning
    Morales-Castro, Wendy
    Careta, Eduardo Perez
    Rayas, Angelica Hernandez
    Mukhopadhyay, Tirtha Prasad
    Crespo, J. Armando Perez
    Cabrera, Rafael Guzman
    2022 EURO-ASIA CONFERENCE ON FRONTIERS OF COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, FCSIT, 2022, : 28 - 31
  • [4] Identification of Twitter User Credibility Using Machine Learning
    Kurniati, Rizki
    Widyantoro, Dwi H.
    PROCEEDINGS OF 2017 5TH INTERNATIONAL CONFERENCE ON INSTRUMENTATION, COMMUNICATIONS, INFORMATION TECHNOLOGY, AND BIOMEDICAL ENGINEERING (ICICI-BME): SCIENCE AND TECHNOLOGY FOR A BETTER LIFE, 2017, : 282 - 286
  • [5] Drake or Hen? Machine Learning for Gender Identification on Twitter
    Gombert, Arnault
    Cerquides, Jesus
    ARTIFICIAL INTELLIGENCE RESEARCH AND DEVELOPMENT, 2022, 356 : 59 - 66
  • [6] Machine learning-based approach for depression detection in twitter using content and activity features
    Alsagri, Hatoon S.
    Ykhlef, Mourad
    IEICE Transactions on Information and Systems, 2020, E103D (08) : 1825 - 1832
  • [7] Machine Learning-Based Approach for Depression Detection in Twitter Using Content and Activity Features
    Alsagri, Hatoon S.
    Ykhlef, Mourad
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2020, E103D (08): : 1825 - 1832
  • [8] A Machine Learning Based Method for Automatic Identification of Disaster Related Information Using Twitter Data
    Christidou, Athina Ntiana
    Drakaki, Maria
    Linardos, Vasileios
    INTELLIGENT AND FUZZY SYSTEMS: DIGITAL ACCELERATION AND THE NEW NORMAL, INFUS 2022, VOL 2, 2022, 505 : 70 - 76
  • [9] Harnessing Twitter for Automatic Sentiment Identification Using Machine Learning Techniques
    Dash, Amiya Kumar
    Rout, Jitendra Kumar
    Jena, Sanjay Kumar
    PROCEEDINGS OF 3RD INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING, NETWORKING AND INFORMATICS, ICACNI 2015, VOL 2, 2016, 44 : 507 - 514
  • [10] REWRITING IDENTIFICATION TECHNOLOGY FOR TEXT CONTENT BASED ON MACHINE LEARNING METHODS br
    Kholodna, N.
    Vysotska, V.
    RADIO ELECTRONICS COMPUTER SCIENCE CONTROL, 2022, (04) : 126 - 148