Utilizing Machine Learning for Detecting Harmful Situations by Audio and Text

Cited by: 1
Authors
Allouch, Merav [1]
Mansbach, Noa [1]
Azaria, Amos [1]
Azoulay, Rina [2]
Affiliations
[1] Ariel Univ, Dept Comp Sci, IL-40700 Ariel, Israel
[2] Jerusalem Coll Technol, Dept Comp Sci, IL-9372115 Jerusalem, Israel
Source
APPLIED SCIENCES-BASEL | 2023 / Vol. 13 / Issue 06
Keywords
text classification; audio classification; machine learning; bullying; pretrained models; children's safety; assistive technologies for persons with disabilities; DEEP NEURAL-NETWORK; EMOTION RECOGNITION; SENTIMENT ANALYSIS; AUTISM; MIND
DOI
10.3390/app13063927
Chinese Library Classification (CLC) number
O6 [Chemistry];
Discipline classification code
0703;
Abstract
Children with special needs may struggle to identify uncomfortable and unsafe situations. In this study, we aimed to develop an automated system that detects such situations from audio and text cues, in order to promote children's safety and prevent violence toward them. We composed a text and audio database of over 1891 sentences extracted from videos presenting real-world situations and categorized them into three classes: neutral sentences, insulting sentences, and sentences indicating unsafe conditions. We compared the ability of various machine-learning methods to detect insulting and unsafe sentences. In particular, we found that a deep neural network that takes as input the text embedding vectors of bidirectional encoder representations from transformers (BERT) and the audio embedding vectors of Wav2Vec attains the highest accuracy in detecting unsafe and insulting situations. Our results indicate that it may be feasible to build an automated agent that can detect unsafe and unpleasant situations that children with special needs may encounter, given the context of the dialogues conducted with these children.
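The abstract does not say how the text and audio embeddings are combined, so the following is only a minimal sketch under stated assumptions: the two vectors are concatenated (early fusion) and passed through a small feed-forward network with a softmax over the three classes. The layer sizes, the random untrained weights, and the function names (`classify`, `init_layer`) are all hypothetical, and the random vectors merely stand in for real BERT and Wav2Vec outputs.

```python
import math
import random

# The three classes named in the abstract.
CLASSES = ["neutral", "insulting", "unsafe"]

def dense(x, weights, bias):
    """Affine layer: one output value per weight row."""
    return [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, bias)]

def relu(x):
    return [max(0.0, v) for v in x]

def softmax(x):
    m = max(x)  # subtract the max for numerical stability
    exps = [math.exp(v - m) for v in x]
    total = sum(exps)
    return [e / total for e in exps]

def init_layer(n_in, n_out, rng):
    """Random (untrained) weights and zero biases; an assumption for illustration."""
    scale = 1.0 / math.sqrt(n_in)
    weights = [[rng.uniform(-scale, scale) for _ in range(n_in)] for _ in range(n_out)]
    return weights, [0.0] * n_out

def classify(text_emb, audio_emb, params):
    """Concatenate both embeddings (assumed fusion) and return class probabilities."""
    fused = list(text_emb) + list(audio_emb)
    (w1, b1), (w2, b2) = params
    hidden = relu(dense(fused, w1, b1))
    probs = softmax(dense(hidden, w2, b2))
    return dict(zip(CLASSES, probs))

rng = random.Random(0)
TEXT_DIM, AUDIO_DIM, HIDDEN = 8, 6, 4  # toy sizes; real embeddings are much larger
params = [
    init_layer(TEXT_DIM + AUDIO_DIM, HIDDEN, rng),
    init_layer(HIDDEN, len(CLASSES), rng),
]
# Placeholder vectors standing in for BERT (text) and Wav2Vec (audio) embeddings.
text_emb = [rng.gauss(0, 1) for _ in range(TEXT_DIM)]
audio_emb = [rng.gauss(0, 1) for _ in range(AUDIO_DIM)]
scores = classify(text_emb, audio_emb, params)
```

Because the weights are untrained, `scores` carries no meaning beyond showing the data flow: one probability per class, summing to 1.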
Pages: 23
Related papers
65 in total
  • [41] Prevalence of School Bullying Among Youth with Autism Spectrum Disorders: A Systematic Review and Meta-Analysis
    Maiano, Christophe
    Normand, Claude L.
    Salvas, Marie-Claude
    Moullec, Gregory
    Aime, Annie
    [J]. AUTISM RESEARCH, 2016, 9 (06) : 601 - 615
  • [42] An Agent for Competing with Humans in a Deceptive Game Based on Vocal Cues
    Mansbach, Noa
    Neiterman, Evgeny Hershkovitch
    Azaria, Amos
    [J]. INTERSPEECH 2021, 2021, : 4134 - 4138
  • [43] Abusive Language Detection in Online User Content
    Nobata, Chikashi
    Tetreault, Joel
    Thomas, Achint
    Mehdad, Yashar
    Chang, Yi
    [J]. PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'16), 2016, : 145 - 153
  • [44] Spectral contrast enhancement improves speech intelligibility in noise for cochlear implants
    Nogueira, Waldo
    Rode, Thilo
    Buechner, Andreas
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2016, 139 (02) : 728 - 739
  • [45] Vocal-based emotion recognition using random forests and decision tree
    Noroozi F.
    Sapiński T.
    Kamińska D.
    Anbarjafari G.
    [J]. International Journal of Speech Technology, 2017, 20 (2) : 239 - 246
  • [46] Speech emotion recognition using hidden Markov models
    Nwe, TL
    Foo, SW
    De Silva, LC
    [J]. SPEECH COMMUNICATION, 2003, 41 (04) : 603 - 623
  • [47] Pascal F., 2021, SPEECH CLASSIFICATIO
  • [48] Linguistically Regularized LSTM for Sentiment Classification
    Qian, Qiao
    Huang, Minlie
    Lei, Jinhao
    Zhu, Xiaoyan
    [J]. PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, : 1679 - 1689
  • [49] Ramos Juan., 2003, USING TF IDF DETERMI, V242, P29, DOI 10.15804/TNER.2015.42.4.03
  • [50] A survey on opinion mining and sentiment analysis: Tasks, approaches and applications
    Ravi, Kumar
    Ravi, Vadlamani
    [J]. KNOWLEDGE-BASED SYSTEMS, 2015, 89 : 14 - 46