Detecting dark patterns in shopping websites - a multi-faceted approach using Bidirectional Encoder Representations From Transformers (BERT)

被引:0
作者
Vedhapriyavadhana, R. [1 ]
Bharti, Priyanshu [2 ]
Chidambaranathan, Senthilnathan [3 ]
机构
[1] Univ West Scotland, Sch Comp Engn & Phys Sci, Import Bldg,2 Clove Crescent, London E14 2BE, England
[2] Vellore Inst Technol, Sch Comp Sci & Engn, Chennai, India
[3] Virtusa, Dept Architecture & Design, Piscataway, NJ USA
关键词
Dark patterns; multi-class text classification; natural language processing; BERT; user experience; user interfaces;
D O I
10.1080/17517575.2025.2457961
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Dark patterns refer to certain elements of the user interface and user experience that are designed to deceive, manipulate, confuse, and pressure users of a particular platform or website into making decisions they wouldn't have made knowingly. Many companies have begun implementing dark patterns on their websites, employing carefully crafted language and design elements to manipulate their users. Numerous studies have examined this subject and developed a classification system for these patterns. Additionally, governments worldwide have taken actions to restrict the use of these practices. This proposed work seeks to establish a fundamental framework for developing a browser extension, the purpose of which is to extract text from a specific shopping website, employ Bidirectional Encoder Representations from Transformers (BERT), an open-source natural language processing model, to identify and expose dark patterns to users who may be unaware of them. This tool's development has the potential to create a more equitable environment and enable individuals to enhance their knowledge in this area. The proposed work explores the issues and challenges associated with detecting dark patterns, as well as the strategies employed by companies to make detection more challenging by carefully modifying the design of their websites and applications. Moreover, the proposed work aims to enhance the accuracy for the detection of dark patterns using a natural language processing (NLP) model, i.e. BERT which results in accuracy 97% compared to classical models such as Random Forest and SVM having accuracy of 95.4% and 95.8% respectively. It seeks to facilitate future research and improvements to ensure the tool remains up-to-date with the constantly changing tactics. At last, the proposed work introduces a novel approach for safeguarding users from dark patterns using a machine-learning detection chromium extension. It additionally provides insights beyond the technical complexities that could help in the further development of this application. Dark patterns refer to certain elements of the user interface and user experience that are designed to deceive,confuse,and pressure users of a particular platform or website into making decisions they wouldn't have made knowingly. This proposed work seeks to establish a fundamental framework for developing a browser extension to extract text from a specific shopping website, employ an open-source natural language processing model, to identify and expose dark patterns to users who may be unaware of them. It aims to enhance the accuracy for the detection of dark patterns which results in accuracy 97% compared to other classical models.
引用
收藏
页数:33
相关论文
共 37 条
  • [31] Detecting changes in help seeker conversations on a suicide prevention helpline during the COVID-19 pandemic: in-depth analysis using encoder representations from transformers
    Salmi, Salim
    Merelle, Saskia
    Gilissen, Renske
    van der Mei, Rob
    Bhulai, Sandjai
    BMC PUBLIC HEALTH, 2022, 22 (01)
  • [32] Automatic text classification of actionable radiology reports of tinnitus patients using bidirectional encoder representations from transformer (BERT) and in-domain pre-training (IDPT)
    Jia Li
    Yucong Lin
    Pengfei Zhao
    Wenjuan Liu
    Linkun Cai
    Jing Sun
    Lei Zhao
    Zhenghan Yang
    Hong Song
    Han Lv
    Zhenchang Wang
    BMC Medical Informatics and Decision Making, 22
  • [33] Automatic text classification of actionable radiology reports of tinnitus patients using bidirectional encoder representations from transformer (BERT) and in-domain pre-training (IDPT)
    Li, Jia
    Lin, Yucong
    Zhao, Pengfei
    Liu, Wenjuan
    Cai, Linkun
    Sun, Jing
    Zhao, Lei
    Yang, Zhenghan
    Song, Hong
    Lv, Han
    Wang, Zhenchang
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2022, 22 (01)
  • [34] Extracting Multiple Worries From Breast Cancer Patient Blogs Using Multilabel Classification With the Natural Language Processing Model Bidirectional Encoder Representations From Transformers: Infodemiology Study of Blogs
    Watanabe, Tomomi
    Yada, Shuntaro
    Aramaki, Eiji
    Yajima, Hiroshi
    Kizaki, Hayato
    Hori, Satoko
    JMIR CANCER, 2022, 8 (02):
  • [35] Multi-Label Classification in Patient-Doctor Dialogues With the RoBERTa-WWM-ext plus CNN (Robustly Optimized Bidirectional Encoder Representations From Transformers Pretraining Approach With Whole Word Masking Extended Combining a Convolutional Neural Network) Model: Named Entity Study
    Sun, Yuanyuan
    Gao, Dongping
    Shen, Xifeng
    Li, Meiting
    Nan, Jiale
    Zhang, Weining
    JMIR MEDICAL INFORMATICS, 2022, 10 (04) : 142 - 152
  • [36] A Natural Language Processing Model for COVID-19 Detection Based on Dutch General Practice Electronic Health Records by Using Bidirectional Encoder Representations From Transformers: Development and Validation Study
    Homburg, Maarten
    Meijer, Eline
    Berends, Matthijs
    Kupers, Thijmen
    Hartman, Tim Olde
    Muris, Jean
    de Schepper, Evelien
    Velek, Premysl
    Kuiper, Jeroen
    Berger, Marjolein
    Peters, Lilian
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2023, 25
  • [37] Using Character-Level and Entity-Level Representations to Enhance Bidirectional Encoder Representation From Transformers-Based Clinical Semantic Textual Similarity Model: ClinicalSTS Modeling Study
    Xiong, Ying
    Chen, Shuai
    Chen, Qingcai
    Yan, Jun
    Tang, Buzhou
    JMIR MEDICAL INFORMATICS, 2020, 8 (12)