Detecting dark patterns in shopping websites - a multi-faceted approach using Bidirectional Encoder Representations From Transformers (BERT)

被引：0

作者：

Vedhapriyavadhana, R. ^{[1
]}

Bharti, Priyanshu ^{[2
]}

Chidambaranathan, Senthilnathan ^{[3
]}

机构：

[1] Univ West Scotland, Sch Comp Engn & Phys Sci, Import Bldg,2 Clove Crescent, London E14 2BE, England

[2] Vellore Inst Technol, Sch Comp Sci & Engn, Chennai, India

[3] Virtusa, Dept Architecture & Design, Piscataway, NJ USA

来源：

ENTERPRISE INFORMATION SYSTEMS | 2025年

关键词：

Dark patterns; multi-class text classification; natural language processing; BERT; user experience; user interfaces;

D O I：

10.1080/17517575.2025.2457961

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Dark patterns refer to certain elements of the user interface and user experience that are designed to deceive, manipulate, confuse, and pressure users of a particular platform or website into making decisions they wouldn't have made knowingly. Many companies have begun implementing dark patterns on their websites, employing carefully crafted language and design elements to manipulate their users. Numerous studies have examined this subject and developed a classification system for these patterns. Additionally, governments worldwide have taken actions to restrict the use of these practices. This proposed work seeks to establish a fundamental framework for developing a browser extension, the purpose of which is to extract text from a specific shopping website, employ Bidirectional Encoder Representations from Transformers (BERT), an open-source natural language processing model, to identify and expose dark patterns to users who may be unaware of them. This tool's development has the potential to create a more equitable environment and enable individuals to enhance their knowledge in this area. The proposed work explores the issues and challenges associated with detecting dark patterns, as well as the strategies employed by companies to make detection more challenging by carefully modifying the design of their websites and applications. Moreover, the proposed work aims to enhance the accuracy for the detection of dark patterns using a natural language processing (NLP) model, i.e. BERT which results in accuracy 97% compared to classical models such as Random Forest and SVM having accuracy of 95.4% and 95.8% respectively. It seeks to facilitate future research and improvements to ensure the tool remains up-to-date with the constantly changing tactics. At last, the proposed work introduces a novel approach for safeguarding users from dark patterns using a machine-learning detection chromium extension. It additionally provides insights beyond the technical complexities that could help in the further development of this application. Dark patterns refer to certain elements of the user interface and user experience that are designed to deceive,confuse,and pressure users of a particular platform or website into making decisions they wouldn't have made knowingly. This proposed work seeks to establish a fundamental framework for developing a browser extension to extract text from a specific shopping website, employ an open-source natural language processing model, to identify and expose dark patterns to users who may be unaware of them. It aims to enhance the accuracy for the detection of dark patterns which results in accuracy 97% compared to other classical models.

引用

页数：33

共 37 条

[31] Detecting changes in help seeker conversations on a suicide prevention helpline during the COVID-19 pandemic: in-depth analysis using encoder representations from transformers
Salmi, Salim
Merelle, Saskia
Gilissen, Renske
van der Mei, Rob
Bhulai, Sandjai
BMC PUBLIC HEALTH, 2022, 22 (01)
[32] Automatic text classification of actionable radiology reports of tinnitus patients using bidirectional encoder representations from transformer (BERT) and in-domain pre-training (IDPT)
Jia Li
Yucong Lin
Pengfei Zhao
Wenjuan Liu
Linkun Cai
Jing Sun
Lei Zhao
Zhenghan Yang
Hong Song
Han Lv
Zhenchang Wang
BMC Medical Informatics and Decision Making, 22
[33] Automatic text classification of actionable radiology reports of tinnitus patients using bidirectional encoder representations from transformer (BERT) and in-domain pre-training (IDPT)
Li, Jia
Lin, Yucong
Zhao, Pengfei
Liu, Wenjuan
Cai, Linkun
Sun, Jing
Zhao, Lei
Yang, Zhenghan
Song, Hong
Lv, Han
Wang, Zhenchang
BMC MEDICAL INFORMATICS AND DECISION MAKING, 2022, 22 (01)
[34] Extracting Multiple Worries From Breast Cancer Patient Blogs Using Multilabel Classification With the Natural Language Processing Model Bidirectional Encoder Representations From Transformers: Infodemiology Study of Blogs
Watanabe, Tomomi
Yada, Shuntaro
Aramaki, Eiji
Yajima, Hiroshi
Kizaki, Hayato
Hori, Satoko
JMIR CANCER, 2022, 8 (02):
[35] Multi-Label Classification in Patient-Doctor Dialogues With the RoBERTa-WWM-ext plus CNN (Robustly Optimized Bidirectional Encoder Representations From Transformers Pretraining Approach With Whole Word Masking Extended Combining a Convolutional Neural Network) Model: Named Entity Study
Sun, Yuanyuan
Gao, Dongping
Shen, Xifeng
Li, Meiting
Nan, Jiale
Zhang, Weining
JMIR MEDICAL INFORMATICS, 2022, 10 (04) : 142 - 152
[36] A Natural Language Processing Model for COVID-19 Detection Based on Dutch General Practice Electronic Health Records by Using Bidirectional Encoder Representations From Transformers: Development and Validation Study
Homburg, Maarten
Meijer, Eline
Berends, Matthijs
Kupers, Thijmen
Hartman, Tim Olde
Muris, Jean
de Schepper, Evelien
Velek, Premysl
Kuiper, Jeroen
Berger, Marjolein
Peters, Lilian
JOURNAL OF MEDICAL INTERNET RESEARCH, 2023, 25
[37] Using Character-Level and Entity-Level Representations to Enhance Bidirectional Encoder Representation From Transformers-Based Clinical Semantic Textual Similarity Model: ClinicalSTS Modeling Study
Xiong, Ying
Chen, Shuai
Chen, Qingcai
Yan, Jun
Tang, Buzhou
JMIR MEDICAL INFORMATICS, 2020, 8 (12)

← 1 2 3 4 →