Can Large Language Models Provide Security & Privacy Advice? Measuring the Ability of LLMs to Refute Misconceptions

Cited by: 19
Authors
Chen, Yufan [1 ]
Arunasalam, Arjun [1 ]
Celik, Z. Berkay [1 ]
Affiliations
[1] Purdue Univ, W Lafayette, IN 47907 USA
Source
39TH ANNUAL COMPUTER SECURITY APPLICATIONS CONFERENCE, ACSAC 2023 | 2023
Keywords
Large language models; security and privacy advice; misconception;
DOI
10.1145/3627106.3627196
CLC Number
TP301 [Theory, Methods];
Discipline Code
081202;
Abstract
Users seek security & privacy (S&P) advice from online resources, including trusted websites and content-sharing platforms. These resources help users understand S&P technologies and tools and suggest actionable strategies. Large Language Models (LLMs) have recently emerged as trusted information sources. However, their accuracy and correctness have been called into question. Prior research has outlined the shortcomings of LLMs in answering multiple-choice questions and users' ability to inadvertently circumvent model restrictions (e.g., to produce toxic content). Yet, the ability of LLMs to provide reliable S&P advice is not well-explored. In this paper, we measure their ability to refute popular S&P misconceptions that the general public holds. We first study recent academic literature to curate a dataset of over a hundred S&P-related misconceptions across six different topics. We then query two popular LLMs (Bard and ChatGPT) and develop a labeling guide to evaluate their responses to these misconceptions. To comprehensively evaluate their responses, we further apply three strategies: query each misconception multiple times, generate and query their paraphrases, and solicit source URLs of the responses. Both models demonstrate, on average, a non-negligible error rate of 21.3%, incorrectly supporting popular S&P misconceptions. The error rate increases to 32.6% when we repeatedly query LLMs with the same or paraphrased misconceptions. We also expose that models may partially support a misconception or remain noncommittal, refusing to take a firm stance on misconceptions. Our exploration of the information sources cited in responses reveals that LLMs are susceptible to providing invalid URLs (21.2% for Bard and 67.7% for ChatGPT) or pointing to unrelated sources (44.2% returned by Bard and 18.3% by ChatGPT). Our findings highlight that existing LLMs are not completely reliable for S&P advice and motivate future work in understanding how users can better interact with this technology.
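The measurement pipeline summarized above can be illustrated with a minimal sketch, shown below in Python: repeatedly query an LLM with a misconception and then check whether the source URLs a response cites actually resolve. This is an assumption-laden illustration rather than the authors' code; the model name, prompt wording, number of repeats, request timeout, and the validity criterion (HTTP status below 400) are all illustrative choices not stated in the abstract.

    """Sketch: re-query a misconception several times, then probe cited URLs.
    Assumes an OPENAI_API_KEY in the environment; model and prompt are illustrative."""

    import requests
    from openai import OpenAI  # ChatGPT API client (openai>=1.0)

    client = OpenAI()  # reads OPENAI_API_KEY from the environment


    def query_misconception(statement: str, n_repeats: int = 3) -> list[str]:
        """Ask the model to confirm or refute a misconception, repeating the
        query to surface inconsistent answers across runs."""
        prompt = (
            f'Is the following statement true or false? Explain briefly and '
            f'cite source URLs: "{statement}"'
        )
        replies = []
        for _ in range(n_repeats):
            resp = client.chat.completions.create(
                model="gpt-3.5-turbo",  # illustrative model choice
                messages=[{"role": "user", "content": prompt}],
            )
            replies.append(resp.choices[0].message.content)
        return replies


    def url_is_valid(url: str) -> bool:
        """Treat a cited source URL as valid only if it resolves with an
        HTTP status below 400; anything else counts as invalid here."""
        try:
            resp = requests.get(url, timeout=5, allow_redirects=True)
            return resp.status_code < 400
        except requests.RequestException:
            return False


    if __name__ == "__main__":
        misconception = "Incognito mode makes me anonymous to websites I visit."
        for reply in query_misconception(misconception):
            print(reply[:200], "...")
        print(url_is_valid("https://support.google.com/chrome/answer/95464"))

A full evaluation along the paper's lines would additionally query paraphrases of each misconception and label each response (refutes, supports, partially supports, or noncommittal) against a labeling guide; those steps are omitted from this sketch.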
Pages: 366-378
Number of pages: 13