Trojan Detection in Large Language Models: Insights from The Trojan Detection Challenge

被引:0
|
作者
Maloyan, Narek
Verma, Ekansh
Nutfullin, Bulat
Ashinov, Bislan
机构
来源
arXiv |
关键词
Compilation and indexing terms; Copyright 2025 Elsevier Inc;
D O I
暂无
中图分类号
学科分类号
摘要
Computational linguistics - Malware
引用
收藏
相关论文
共 49 条
  • [41] TrojanSAINT: Gate-Level Netlist Sampling-Based Inductive Learning for Hardware Trojan Detection
    New York University Abu Dhabi, United Arab Emirates
    arXiv,
  • [42] TrojanSAINT: Gate-Level Netlist Sampling-Based Inductive Learning for Hardware Trojan Detection
    Lashen, Hazem
    Alrahis, Lilas
    Knechtel, Johann
    Sinanoglu, Ozgur
    Proceedings - IEEE International Symposium on Circuits and Systems, 2023, 2023-May
  • [43] MAGECODE: Machine-Generated Code Detection Method Using Large Language Models
    Pham, Hung
    Ha, Huyen
    Tong, Van
    Hoang, Dung
    Tran, Duc
    Le, Tuyen Ngoc
    IEEE Access, 2024, 12 : 190186 - 190202
  • [44] Empirical Risk-aware Machine Learning on Trojan-Horse Detection for Trusted Quantum Key Distribution Networks
    Chou, Hong-Fu
    Vu, Thang X.
    Maity, Ilora
    Garces-Socarras, Luis M.
    Gonzalez-Rios, Jorge L.
    Merlano-Duncan, Juan Carlos
    Ma, Sean Longyu
    Chatzinotas, Symeon
    Ottersten, Björn
    arXiv,
  • [45] Exploring Reasoning Biases in Large Language Models Through Syllogism: Insights from the NeuBAROCO Dataset
    Ozeki, Kentaro
    Ando, Risako
    Morishita, Takanobu
    Abe, Hirohiko
    Mineshima, Koji
    Okada, Mitsuhiro
    arXiv,
  • [46] HSTF-Model: an HTTP-based Trojan Detection Model via the Hierarchical Spatio-Temporal Features of Traffics
    Xie, Jiang
    Li, Shuhao
    Yun, Xiaochun
    Zhang, Yongzheng
    Chang, Peng
    arXiv, 2023,
  • [47] Predicting seizure recurrence from medical records using large language models
    Mbizvo, Gashirai K.
    Buchan, Ian
    LANCET DIGITAL HEALTH, 2023, 5 (12): : E851 - E852
  • [48] Detection of COVID-19 from X-Ray Images Using Machine Learning Models
    Sakib, Md. Masrul
    Karim, Meem
    Swachchha, Aftab Miraj
    Islam, Maheen
    Lecture Notes in Networks and Systems, 2023, 578 : 759 - 773
  • [49] Think from Words(TFW): Initiating Human-Like Cognition in Large Language Models Through Think from Words for Japanese Text-level Classification
    Gan, Chengguang
    Zhang, Qinghao
    Mori, Tatsunori
    arXiv, 2023,