Privacy-Preserving Information Extraction for Ethical Case Studies in Machine Learning Using ChatGLM-LtMP

被引:0
作者
Gao, Xindan [1 ]
Ba, Xinyi [1 ]
Xing, Jian [1 ]
Liu, Ying [1 ]
机构
[1] Northeast Forestry Univ, Coll Comp & Control Engn, Harbin 150040, Peoples R China
基金
中国国家自然科学基金;
关键词
privacy protection; data security; large language model; information extraction; knowledge graph; intelligent question answering;
D O I
10.3390/electronics14071352
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Ensuring privacy protection in machine learning is crucial for handling sensitive information, particularly in ethical case studies within computer engineering. Traditional information extraction methods often expose private data to risks such as membership inference and reconstruction attacks, compromising confidentiality. To address these concerns, we propose ChatGLM-LtMP, a privacy-preserving information extraction framework that integrates Least-to-Most Prompting and P-Tuning v2 for structured and secure data retrieval. By employing controlled prompting mechanisms, our approach minimizes data exposure while maintaining high accuracy (93.71%), outperforming baseline models. Additionally, we construct a knowledge graph using the Neo4j 4.4 database and integrate LangChain 0.2 for case-based intelligent question answering. This framework enables secure and interpretable extraction of ethical case data, making it suitable for applications in sensitive machine learning scenarios. The proposed method advances information extraction, safeguarding sensitive ethical cases from potential attacks in automated learning environments.
引用
收藏
页数:26
相关论文
共 46 条
[1]   Repeatability of Fine-Tuning Large Language Models Illustrated Using QLoRA [J].
Alahmari, Saeed S. ;
Hall, Lawrence O. ;
Mouton, Peter R. ;
Goldgof, Dmitry B. .
IEEE ACCESS, 2024, 12 :153221-153231
[2]  
Dodge J., 2024, P C EMP METH NAT LAN
[3]   FashionGPT: LLM instruction fine-tuning with multiple LoRA-adapter fusion [J].
Gao, Dehong ;
Ma, Yufei ;
Liu, Sen ;
Song, Mengfei ;
Jin, Linbo ;
Jiang, Wen ;
Wang, Xin ;
Ning, Wei ;
Yu, Shanqing ;
Xuan, Qi ;
Cai, Xiaoyan ;
Yang, Libin .
KNOWLEDGE-BASED SYSTEMS, 2024, 299
[4]  
[高茂 Gao Mao], 2022, [计算机应用研究, Application Research of Computers], V39, P2257
[5]   Prompt Engineering with ChatGPT: A Guide for Academic Writers [J].
Giray, Louie .
ANNALS OF BIOMEDICAL ENGINEERING, 2023, 51 (12) :2629-2633
[6]   Research on Joint Extraction Method of Elevator Safety Risk Control Knowledge Based on Multi-Perspective Learning [J].
Hao, Suli ;
Shi, Fenfen .
IEEE ACCESS, 2024, 12 :159488-159502
[7]   Performance Metrics for Multilabel Emotion Classification: Comparing Micro, Macro, and Weighted F1-Scores [J].
Hinojosa Lee, Maria Cristina ;
Braet, Johan ;
Springael, Johan .
APPLIED SCIENCES-BASEL, 2024, 14 (21)
[8]   A syntax-enhanced parameter generation network for multi-source cross-lingual event extraction [J].
Huang, Wenzhi ;
Zhang, Junchi ;
Tian, Tian ;
Ji, Donghong .
KNOWLEDGE-BASED SYSTEMS, 2024, 292
[9]   Category Preferred Canopy-K-means based Collaborative Filtering algorithm [J].
Li, Jianjiang ;
Zhang, Kai ;
Yang, Xiaolei ;
Wei, Peng ;
Wang, Jie ;
Mitra, Karan ;
Ranja, Rajiv .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 93 :1046-1054
[10]  
Li L., 2023, P NAT LANG PROC CHIN