Lightweight Conceptual Dictionary Learning for Text Classification Using Information Compression

被引:0
作者
Wan, Li [1 ]
Alpcan, Tansu [1 ]
Kuijper, Margreta [1 ]
Viterbo, Emanuele [2 ]
机构
[1] Univ Melbourne, Dept Elect Elect Engn, Parkville, Vic 3010, Australia
[2] Monash Univ, Dept Elect Comp Syst Engn, Clayton, Vic 3800, Australia
关键词
Dictionaries; Atoms; Classification algorithms; Text categorization; Neural networks; Accuracy; Vectors; Dictionary learning; information bottleneck; information theory; supervised learning;
D O I
10.1109/TKDE.2024.3421255
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a novel supervised dictionary learning framework for text classification, integrating the Lempel-Ziv-Welch (LZW) algorithm for data compression and dictionary construction. This two-phase approach refines dictionaries by optimizing dictionary atoms for discriminative power using mutual information and class distribution. Our method facilitates classifier training, such as SVMs and neural networks. We introduce the information plane area rank (IPAR) to evaluate the information-theoretic performance of our algorithm. Tested on six benchmark text datasets, our model performs nearly as well as top models in limited-vocabulary settings, lagging by only about 2% while using just 10% of the parameters. However, its performance drops in diverse-vocabulary contexts due to the LZW algorithm's limitations with low-repetition data. This contrast highlights its efficiency and limitations across different dataset types.
引用
收藏
页码:8711 / 8717
页数:7
相关论文
共 39 条
[1]  
Deng Liwei, 2023, WSDM '23: Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining, P375, DOI 10.1145/3539597.3570410
[2]   Recurrent Marked Temporal Point Processes: Embedding Event History to Vector [J].
Du, Nan ;
Dai, Hanjun ;
Trivedi, Rakshit ;
Upadhyay, Utkarsh ;
Gomez-Rodriguez, Manuel ;
Song, Le .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :1555-1564
[3]   Learning to Simulate Human Mobility [J].
Feng, Jie ;
Yang, Zeyu ;
Xu, Fengli ;
Yu, Haisu ;
Wang, Mudan ;
Li, Yong .
KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, :3426-3433
[4]   DeepMove: Predicting Human Mobility with Attentional Recurrent Networks [J].
Feng, Jie ;
Li, Yong ;
Zhang, Chao ;
Sun, Funing ;
Meng, Fanchao ;
Guo, Ang ;
Jin, Depeng .
WEB CONFERENCE 2018: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW2018), 2018, :1459-1468
[5]  
Gao Q, 2017, PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P1689
[6]  
Gao TY, 2021, 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), P6894
[7]  
Gong LT, 2023, AAAI CONF ARTIF INTE, P4276
[8]  
Gupta V, 2022, AAAI CONF ARTIF INTE, P4005
[9]  
Gupta V, 2021, PR MACH LEARN RES, V130
[10]  
Hochreiter S, 1997, NEURAL COMPUT, V9, P1735, DOI [10.1162/neco.1997.9.8.1735, 10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]