A Sparsifier Model for Efficient Information Retrieval

被引:1
作者
Dobrynin, Viacheslav [1 ]
Sherman, Mark [1 ]
Abramovich, Roman [1 ]
Platonov, Alexey [1 ]
机构
[1] ITMO Univ, Fac Software Engn & Comp Syst, St Petersburg, Russia
来源
2024 IEEE 18TH INTERNATIONAL CONFERENCE ON APPLICATION OF INFORMATION AND COMMUNICATION TECHNOLOGIES, AICT 2024 | 2024年
关键词
sparsity; inverted index; neural networks; independence;
D O I
10.1109/AICT61888.2024.10740301
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The constant development of dense neural models leads to improved search quality. It is crucial to adapt these models to meet performance requirements. Solutions like SPLADE or SparseEmbed address this by solving the ranking task, whereas our work proposes addressing the simplified task of sparsifying dense vector representations. This approach facilitates the faster adaptation of new dense models for use with efficient inverted indexes. The importance of the independence property for sparse space features, achieved through the use of iVAE, is demonstrated. Additionally, the model is trained to maintain the ranking properties of the dense model, which in our case was a BERT model. As a result, the obtained model showed search quality close to the original BERT model. The proposed sparsification approach can be applied to other tasks requiring sparse spaces by adding new or replacing existing properties of the sparse space. Thus, the paper describes the main aspects of a sparsifier model applied to the task of information retrieval.
引用
收藏
页数:4
相关论文
共 15 条
[11]  
Loshchilov I., 2017, C TRACK P
[12]  
Paria B., 2020, Minimizing flops to learn efficient sparse representations
[13]   The probabilistic relevance framework: BM25 and beyond [J].
Robertson, Stephen ;
Zaragoza, Hugo .
Foundations and Trends in Information Retrieval, 2009, 3 (04) :333-389
[14]  
Thakur N, 2021, Arxiv, DOI arXiv:2104.08663
[15]   From Neural Re-Ranking to Neural Ranking: Learning a Sparse Representation for Inverted Indexing [J].
Zamani, Hamed ;
Dehghani, Mostafa ;
Croft, W. Bruce ;
Learned-Miller, Erik ;
Kamps, Jaap .
CIKM'18: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2018, :497-506