Reverse Maximum Inner Product Search: Formulation, Algorithms, and Analysis

被引:1
作者
Amagata, Daichi [1 ]
Hara, Takahiro [1 ]
机构
[1] Osaka Univ, Grad Sch Informat Sci & Technol, 1-5 Yamadaoka, Suita, Osaka 5650871, Japan
关键词
Reverse maximum inner product search; high dimensional data; algorithm;
D O I
10.1145/3587215
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The maximum inner product search (MIPS), which finds the item with the highest inner product with a given query user, is an essential problem in the recommendation field. Usually e-commerce companies face situations where they want to promote and sell new or discounted items. In these situations, we have to consider the following questions: Who is interested in the items, and how do we find them? This article answers this question by addressing a new problem called reverse maximum inner product search (reverse MIPS). Given a query vector and two sets of vectors (user vectors and item vectors), the problem of reverse MIPS finds a set of user vectors whose inner product with the query vector is the maximum among the query and item vectors. Although the importance of this problem is clear, its straightforward implementation incurs a computationally expensive cost. We therefore propose Simpfer, a simple, fast, and exact algorithm for reverse MIPS. In an offline phase, Simpfer builds a simple index that maintains a lower bound of the maximum inner product. By exploiting this index, Simpfer judges whether the query vector can have the maximum inner product or not, for a given user vector, in a constant time. Our index enables filtering user vectors, which cannot have the maximum inner product with the query vector, in a batch. We theoretically demonstrate that Simpfer outperforms baselines employing state-of-the-art MIPS techniques. In addition, we answer two new research questions. Can approximation algorithms further improve reverse MIPS processing? Is there an exact algorithm that is faster than Simpfer? For the former, we show that approximation with quality guarantee provides a little speed-up. For the latter, we propose Simpfer++, a theoretically and practically faster algorithm than Simpfer. Our extensive experiments on real datasets show that Simpfer is at least two orders of magnitude faster than the baselines, and Simpfer++ further improves the online processing time.
引用
收藏
页数:23
相关论文
共 50 条
[1]   To Index or Not to Index: Optimizing Exact Maximum Inner Product Search [J].
Abuzaid, Firas ;
Sethi, Geet ;
Bailis, Peter ;
Zaharia, Matei .
2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2019), 2019, :1250-1261
[2]   Reverse Maximum Inner Product Search: How to efficiently find users who would like to buy my item? [J].
Amagata, Daichi ;
Hara, Takahiro .
15TH ACM CONFERENCE ON RECOMMENDER SYSTEMS (RECSYS 2021), 2021, :273-281
[3]   Speeding Up the Xbox Recommender System Using a Euclidean Transformation for Inner-Product Spaces [J].
Bachrach, Yoram ;
Finkelstein, Yehuda ;
Gilad-Bachrach, Ran ;
Katzir, Liran ;
Koenigstein, Noam ;
Nice, Nir ;
Paquet, Ulrich .
PROCEEDINGS OF THE 8TH ACM CONFERENCE ON RECOMMENDER SYSTEMS (RECSYS'14), 2014, :257-264
[4]  
Chen C, 2020, AAAI CONF ARTIF INTE, V34, P19
[5]  
Chin WS, 2016, J MACH LEARN RES, V17
[6]   Approximating matrix multiplication for pattern recognition tasks [J].
Cohen, E ;
Lewis, DD .
JOURNAL OF ALGORITHMS-COGNITION INFORMATICS AND LOGIC, 1999, 30 (02) :211-252
[7]  
Cremonesi Paolo, 2010, RecSys'10-Proceedings of the 4th ACM Conference on Recommender Systems, P39, DOI [DOI 10.1145/1864708.1864721, 10.1145/1864708.1864721]
[8]  
Curtin R. R., 2013, SDM, P1
[9]  
Dai XY, 2020, AAAI CONF ARTIF INTE, V34, P51
[10]  
Ding S, 2011, PROCEEDINGS OF THE 34TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR'11), P993