Simpler is Much Faster: Fair and Independent Inner Product Search

被引:3
作者
Aoyama, Kazuyoshi [1 ]
Amagata, Daichi [1 ]
Fujita, Sumio [2 ]
Hara, Takahiro [1 ]
机构
[1] Osaka Univ, Suita, Osaka, Japan
[2] Yahoo Japan Corp, Tokyo, Japan
来源
PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023 | 2023年
关键词
inner product search; fairness; independence;
D O I
10.1145/3539618.3592061
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The problem of inner product search (IPS) is important in many fields. Although maximum inner product search (MIPS) is often considered, its result is usually skewed and static. Users are hence hard to obtain diverse and/or new items by using the MIPS problem. Motivated by this, we formulate a new problem, namely the fair and independent IPS problem. Given a query, a threshold, and an output size k, this problem randomly samples.. items from a set of items such that the inner product of the query and item is not less than the threshold. For each item that satisfies the threshold, this problem is fair, because the probability that such an item is outputted is equal to that for each other item. This fairness can yield diversity and novelty, but this problem faces a computational challenge. Some existing (M)IPS techniques can be employed in this problem, but they require O(n) or o(n) time, where n is the dataset size. To scale well to large datasets, we propose a simple yet efficient algorithm that runs in O(log n + k) expected time. We conduct experiments using real datasets, and the results demonstrate that our algorithm is up to 330 times faster than baselines.
引用
收藏
页码:2379 / 2383
页数:5
相关论文
共 36 条
[1]   Managing Diversity in Airbnb Search [J].
Abdool, Mustafa ;
Haldar, Malay ;
Ramanathan, Prashant ;
Sax, Tyler ;
Zhang, Lanbo ;
Manaswala, Aamir ;
Yang, Lynn ;
Turnbull, Bradley ;
Zhang, Qing ;
Legrand, Thomas .
KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, :2952-2960
[2]  
Afshani P., 2019, SOCG, p4:1
[3]  
Afshani Peyman, 2017, ESA, V87
[4]  
Amagata D., 2016, DEBS, P1
[5]   Reverse Maximum Inner Product Search: How to efficiently find users who would like to buy my item? [J].
Amagata, Daichi ;
Hara, Takahiro .
15TH ACM CONFERENCE ON RECOMMENDER SYSTEMS (RECSYS 2021), 2021, :273-281
[6]  
Amagata Daichi, 2023, AAAI
[7]  
Amagata Daichi, 2023, ACM T WEB
[8]   Algorithmic Effects on the Diversity of Consumption on Spotify [J].
Anderson, Ashton ;
Maystre, Lucas ;
Mehrotra, Rishabh ;
Anderson, Ian ;
Lalmas, Mounia .
WEB CONFERENCE 2020: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2020), 2020, :2155-2165
[9]   Sampling a Near Neighbor in High Dimensions - Who is the Fairest of Them All? [J].
Aumuller, Martin ;
Har-Peled, Sariel ;
Mahabadi, Sepideh ;
Pagh, Rasmus ;
Silvestri, Francesco .
ACM TRANSACTIONS ON DATABASE SYSTEMS, 2022, 47 (01)
[10]  
Aumüller M, 2021, SIGMOD REC, V50, P42, DOI [10.1145/3375395.3387648, 10.1145/3471485.3471496]