Fast Redescription Mining Using Locality-Sensitive Hashing

Cited by: 0
Authors
Karjalainen, Maiju [1]
Galbrun, Esther [1]
Miettinen, Pauli [1]
Affiliations
[1] Univ Eastern Finland, Kuopio, Finland
Source
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, PT VII, ECML PKDD 2024 | 2024, Vol. 14947
Keywords
Redescription mining; Locality-sensitive hashing
DOI
10.1007/978-3-031-70368-3_8
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Redescription mining is a data analysis technique that has found applications in diverse fields. The most used redescription mining approaches involve two phases: finding matching pairs among data attributes and extending the pairs. This process is relatively efficient when the number of attributes remains limited and when the attributes are Boolean, but becomes almost intractable when the data consist of many numerical attributes. In this paper, we present new algorithms that perform the matching and extension orders of magnitude faster than the existing approaches. Our algorithms are based on locality-sensitive hashing with a tailored approach to handle the discretisation of numerical attributes as used in redescription mining.
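The matching phase mentioned in the abstract can be illustrated with a minimal sketch, assuming Boolean attributes only and a standard MinHash banding scheme. This is not the authors' algorithm: the tailored handling of discretised numerical attributes is not reproduced here, and the function names, the parameters `n_hashes` and `bands`, and the representation of each attribute as the set of row ids where it holds are all hypothetical choices made for illustration.

```python
import random
from collections import defaultdict


def jaccard(a, b):
    """Exact Jaccard similarity of two support sets (sets of row ids)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def minhash_signature(support, hash_params, prime):
    """One min-hash value per (a, b) hash function over the row ids in `support`."""
    return tuple(min((a * r + b) % prime for r in support) for a, b in hash_params)


def candidate_pairs(left, right, n_hashes=64, bands=16, seed=0):
    """Return (left_name, right_name) pairs whose signatures agree on at least one band.

    `left` and `right` map attribute names to support sets; both are
    illustrative stand-ins for the two sides of a redescription.
    """
    rng = random.Random(seed)
    prime = 2_147_483_647  # assumed to exceed every row id
    hash_params = [(rng.randrange(1, prime), rng.randrange(prime))
                   for _ in range(n_hashes)]
    rows_per_band = n_hashes // bands
    # Each bucket keeps the left-side and right-side attribute names apart.
    buckets = defaultdict(lambda: (set(), set()))
    for side, attrs in enumerate((left, right)):
        for name, support in attrs.items():
            if not support:
                continue  # an empty support cannot match anything
            sig = minhash_signature(support, hash_params, prime)
            for band in range(bands):
                chunk = sig[band * rows_per_band:(band + 1) * rows_per_band]
                buckets[(band, chunk)][side].add(name)
    pairs = set()
    for lefts, rights in buckets.values():
        pairs.update((l, r) for l in lefts for r in rights)
    return pairs


def matching_pairs(left, right, threshold=0.5, **lsh_kwargs):
    """Verify the LSH candidates with the exact Jaccard similarity."""
    return {(l, r) for l, r in candidate_pairs(left, right, **lsh_kwargs)
            if jaccard(left[l], right[r]) >= threshold}
```

Only attribute pairs that land in a common bucket are compared exactly, which is what avoids the quadratic scan over all left-right pairs; the number of bands and hashes trades recall against the number of candidates to verify.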
Pages: 124-142
Page count: 19
Related papers
50 in total
  • [41] Privacy-preserving Distributed Service Recommendation based on Locality-Sensitive Hashing
    Qi, Lianyong
    Xiang, Haolong
    Dou, Wanchun
    Yang, Chi
    Qin, Yongrui
    Zhang, Xuyun
    2017 IEEE 24TH INTERNATIONAL CONFERENCE ON WEB SERVICES (ICWS 2017), 2017, : 49 - 56
  • [42] Query-aware locality-sensitive hashing scheme for l_p norm
    Huang, Qiang
    Feng, Jianlin
    Fang, Qiong
    Ng, Wilfred
    Wang, Wei
    VLDB JOURNAL, 2017, 26 (05) : 683 - 708
  • [43] A Graph Classification Method Based on Support Vector Machines and Locality-Sensitive Hashing
    Gonzalez-Lima, Maria D.
    Ludena, Carenne C.
    Otazo-Sanchez, Gibran G.
    IEEE ACCESS, 2024, 12 : 15791 - 15799
  • [44] Parallel set similarity join on big data based on Locality-Sensitive Hashing
    Sohrabi, Mohammad Karim
    Azgomi, Hosseion
    SCIENCE OF COMPUTER PROGRAMMING, 2017, 145 : 1 - 12
  • [45] Efficient locality-sensitive hashing over high-dimensional streaming data
    Hao Wang
    Chengcheng Yang
    Xiangliang Zhang
    Xin Gao
    Neural Computing and Applications, 2023, 35 : 3753 - 3766
  • [46] Fast Fuzzy Search for Mixed Data Using Locality Sensitive Hashing
    Lee, Kyung Mi
    Lee, Keon Myung
    PROGRESS IN MECHATRONICS AND INFORMATION TECHNOLOGY, PTS 1 AND 2, 2014, 462-463 : 321 - +
  • [47] A Projection-based Locality-Sensitive Hashing Technique for Reducing False Negatives
    Lee, Keon Myung
    INFORMATION TECHNOLOGY APPLICATIONS IN INDUSTRY, PTS 1-4, 2013, 263-266 : 1341 - 1346
  • [48] Efficient Data Stream Clustering with Sliding Windows based on Locality-Sensitive Hashing
    Youn, Jonghem
    Shim, Junho
    Lee, Sang-Goo
    IEEE ACCESS, 2018, 6 : 63757 - 63776
  • [49] Boosting Multi-Kernel Locality-Sensitive Hashing for Scalable Image Retrieval
    Xia, Hao
    Wu, Pengcheng
    Hoi, Steven C. H.
    Jin, Rong
    SIGIR 2012: PROCEEDINGS OF THE 35TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2012, : 55 - 64