The Kernel Trick for Content-Based Media Retrieval in Online Social Networks

Cited by: 1
Authors
Cha, Guang-Ho [1]
Affiliation
[1] Seoul Natl Univ Sci & Technol, Dept Comp Sci & Engn, Seoul, South Korea
Source
JOURNAL OF INFORMATION PROCESSING SYSTEMS | 2021, Vol. 17, No. 5
Funding
National Research Foundation, Singapore;
Keywords
Content-Based Retrieval; Dimensionality Curse; Nearest Neighbor Query; Online Social Network; Kernel Method; Kernel Principal Component Analysis; Similarity Search; Social Network Service; APPROXIMATE NEAREST-NEIGHBOR; OPTIMAL HASHING ALGORITHMS; IMAGE;
DOI
10.3745/JIPS.02.0167
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Online and mobile social network services (SNS) are now widely used to share, disseminate, and search for information instantly. In particular, SNS such as YouTube, Flickr, Facebook, and Amazon allow users to upload billions of images and videos and provide users with a large amount of multimedia information. Information retrieval in multimedia-rich SNS is a very useful but challenging task. Content-based media retrieval (CBMR) is the process of obtaining the relevant image or video objects for a given query from a collection of information sources. However, CBMR suffers from the dimensionality curse because of the inherently high-dimensional features of media data. This paper investigates the effectiveness of the kernel trick in CBMR, specifically kernel principal component analysis (KPCA) for feature extraction or dimensionality reduction. KPCA is a nonlinear extension of linear principal component analysis (LPCA) that discovers nonlinear embeddings using the kernel trick. The fundamental idea of KPCA is to map the input data into a high-dimensional feature space through a nonlinear kernel function and then compute the principal components in that mapped space. Using the Gaussian kernel in our experiments, we compute the principal components of an image dataset in the transformed space and then use them as new feature dimensions for the image dataset. Moreover, KPCA can be applied to many other domains, including CBMR, where LPCA has been used to extract features and where a nonlinear extension would be effective. Our extensive experiments indicate that KPCA is very encouraging compared with LPCA in CBMR.
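The KPCA procedure summarized in the abstract (Gaussian kernel, principal components computed in the mapped space, projections used as new feature dimensions) can be illustrated with a minimal NumPy sketch. This is not the paper's implementation: the function names (`gaussian_kernel`, `kpca_fit`, `kpca_transform`), the kernel parameterization `exp(-gamma * ||x - y||^2)`, the value of `gamma`, and the random stand-in data are all assumptions made for illustration.

```python
import numpy as np

def gaussian_kernel(A, B, gamma):
    # Pairwise Gaussian kernel k(a, b) = exp(-gamma * ||a - b||^2).
    sq = (A ** 2).sum(1)[:, None] + (B ** 2).sum(1)[None, :] - 2.0 * A @ B.T
    return np.exp(-gamma * np.maximum(sq, 0.0))

def kpca_fit(X, n_components, gamma):
    # Hypothetical helper: fit KPCA on the rows of X.
    n = X.shape[0]
    K = gaussian_kernel(X, X, gamma)
    one = np.full((n, n), 1.0 / n)
    # Center the kernel matrix, i.e. center the data in feature space.
    Kc = K - one @ K - K @ one + one @ K @ one
    vals, vecs = np.linalg.eigh(Kc)            # eigenvalues in ascending order
    idx = np.argsort(vals)[::-1][:n_components]
    vals, vecs = vals[idx], vecs[:, idx]
    # Scale eigenvectors so each feature-space component has unit norm.
    # Assumes the selected eigenvalues are strictly positive.
    alphas = vecs / np.sqrt(vals)
    return {"X": X, "K": K, "alphas": alphas, "gamma": gamma}

def kpca_transform(model, Y):
    # Project the rows of Y onto the fitted kernel principal components.
    K = model["K"]
    Kq = gaussian_kernel(Y, model["X"], model["gamma"])
    # Center the query kernel consistently with the training kernel.
    Kq_c = (Kq - Kq.mean(axis=1, keepdims=True)
               - K.mean(axis=0)[None, :] + K.mean())
    return Kq_c @ model["alphas"]

# Stand-in for image feature vectors: 50 items with 16 dimensions each.
rng = np.random.default_rng(0)
X = rng.random((50, 16))
model = kpca_fit(X, n_components=4, gamma=0.5)
Z = kpca_transform(model, X)                   # reduced 4-dimensional features

# Nearest-neighbor query carried out in the reduced space.
q = kpca_transform(model, X[:1])
nearest = int(np.argmin(((Z - q) ** 2).sum(axis=1)))
```

In this sketch the reduced vectors `Z` take the place of the original high-dimensional features in similarity search; how many components to keep and how to set `gamma` would have to be tuned for a real dataset.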
Pages: 1020-1033
Number of pages: 14