Information retrieval on an SCI-based PC cluster

Cited by: 2
Authors
Chung, SH [1]
Kwon, HC [1]
Ryu, KR [1]
Chung, Y [1]
Jang, H [1]
Choi, CA [1]
Affiliation
[1] Pusan Natl Univ, Sch Elect & Comp Engn, Pusan 609735, South Korea
Source
JOURNAL OF SUPERCOMPUTING | 2001 / Vol. 19 / Issue 03
Keywords
cluster computing; PC cluster; SCI; information retrieval; inverted index file
DOI
10.1023/A:1011178530932
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
This article presents an efficient parallel information retrieval (IR) system that provides fast information service for Internet users in a low-cost, high-performance PC-NOW environment. The IR system is implemented on a PC cluster based on the Scalable Coherent Interface (SCI), a powerful interconnect mechanism for both shared-memory and message-passing models. In the IR system, the inverted-index file (IIF) is partitioned into pieces using a greedy declustering algorithm, and the pieces are distributed to the cluster nodes, each of which stores its piece on its own hard disk. For each incoming multi-term user query, the terms are sent to the nodes that hold the relevant pieces of the IIF and are evaluated there in parallel. The IR system is developed using a distributed shared memory (DSM) programming technique based on the SCI. In the experiments, the IR system outperforms an MPI-based IR system that uses Fast Ethernet as the interconnect. A speed-up of up to 5.0 was obtained with an 8-node cluster in processing each query on a 500,000-document IIF.
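The partition-then-route idea in the abstract can be illustrated with a minimal sketch. The abstract does not state the criterion used by the authors' greedy declustering algorithm, so the version below assumes a simple one: terms are placed, largest postings list first, on whichever node currently holds the least postings data. The function names `greedy_decluster` and `route_query`, and the sample terms and sizes, are hypothetical and are not taken from the paper.

```python
from collections import defaultdict


def greedy_decluster(posting_sizes, num_nodes):
    """Assign each term to the least-loaded node, largest postings list first.

    Assumption: load is measured by postings-list size. The paper's actual
    greedy criterion is not given in the abstract.
    """
    loads = [0] * num_nodes
    placement = {}
    # Sort by descending size; break ties by term so the result is deterministic.
    ordered = sorted(posting_sizes.items(), key=lambda kv: (-kv[1], kv[0]))
    for term, size in ordered:
        node = min(range(num_nodes), key=lambda n: loads[n])
        placement[term] = node
        loads[node] += size
    return placement, loads


def route_query(terms, placement):
    """Group the query terms by the node that stores their piece of the index."""
    per_node = defaultdict(list)
    for term in terms:
        if term in placement:  # terms missing from the index are skipped
            per_node[placement[term]].append(term)
    return dict(per_node)


# Hypothetical postings-list sizes for a tiny index on a 2-node cluster.
sizes = {"cluster": 50, "sci": 40, "index": 30, "query": 20}
placement, loads = greedy_decluster(sizes, num_nodes=2)
routed = route_query(["cluster", "sci", "unknown"], placement)
```

With this placement, a two-term query such as "cluster sci" touches both nodes, so the two postings lists can be read from separate disks at the same time. That is the kind of parallel evaluation the abstract describes.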
Pages: 251 - 265
Page count: 15
Related papers
50 in total
  • [1] Information Retrieval on an SCI-Based PC Cluster
    Sang-Hwa Chung
    Hyuk-Chul Kwon
    Kwang Ryel Ryu
    Yoojin Chung
    Hankook Jang
    Cham-Ah Choi
    The Journal of Supercomputing, 2001, 19 : 251 - 265
  • [2] Parallel information retrieval on an SCI-based PC-NOW
    Chung, SH
    Kwon, HC
    Ryu, KR
    Jang, HK
    Kim, JH
    Choi, CA
    PARALLEL AND DISTRIBUTED PROCESSING, PROCEEDINGS, 2000, 1800 : 81 - 90
  • [3] An SCI-Based PC Cluster Utilizing Coherent Network Cache
    Sang-Hwa Chung
    Soo-Cheol Oh
    Cluster Computing, 2003, 6 (2) : 153 - 159
  • [4] Experiences with scientific applications on an SCI-based Linux cluster
    Bücker, HM
    Eck, B
    Henrichs, J
    2000 INTERNATIONAL WORKSHOPS ON PARALLEL PROCESSING, PROCEEDINGS, 2000 : 347 - 351
  • [5] A CC-NUMA prototype card for SCI-based PC clustering
    Chung, SH
    Oh, SC
    Park, S
    Jang, H
    Ha, CJ
    CLUSTER 2000: IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING, PROCEEDINGS, 2000 : 375 - 376
  • [6] SCI-based LINUX PC-clusters as a platform for electromagnetic field calculations
    Trinitis, C
    Schulz, M
    Eberl, M
    Karl, W
    PARALLEL COMPUTING TECHNOLOGIES, 2001, 2127 : 510 - 513
  • [7] Use of SCI-based publication counts
    Arunachalam, S
    CURRENT SCIENCE, 2003, 85 (10) : 1391 - 1392
  • [8] Optimizing data locality for SCI-based PC-clusters with the SMiLE monitoring approach
    Karl, Wolfgang
    Leberecht, Markus
    Schulz, Martin
    Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT, 1999 : 169 - 176
  • [9] Design and implementation of CC-NUMA card II for SCI-based PC clustering
    Oh, SC
    Chung, SH
    Jang, H
    2002 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING, PROCEEDINGS, 2002 : 145 - 151
  • [10] AN ATM NETWORK INTERFACE FOR AN SCI-BASED SYSTEM
    KURE, O
    MOLDEKLEV, K
    INFORMATION NETWORKS AND DATA COMMUNICATION, 1994, 23 : 203 - 215