A Parallel Architecture for the Partitioning around Medoids (PAM) Algorithm for Scalable Multi-Core Processor Implementation with Applications in Healthcare

被引:9
|
作者
Mushtaq, Hassan [1 ]
Khawaja, Sajid Gul [2 ]
Akram, Muhammad Usman [2 ]
Yasin, Amanullah [1 ]
Muzammal, Muhammad [3 ]
Khalid, Shehzad [4 ]
Khan, Shoab Ahmad [2 ]
机构
[1] Sir Syed CASE Inst Technol, Dept Elect & Comp Engn, Islamabad 44000, Pakistan
[2] Natl Univ Sci & Technol, Dept Comp & Software Engn, CE&ME, Islamabad 44000, Pakistan
[3] Bahria Univ, Dept Comp Sci, Islamabad 44000, Pakistan
[4] Bahria Univ, Dept Comp Engn, Islamabad 44000, Pakistan
关键词
clustering; partitioning around medoids; scalable; parallel; reconfigurable; FPGA; MPSoCs; multi-core processor; time complexity; speedup; EFFICIENT; MANAGEMENT; QUALITY;
D O I
10.3390/s18124129
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Clustering is the most common method for organizing unlabeled data into its natural groups (called clusters), based on similarity (in some sense or another) among data objects. The Partitioning Around Medoids (PAM) algorithm belongs to the partitioning-based methods of clustering widely used for objects categorization, image analysis, bioinformatics and data compression, but due to its high time complexity, the PAM algorithm cannot be used with large datasets or in any embedded or real-time application. In this work, we propose a simple and scalable parallel architecture for the PAM algorithm to reduce its running time. This architecture can easily be implemented either on a multi-core processor system to deal with big data or on a reconfigurable hardware platform, such as FPGA and MPSoCs, which makes it suitable for real-time clustering applications. Our proposed model partitions data equally among multiple processing cores. Each core executes the same sequence of tasks simultaneously on its respective data subset and shares intermediate results with other cores to produce results. Experiments show that the computational complexity of the PAM algorithm is reduced exponentially as we increase the number of cores working in parallel. It is also observed that the speedup graph of our proposed model becomes more linear with the increase in number of data points and as the clusters become more uniform. The results also demonstrate that the proposed architecture produces the same results as the actual PAM algorithm, but with reduced computational complexity.
引用
收藏
页数:17
相关论文
共 5 条
  • [1] Balancing Parallel Applications on Multi-core Processors Based on Cache Partitioning
    Suo, Guang
    Yang, Xue-jun
    2009 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS, PROCEEDINGS, 2009, : 190 - 195
  • [2] Implementation of an Algorithm for Heart Rate Measurement in a Specialized Multi-core Processor
    Sondej, Tadeusz
    Tomaszewski, Damian
    Rozanowski, Krzysztof
    2015 22ND INTERNATIONAL CONFERENCE MIXED DESIGN OF INTEGRATED CIRCUITS & SYSTEMS (MIXDES), 2015, : 74 - 78
  • [3] A Performance evaluation of a Probabilistic Parallel Genetic Algorithm: FPGA vs. Multi-core Processor
    Jewajinda, Yutana
    2013 INTERNATIONAL COMPUTER SCIENCE AND ENGINEERING CONFERENCE (ICSEC), 2013, : 298 - 301
  • [4] Parallel Implementation Strategy for Content Based Video Copy Detection Using a Multi-core Processor
    Liao, Kaiyang
    Zhao, Fan
    Zhang, Mingzhu
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2014, 8 (10): : 3520 - 3537
  • [5] A Parallel Algorithm of Multiple String Matching Based on Set-Partition in Multi-core Architecture
    Liu, Jiahui
    Li, Fangzhou
    Sun, Guanglu
    INTERNATIONAL JOURNAL OF SECURITY AND ITS APPLICATIONS, 2016, 10 (04): : 267 - 278