OpenMP Implementation of Parallel Longest Common Subsequence Algorithm for Mathematical Expression Retrieval

被引:1
|
作者
Perepu, Pavan Kumar [1 ]
机构
[1] Mahindra Ecole Cent Coll Engn, Dept Comp Sci & Engn, Hyderabad, India
关键词
Mathematical expression retrieval; longest common subsequence; time complexity; parallel algorithm; OpenMP; SEARCH;
D O I
10.1142/S0129626421500079
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Given a mathematical expression in LaTeX or MathML format, retrieval algorithm extracts similar expressions from a database. In our previous work, we have used Longest Common Subsequence (LCS) algorithm to match two expressions of lengths, m and n, which takes O(mn) time complexity. If there are T database expressions, total complexity is O(Tmn), and an increase in T also increases this complexity. In the present work, we propose to use parallel LCS algorithm in our retrieval process. Parallel LCS has O(max(m,n)) time complexity with max(m,n) processors and total complexity can be reduced to O(Tmax(m,n)). For our experimentation, OpenMP based implementation has been used on Intel i3 processor with 4 cores. However, for smaller expressions, parallel version takes more time as the implementation overhead dominates the algorithmic improvement. As such, we have proposed to use parallel version, selectively, only on larger expressions, in our retrieval algorithm to achieve better performance. We have compared the sequential and parallel versions of our ME retrieval algorithm, and the performance results have been reported on a database of 829 mathematical expressions.
引用
收藏
页数:18
相关论文
共 46 条
  • [31] A space efficient algorithm for the longest common subsequence in k-length substrings
    Zhu, Daxin
    Wang, Lei
    Wang, Tinran
    Wang, Xiaodong
    THEORETICAL COMPUTER SCIENCE, 2017, 687 : 79 - 92
  • [32] An efficient and hardware-implementable systolic algorithm for the longest common subsequence problem
    Hu, Shu-Hua
    Wang, Cong-Wei
    Chen, Hsing-Lung
    PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2008, : 3150 - +
  • [33] A parallel algorithm for longest common substring of multiple biosequences
    Liu, Wei
    Chen, Ling
    DCABES 2006 PROCEEDINGS, VOLS 1 AND 2, 2006, : 13 - 17
  • [34] An almost-linear time and linear space algorithm for the longest common subsequence problem
    Guo, JY
    Hwang, FK
    INFORMATION PROCESSING LETTERS, 2005, 94 (03) : 131 - 135
  • [35] A fast and simple algorithm for computing the longest common subsequence of run-length encoded strings
    Ann, Hsing-Yen
    Yang, Chang-Biau
    Tseng, Chiou-Ting
    Hor, Chiou-Yi
    INFORMATION PROCESSING LETTERS, 2008, 108 (06) : 360 - 364
  • [36] Most discriminating segment - Longest common subsequence (MDSLCS) algorithm for dynamic hand gesture classification
    Stern, Helman
    Shmueli, Merav
    Berman, Sigal
    PATTERN RECOGNITION LETTERS, 2013, 34 (15) : 1980 - 1989
  • [37] RISC-Based Simulation of Longest Common Subsequence Algorithm in MIPS64 Simulators
    Gara, Glenn Paul P.
    Pacot, Mark Phil B.
    Uy, Roger Luis T.
    PROCEEDINGS OF TENCON 2018 - 2018 IEEE REGION 10 CONFERENCE, 2018, : 0811 - 0814
  • [38] An effective branch-and-bound algorithm to solve the k-Longest Common Subsequence problem
    Huang, GF
    Lim, A
    ECAI 2004: 16TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2004, 110 : 191 - 195
  • [39] Longest common subsequence between run-length-encoded strings: a new algorithm with improved parallelism
    Freschi, V
    Bogliolo, A
    INFORMATION PROCESSING LETTERS, 2004, 90 (04) : 167 - 173
  • [40] Batch Source-Code Plagiarism Detection Using an Algorithm for the Bounded Longest Common Subsequence Problem
    Campos, R. A. Castro
    Martinez, F. J. Zaragoza
    2012 9TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING, COMPUTING SCIENCE AND AUTOMATIC CONTROL (CCE), 2012,