A hybrid linear text segmentation algorithm using hierarchical agglomerative clustering and discrete particle swarm optimization

被引:34
|
作者
Wu, Ji-Wei [1 ]
Tseng, Judy C. R. [2 ]
Tsai, Wen-Nung
机构
[1] Natl Chiao Tung Univ, Dept Comp Sci, Hsinchu, Taiwan
[2] Chung Hua Univ, Dept Comp Sci & Informat Engn, Hsinchu, Taiwan
关键词
Linear text segmentation; hierarchical agglomerative clustering; discrete particle swarm optimization; natural language processing;
D O I
10.3233/ICA-130446
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Linear text segmentation plays an important role in many natural language processing tasks. Many algorithms have been proposed and shown to improve the performance of linear text segmentation. However, the previous studies often suffer from either lower segmentation accuracy or higher computational complexity. Moreover, parameter setting is another critical problem in some algorithms. Although manual assignment is an approach to solve this problem, it may increase the user's burden, and the parameters provided may not always be suitable to reflect the real metadata of a text. In this paper, a hybrid algorithm, TSHAC-DPSO, is proposed to tackle these problems. A novel linear Text Segmentation algorithm based on Hierarchical Agglomerative Clustering (TSHAC) is proposed to rapidly generate a satisfactory solution without an auxiliary knowledge base, parameter setting, or user involvement; then an efficient evolutional algorithm, Discrete Particle Swarm Optimization (DPSO), is adopted to generate the global optimal solution by refining the solution created by TSHAC. TSHAC-DPSO fully utilizes the merits of both algorithms which not only improve the accuracy of linear text segmentation, but also make the execution more efficient and flexible. The experimental results show that TSHAC-DPSO provides comparable segmentation accuracy with several well-known linear text segmentation algorithms.
引用
收藏
页码:35 / 46
页数:12
相关论文
共 50 条
  • [41] Automatic particle swarm optimization clustering algorithm
    Chen, Ching-Yi
    Feng, Hsuan-Ming
    Ye, Fun
    International Journal of Electrical Engineering, 2006, 13 (04): : 379 - 387
  • [42] A Hybrid Quantum-behaved Particle Swarm Optimization Algorithm for Clustering Analysis
    Lu Kezhong
    Fang Kangnian
    Me Guangqian
    FIFTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 1, PROCEEDINGS, 2008, : 21 - 25
  • [43] The Buttressed Walls Problem: An Application of a Hybrid Clustering Particle Swarm Optimization Algorithm
    Garcia, Jose
    Marti, Jose, V
    Yepes, Victor
    MATHEMATICS, 2020, 8 (06)
  • [44] An improved discrete particle swarm optimization algorithm
    Liu, QingFeng
    Lecture Notes in Electrical Engineering, 2013, 219 LNEE (VOL. 4): : 883 - 890
  • [45] A New Discrete Particle Swarm Optimization Algorithm
    Strasser, Shane
    Goodman, Rollie
    Sheppard, John
    Butcher, Stephyn
    GECCO'16: PROCEEDINGS OF THE 2016 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2016, : 53 - 60
  • [46] Study on Discrete Particle Swarm Optimization Algorithm
    Wang Beizhan
    Deng Xiang
    Ye, Weichuan
    Wei, Haifang
    ADVANCES IN MANUFACTURING TECHNOLOGY, PTS 1-4, 2012, 220-223 : 1787 - 1794
  • [47] Survey of discrete particle swarm optimization algorithm
    Shen, Lin-Cheng
    Huo, Xiao-Hua
    Niu, Yi-Feng
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2008, 30 (10): : 1986 - 1990
  • [48] A discrete-time switched linear model of the particle swarm optimization algorithm
    Zhang, Haopeng
    SWARM AND EVOLUTIONARY COMPUTATION, 2020, 52
  • [49] FCM fuzzy clustering image segmentation algorithm based on fractional particle swarm optimization
    Zhang, Le
    Wang, Jinsong
    An, Zhiyong
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 38 (04) : 3575 - 3584
  • [50] Dynamic particle swarm optimization and K-means clustering algorithm for image segmentation
    Li, Haiyang
    He, Hongzhou
    Wen, Yongge
    OPTIK, 2015, 126 (24): : 4817 - 4822