A Hybrid Approach For Word Segmentation

被引:0
|
作者
Mohammed, Ammar [1 ,2 ]
Karam, Mohamed [3 ]
Hefny, Hesham [3 ]
机构
[1] Arab East Coll, Dept Comp Sci, Riyadh, Saudi Arabia
[2] Cairo Univ, Dept Comp Sci, ISSR, Giza, Egypt
[3] Cairo Univ, Inst Stat Studies & Res, Dept Comp Sci, Giza, Egypt
关键词
Word segmentation; Word statistics; Maximum matching; Hybrid methods; RECOGNITION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic word segmentation is the process of finding the most likely sequence of words from a sequence of characters without spaces. The central issues of the word segmentation process are the complexity and accuracy. This paper proposes a hybrid method for automatic word segmentation depending on a dictionary based approach, word-statistics and the length of the word. In comparison to the word segmentation using Maximum Length Descending Frequency and Entropy Rate method, the paper shows that the proposed method gives a better accuracy.
引用
收藏
页码:232 / 238
页数:7
相关论文
共 50 条
  • [1] A Hybrid Approach to Vietnamese Word Segmentation
    Tuan-Phong Nguyen
    Anh-Cuong Le
    2016 IEEE RIVF INTERNATIONAL CONFERENCE ON COMPUTING & COMMUNICATION TECHNOLOGIES, RESEARCH, INNOVATION, AND VISION FOR THE FUTURE (RIVF), 2016, : 114 - 119
  • [2] A Hybrid Approach to Chinese Word Segmentation
    Chen, Bing
    Tai, Xiaoying
    2009 INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION SYSTEMS AND APPLICATIONS, PROCEEDINGS, 2009, : 154 - 158
  • [3] A Hybrid Approach to Word Segmentation of Vietnamese Texts
    Phuong, Le Hong
    Nguyen Thi Minh Huyen
    Roussanaly, Azim
    Ho Tuong Vinh
    LANGUAGE AND AUTOMATA THEORY AND APPLICATIONS, 2008, 5196 : 240 - +
  • [4] A hybrid approach to word segmentation and POS tagging
    Oki Electric Industry Co., Ltd., 2−5−7 Honmachi, Chuo-ku, Osaka
    541−0053, Japan
    不详
    619−0289, Japan
    Proc. Annu. Meet. Assoc. Comput Linguist., 1600, (217-220):
  • [5] A Hybrid Approach for Thai Word Segmentation with Crowdsourcing Feedback System
    Chaonithi, Kriangkrai
    Prom-on, Santitham
    2016 13TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING/ELECTRONICS, COMPUTER, TELECOMMUNICATIONS AND INFORMATION TECHNOLOGY (ECTI-CON), 2016,
  • [6] A hybrid approach of text segmentation based on sensitive word concept for NLP
    Ren, F
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2001, 2004 : 375 - 388
  • [7] A Hybrid Approach to Vietnamese Word Segmentation using Part of Speech tags
    Dang Due Pham
    Giang Binh Tran
    Son Bao Pham
    INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SYSTEMS ENGINEERING (KSE 2009), 2009, : 154 - 161
  • [8] A New Unsupervised Approach to Word Segmentation
    Wang, Hanshi
    Zhu, Jian
    Tang, Shiping
    Fan, Xiaozhong
    COMPUTATIONAL LINGUISTICS, 2011, 37 (03) : 421 - 454
  • [9] A combining approach for Chinese word segmentation
    Aiqing, Wang
    Sen, Zhang
    SNPD 2007: EIGHTH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING, AND PARALLEL/DISTRIBUTED COMPUTING, VOL 3, PROCEEDINGS, 2007, : 738 - +
  • [10] An Improved Unsupervised Approach to Word Segmentation
    WANG Hanshi
    HAN Xuhong
    LIU Lizhen
    SONG Wei
    YUAN Mudan
    中国通信, 2015, 12 (07) : 82 - 95