Multi-feature based Chinese-English Named Entity Extraction from comparable corpora

Authors
Lu, Min [1]
Zhao, Jun [1]
Affiliation
[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100080, Peoples R China
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Bilingual Named Entity Extraction is important to cross-language information processing tasks such as machine translation (MT) and cross-lingual information retrieval (CLIR). Much previous work extracted bilingual Named Entities from parallel corpora. Here we propose a multi-feature based method to extract bilingual Named Entities from comparable corpora. We first recognize the Chinese and English Named Entities separately in the Chinese and English parts of the comparable corpus. Feature scores are then calculated for every possible pair of Chinese and English Named Entities. Finally, we combine these feature scores and decide which pairs are mutual translations. For the translation score, we did not use the IBM Model 1 formula as previous approaches did. Instead, we used a modified edit distance that takes word order into consideration. Experiments show that the F-score of this method increased by 11%, and that the multi-feature integration strategy yields encouraging results. http://www.aclweb.org/anthology/Y06-1018
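The abstract does not give the modified edit distance or the integration formula, so the following is only a minimal sketch of the general idea under stated assumptions: a plain word-level Levenshtein distance stands in for the paper's modified edit distance, the feature scores are combined by a weighted sum, and every name here (`word_edit_distance`, `order_aware_score`, `combine`, the feature names and weights) is hypothetical rather than taken from the paper.

```python
from typing import Dict, Sequence


def word_edit_distance(a: Sequence[str], b: Sequence[str]) -> int:
    """Levenshtein distance over words instead of characters.

    Assumption: a stand-in for the paper's modified edit distance,
    whose exact definition is not stated in the abstract.
    """
    prev = list(range(len(b) + 1))
    for i, word_a in enumerate(a, start=1):
        curr = [i]
        for j, word_b in enumerate(b, start=1):
            cost = 0 if word_a == word_b else 1
            curr.append(min(prev[j] + 1,          # delete word_a
                            curr[j - 1] + 1,      # insert word_b
                            prev[j - 1] + cost))  # match or substitute
        prev = curr
    return prev[-1]


def order_aware_score(gloss: Sequence[str], english: Sequence[str]) -> float:
    """Map the distance to [0, 1]; 1.0 means identical word sequences."""
    longest = max(len(gloss), len(english))
    if longest == 0:
        return 1.0
    return 1.0 - word_edit_distance(gloss, english) / longest


def combine(scores: Dict[str, float], weights: Dict[str, float]) -> float:
    """Weighted sum of feature scores (hypothetical integration rule)."""
    return sum(weights[name] * value for name, value in scores.items())


# Word-by-word gloss of a Chinese entity vs. an English candidate.
same_order = order_aware_score(["bank", "of", "china"], ["bank", "of", "china"])
reordered = order_aware_score(["china", "bank"], ["bank", "of", "china"])
```

In this sketch the reordered gloss scores lower than the identical one, even though both share words with the candidate, which is the sense in which an edit distance is sensitive to word order where a bag-of-words score is not.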
Pages: 134-141
Number of pages: 8