Multi-feature based Chinese-English Named Entity Extraction from comparable corpora

被引:0
|
作者
Lu, Min [1 ]
Zhao, Jun [1 ]
机构
[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100080, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Bilingual Named Entity Extraction is,important to some cross language information processes such as machine translation (MT), cross-lingual information retrieval (CLIR), etc. A lot of previous work extracted bilingual Named Entities from parallel corpus. Here we propose a multi-feature based method to extract bilingual Named Entities from comparable corpus. We first recognize the, Chinese and English Named Entities respectively from the Chinese and English part of the comparable corpus. Then all the feature scores are calculated for every possible pair of Chinese and English Named Entities. At last we combine these feature scores together and decide which pairs are mutual translations. For translation score calculation, we didn't use the formula of IBM model I like previous approach. In stead, we used a modified edit distance to take the order of words into consideration. Experiment shows that-the F-score of this method increased by 11%. And with the multi-feature integration strategy encouraging results are obtained. http://www.aclweb.org/anthology/Y06-1018
引用
收藏
页码:134 / 141
页数:8
相关论文
共 50 条
  • [31] Keyword Extraction Based on Multi-feature Fusion for Chinese Web Pages
    He, Qi
    Hao, Hong-Wei
    Yin, Xu-Cheng
    PROCEEDINGS OF THE 2011 2ND INTERNATIONAL CONGRESS ON COMPUTER APPLICATIONS AND COMPUTATIONAL SCIENCE, VOL 1, 2012, 144 : 119 - 124
  • [32] French-English terminology extraction from comparable corpora
    Daille, B
    Morin, E
    NATURAL LANGUAGE PROCESSING - IJCNLP 2005, PROCEEDINGS, 2005, 3651 : 707 - 718
  • [33] Extracting Historical Terms Based on Aligned Chinese-English Parallel Corpora
    Li, Xiuying
    Che, Chao
    Han, Limin
    Liu, Xiaoxia
    IEEE NLP-KE 2009: PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, 2009, : 296 - 301
  • [34] Multi-feature fusion named entity recognition method for grape knowledge graph construction
    Nie X.
    Zhang L.
    Niu D.
    Wu H.
    Zhu H.
    Zhang H.
    Nongye Gongcheng Xuebao/Transactions of the Chinese Society of Agricultural Engineering, 2024, 40 (03): : 201 - 210
  • [35] Tibetan-Chinese Cross Language Named Entity Extraction Based on Comparable Corpus and Naturally Annotated Resources
    Sun, Yuan
    Guo, Wenbin
    Zhao, Xiaobing
    2014 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DATA MINING (CIDM), 2014, : 288 - 295
  • [36] Chinese Relation Extraction Based on Cross-Attention and Multi-feature Perception
    Xu, Shiao
    Sun, Shuihua
    Zhang, Zhiyuan
    Zhou, Huan
    Journal of Network Intelligence, 2024, 9 (03): : 1837 - 1853
  • [37] EEG FEATURE EXTRACTION AND RECOGNITION BASED ON MULTI-FEATURE FUSION
    Sun, Jian
    Wu, Quanyu
    Gao, Nan
    Pan, Lingjiao
    Tao, Weige
    BIOMEDICAL ENGINEERING-APPLICATIONS BASIS COMMUNICATIONS, 2024, 36 (06):
  • [38] Multi-feature Fusion for Relation Extraction using Entity Types and Word Dependencies
    Zhang, Pu
    Li, Junwei
    Chen, Sixing
    Zhang, Jingyu
    Tang, Libo
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (07) : 275 - 285
  • [39] Chinese named entity recognition method based on multiscale feature fusion
    Jiang, Xiaoguang
    INTERNATIONAL JOURNAL OF BIOMETRICS, 2024, 16 (3-4) : 337 - 349
  • [40] Recognition of the agricultural named entities with multi-feature fusion based on BERT
    Zhao P.
    Zhao C.
    Wu H.
    Wang W.
    Nongye Gongcheng Xuebao/Transactions of the Chinese Society of Agricultural Engineering, 2022, 38 (03): : 112 - 118