Multi-feature based Chinese-English Named Entity Extraction from comparable corpora

被引:0
|
作者
Lu, Min [1 ]
Zhao, Jun [1 ]
机构
[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100080, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Bilingual Named Entity Extraction is,important to some cross language information processes such as machine translation (MT), cross-lingual information retrieval (CLIR), etc. A lot of previous work extracted bilingual Named Entities from parallel corpus. Here we propose a multi-feature based method to extract bilingual Named Entities from comparable corpus. We first recognize the, Chinese and English Named Entities respectively from the Chinese and English part of the comparable corpus. Then all the feature scores are calculated for every possible pair of Chinese and English Named Entities. At last we combine these feature scores together and decide which pairs are mutual translations. For translation score calculation, we didn't use the formula of IBM model I like previous approach. In stead, we used a modified edit distance to take the order of words into consideration. Experiment shows that-the F-score of this method increased by 11%. And with the multi-feature integration strategy encouraging results are obtained. http://www.aclweb.org/anthology/Y06-1018
引用
收藏
页码:134 / 141
页数:8
相关论文
共 50 条
  • [21] Chinese Named Entity Recognition Based on BERT and Lightweight Feature Extraction Model
    Yang, Ruisen
    Gan, Yong
    Zhang, Chenfang
    INFORMATION, 2022, 13 (11)
  • [22] Weakly Supervised Named Entity Transliteration and Discovery from Multilingual Comparable Corpora
    Klementiev, Alexandre
    Roth, Dan
    COLING/ACL 2006, VOLS 1 AND 2, PROCEEDINGS OF THE CONFERENCE, 2006, : 817 - 824
  • [23] Generating Chinese named entity data from parallel corpora
    Fu, Ruiji
    Qin, Bing
    Liu, Ting
    FRONTIERS OF COMPUTER SCIENCE, 2014, 8 (04) : 629 - 641
  • [24] Generating Chinese named entity data from parallel corpora
    Ruiji Fu
    Bing Qin
    Ting Liu
    Frontiers of Computer Science, 2014, 8 : 629 - 641
  • [25] Chinese Clinical Named Entity Recognition Using Multi-Feature Fusion and Multi-Scale Local Context Enhancement
    Li, Meijing
    Huang, Runqing
    Qi, Xianxian
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 80 (02): : 2283 - 2299
  • [26] An Entity Relation Extraction Method Based on Dynamic Context and Multi-Feature Fusion
    Ma, Xiaolin
    Wu, Kaiqi
    Kuang, Hailan
    Liu, Xinhua
    APPLIED SCIENCES-BASEL, 2022, 12 (03):
  • [27] Research on multi-feature fusion entity relation extraction based on deep learning
    Xu, Shiao
    Sun, Shuihua
    Zhang, Zhiyuan
    Xu, Fan
    INTERNATIONAL JOURNAL OF AD HOC AND UBIQUITOUS COMPUTING, 2022, 39 (1-2) : 93 - 104
  • [28] Research of Chinese Entity Recognition Model Based on Multi-Feature Semantic Enhancement
    Yuan, Ling
    Zeng, Chenglong
    Pan, Peng
    ELECTRONICS, 2024, 13 (24):
  • [29] Joint entity and relation extraction with fusion of multi-feature semantics
    Wang, Ting
    Yang, Wenjie
    Wu, Tao
    Yang, Chuan
    Liang, Jiaying
    Wang, Hongyang
    Li, Jia
    Xiang, Dong
    Zhou, Zheng
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2024, : 21 - 42
  • [30] Research on the Extraction Methods of Translation Equivalence Pairs in the Chinese-English Comparable Corpus
    Zheng, Juan
    BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2019, 124 : 124 - 124