Identification of transliterated foreign words in Hebrew script

被引:0
|
作者
Goldberg, Yoav [1 ]
Elhadad, Michael [1 ]
机构
[1] Ben Gurion Univ Negev, Dept Comp Sci, IL-84105 Beer Sheva, Israel
来源
COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING | 2008年 / 4919卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a loosely-supervised method for context-free identification of transliterated foreign names and borrowed words in Hebrew text. The method is purely statistical and does not require the use of any lexicons or linguistic analysis tool for the source languages (Hebrew, in our case). It also does not require any manually annotated data for training we learn from noisy data acquired by over-generation. We report precision/recall results of 80/82 for a corpus of 4044 unique words, containing 368 foreign words.
引用
收藏
页码:466 / 477
页数:12
相关论文
共 50 条
  • [31] COMPOUND WORDS IN LXX REPRESENTING 2 OR MORE HEBREW WORDS
    TOV, E
    BIBLICA, 1977, 58 (02) : 189 - 212
  • [32] Automatic identification and back-transliteration of foreign words for information retrieval
    Jeong, KS
    Myaeng, SH
    Lee, JS
    Choi, KS
    INFORMATION PROCESSING & MANAGEMENT, 1999, 35 (04) : 523 - 540
  • [33] Script identification: a review
    Bashir R.
    Quadri S.M.K.
    Giri K.J.
    International Journal of Information Technology, 2022, 14 (1) : 459 - 473
  • [34] Automatic identification and back-transliteration of foreign words for information retrieval
    Jeong, Kil Soon
    Myaeng, Sung Hyon
    Lee, Jae Sung
    Choi, Key-Sun
    Information Processing and Management, 1999, 35 (04): : 523 - 540
  • [35] Texture for script identification
    Busch, A
    Boles, WW
    Sridharan, S
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2005, 27 (11) : 1720 - 1732
  • [36] A Note on 'Hebrew-Script Tombstones From Jam, Afghanistan'
    Shaked, Shaul
    JOURNAL OF JEWISH STUDIES, 2010, 61 (02): : 305 - 307
  • [37] Anticipating the use of Hebrew script in the LC/NACO authority file
    Lerner, Heidi
    LIBRARY RESOURCES & TECHNICAL SERVICES, 2006, 50 (04): : 252 - 263
  • [38] Specimens of Mediaeval Hebrew Scripts, vol 3, Ashkenazic Script
    Olszowy-Schlanger, Judith
    JOURNAL OF JEWISH STUDIES, 2018, 69 (02): : 426 - 429
  • [39] ARAMAIC TOMB INSCRIPTION WRITTEN IN PALEO-HEBREW SCRIPT
    NAVEH, J
    ISRAEL EXPLORATION JOURNAL, 1973, 23 (02) : 82 - 91
  • [40] Script Identification of Multi-Script Documents: A Survey
    Ubul, Kurban
    Tursun, Gulzira
    Aysa, Alimjan
    Impedovo, Donato
    Pirlo, Giuseppe
    Yibulayin, Tuergen
    IEEE ACCESS, 2017, 5 : 6546 - 6559