Named Entity Recognition for Social Media Texts with Semantic Augmentation

被引:0
|
作者
Nie, Yuyang [1 ]
Tian, Yuanhe [2 ]
Wan, Xiang [3 ]
Yan Song [3 ,4 ]
Bo Dai [1 ]
机构
[1] Univ Elect Sci & Technol China, Chengdu, Peoples R China
[2] Washington Univ, St Louis, MO USA
[3] Shenzhen Res Inst Big Data, Shenzhen, Peoples R China
[4] Chinese Univ Hong Kong Shenzhen, Shenzhen, Peoples R China
来源
PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP) | 2020年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing approaches for named entity recognition suffer from data sparsity problems when conducted on short and informal texts, especially user-generated social media content. Semantic augmentation is a potential way to alleviate this problem. Given that rich semantic information is implicitly preserved in pre-trained word embeddings, they are potential ideal resources for semantic augmentation. In this paper, we propose a neural-based approach to NER for social media texts where both local (from running text) and augmented semantics are taken into account. In particular, we obtain the augmented semantic information from a large-scale corpus, and propose an attentive semantic augmentation module and a gate module to encode and aggregate such information, respectively. Extensive experiments are performed on three benchmark datasets collected from English and Chinese social media platforms, where the results demonstrate the superiority of our approach to previous studies across all three datasets.(1)
引用
收藏
页码:1383 / 1391
页数:9
相关论文
共 50 条
  • [1] Improving Named Entity Recognition for Social Media with Data Augmentation
    Liu, Wenzhong
    Cui, Xiaohui
    APPLIED SCIENCES-BASEL, 2023, 13 (09):
  • [2] Simple Semantic-based Data Augmentation for Named Entity Recognition in Biomedical Texts
    Phan, Uyen T. P.
    Nguyen, Nhung T. H.
    PROCEEDINGS OF THE 21ST WORKSHOP ON BIOMEDICAL LANGUAGE PROCESSING (BIONLP 2022), 2022, : 123 - 129
  • [3] Weakly labeled data augmentation for social media named entity recognition
    Kim, Juae
    Kim, Yejin
    Kang, Sangwoo
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 209
  • [4] Mining Adverse Drug Reactions in Social Media with Named Entity Recognition and Semantic Methods
    Chen, Xiaoyi
    Deldossi, Myrtille
    Aboukhamis, Rim
    Faviez, Carole
    Dahamna, Badisse
    Karapetiantz, Pierre
    Guenegou-Arnoux, Armelle
    Girardeau, Yannick
    Guillemin-Lanne, Sylvie
    Lillo-Le-Louet, Agnes
    Texier, Nathalie
    Burgun, Anita
    Katsahian, Sandrine
    MEDINFO 2017: PRECISION HEALTHCARE THROUGH INFORMATICS, 2017, 245 : 322 - 326
  • [5] A method for named entity recognition in social media texts with syntactically enhanced multiscale feature fusion
    Li, Yuhan
    Zhou, Yang
    Hu, Xiaofei
    Li, Qingxiang
    Tian, Jiali
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [6] Grounded Multimodal Named Entity Recognition on Social Media
    Yu, Jianfei
    Li, Ziyan
    Wang, Jieming
    Xia, Rui
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 9141 - 9154
  • [7] Named Entity Recognition for Code Mixed Social Media Sentences
    Sharma, Yashvardhan
    Bhargava, Rupal
    Tadikonda, Bapiraju Vamsi
    INTERNATIONAL JOURNAL OF SOFTWARE SCIENCE AND COMPUTATIONAL INTELLIGENCE-IJSSCI, 2021, 13 (02): : 23 - 36
  • [8] Named Entity Recognition Experiments on Turkish Texts
    Kuecuek, Dilek
    Yazici, Adnan
    FLEXIBLE QUERY ANSWERING SYSTEMS: 8TH INTERNATIONAL CONFERENCE, FQAS 2009, 2009, 5822 : 524 - 535
  • [9] A recent survey of Arabic named entity recognition on social media
    Ali B.A.B.
    Mihi S.
    Bazi I.E.
    Laachfoubi N.
    Revue d'Intelligence Artificielle, 2020, 34 (02) : 125 - 135
  • [10] Hybrid Framework for Named Entity Recognition in Turkish Social Media
    Yilmaz, Selim F.
    Balaban, Ismail
    Tekin, Selim F.
    Kozat, Suleyman S.
    2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,