Improvement of Vietnamese noun phrases chunking in text-based person image search

被引:0
|
作者
Khanh-Toan Luong [1 ]
Thi-Hoai Phan [1 ,2 ]
Thi-Ngoc-Diep Do [1 ,2 ]
Thi-Lan Le [1 ,2 ]
机构
[1] Hanoi Univ Sci & Technol, Sch Elect & Elect Engn SEEE, Hanoi, Vietnam
[2] Hanoi Univ Sci & Technol, SigM Lab, SEEE, Hanoi, Vietnam
来源
2024 IEEE TENTH INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND ELECTRONICS, ICCE 2024 | 2024年
关键词
Vietnamese noun phrase chunking; CRF; Person Search;
D O I
10.1109/ICCE62051.2024.10634596
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Natural language processing is a crucial and beneficial task in various multimedia processing applications. Text-based person search (TBPS) application involves finding person images in an image gallery using an input as a text sentence. Previous studies have shown that noun phrases is one of the most important components in person description sentence as users employ noun phrases to describe the person characteristics. However, in recent TBPS the noun phrase chunking mainly bases on some set of rules. Low quality noun phrase chunking may lead to irrelevant results of text-based person search. In this paper, a method for Vietnamese noun phrase chunking named VNPC (Vietnamese Noun Phrase Chunking) is proposed. The method is based on Conditional Random Fields (CRFs) model and it is improved with a task-dependent post processing ruler to be integrated in Vietnamese text-based person image search framework. Compared to the Vietnamese Noun Phrase Chunking based on Conditional Random Fields [14] using only simple CRFs, which achieved an average recall and precision of 82.72% and 82.62% respectively, we achieved better results in chunking (with the same dataset splitting ratio). Experimental results show that the proposed method allows to obtain a high quality noun phrase chunking performance with 88.02%, 90.09%, and 89.01% of Precision, Recall, F1 score. And it helps to improve the person search results in TBPS by 1.325%, 0.675% and 0.25%, of accuracies at the top-1, top-5 at the top-10 respectively.
引用
收藏
页码:381 / 386
页数:6
相关论文
共 11 条
  • [11] Text-to-Image Person Re-Identification Based on Multimodal Graph Convolutional Network
    Han, Guang
    Lin, Min
    Li, Ziyang
    Zhao, Haitao
    Kwong, Sam
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 6025 - 6036