Zero-Shot Text Classification with Semantically Extended Graph Convolutional Network

Cited by: 10

Authors:

Liu, Tengfei ^{[1]}

Hu, Yongli ^{[1]}

Gao, Junbin ^{[2]}

Sun, Yanfeng ^{[1]}

Yin, Baocai ^{[1,3]}

Affiliations:

[1] Beijing Univ Technol, Fac Informat Technol, Beijing Inst Artificial Intelligence, Beijing Key Lab Multimedia & Intelligent Software, Beijing 100124, Peoples R China

[2] Univ Sydney, Business Sch, Discipline Business Analyt, Sydney, NSW, Australia

[3] Dalian Univ Technol, Faculty Elect Informat & Elect Engn, Dalian, Peoples R China

Source:

2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR) | 2021

Funding:

National Natural Science Foundation of China

DOI:

10.1109/ICPR48806.2021.9411914

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Subject classification codes:

081104; 0812; 0835; 1405

Abstract:

As a challenging task in Natural Language Processing (NLP), zero-shot text classification has attracted increasing attention recently. It aims to detect classes that the model has never seen in the training set. A feasible approach is to construct connections between the seen and unseen classes by semantic extension and to classify the unseen classes by propagating information over those connections. Although many zero-shot text classification methods have been proposed, how to realize semantic extension properly and propagate information effectively remains far from solved. In this paper, we propose a novel zero-shot text classification method called Semantically Extended Graph Convolutional Network (SEGCN). In the proposed method, semantic category knowledge from ConceptNet is used for semantic extension, linking seen classes to unseen classes and constructing a graph of all categories. We then build upon the Graph Convolutional Network (GCN) to predict the textual classifier for each category; the network transfers category knowledge through convolution operators on the constructed graph and is trained in a semi-supervised manner using the samples of the seen classes. Experimental results on the DBpedia and 20 Newsgroups datasets show that our method outperforms state-of-the-art zero-shot text classification methods.
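
The idea of predicting one classifier per category by graph convolution can be illustrated with a minimal sketch. This is not the authors' implementation: the toy graph, the embedding sizes, the random weights, and the function names `normalize_adjacency` and `gcn_predict_classifiers` are all hypothetical, and the two-layer form with symmetric normalization is the standard GCN propagation rule, assumed here because the abstract does not give the exact architecture. Training (fitting the weights using samples of the seen classes) is omitted.

```python
import numpy as np


def normalize_adjacency(adj):
    """Return D^{-1/2} (A + I) D^{-1/2}, the standard GCN propagation matrix."""
    a_hat = adj + np.eye(adj.shape[0])      # add self-loops
    deg = a_hat.sum(axis=1)                 # node degrees including self-loop
    d_inv_sqrt = 1.0 / np.sqrt(deg)
    return a_hat * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]


def gcn_predict_classifiers(adj, class_emb, w0, w1):
    """Two-layer GCN forward pass: one predicted classifier vector per category."""
    a_norm = normalize_adjacency(adj)
    hidden = np.maximum(a_norm @ class_emb @ w0, 0.0)   # ReLU
    return a_norm @ hidden @ w1


rng = np.random.default_rng(0)

# Hypothetical category graph: nodes 0 and 1 are seen classes, 2 and 3 unseen.
# In the paper the edges would come from ConceptNet; here they are made up.
adj = np.array([[0, 1, 1, 0],
                [1, 0, 0, 1],
                [1, 0, 0, 1],
                [0, 1, 1, 0]], dtype=float)

class_emb = rng.normal(size=(4, 8))         # assumed 8-dim category embeddings
w0 = rng.normal(size=(8, 16)) * 0.1         # untrained weights, illustration only
w1 = rng.normal(size=(16, 5)) * 0.1

classifiers = gcn_predict_classifiers(adj, class_emb, w0, w1)   # shape (4, 5)

# Score an assumed 5-dim document feature against every category classifier,
# including the unseen ones, and pick the highest-scoring category.
doc_feature = rng.normal(size=5)
scores = classifiers @ doc_feature
predicted_class = int(np.argmax(scores))
```

Because the unseen categories share edges with seen ones, their predicted classifiers are built from propagated neighbor information even though no training samples exist for them.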

Pages: 8352-8359

Page count: 8
