A study on word vector dimensions for sentence classifications using convolutional neural networks

被引:0
作者
Takuya S. [1 ]
Satoshi Y. [2 ]
机构
[1] Graduate School of Information and Computer Science, Chiba Institute of Technology, 2-17-1, Tsudanuma, Narashino, Chiba
[2] Dept. of Computer Science, Chiba Institute of Technology, 2-17-1, Tsudanuma, Narashino, Chiba
关键词
Convolutional neural networks; Distributed representation of words; Natural language processing; Sentence classification; Word2vec;
D O I
10.1541/ieejeiss.139.1066
中图分类号
学科分类号
摘要
Recently, convolutional neural networks (CNNs) have achieved remarkable results on sentence classification problems. In these approaches, each word in the sentences is transformed to real number vectors (called word vectors) and the sentences as input data to the CNN are represented by the sequences of the word vectors. A dataset for training and testing for the CNN includes the large number of words, therefore the word vectors are embedded so high-dimentional space. As a result of this, the input data space of the CNN becomes very high. When the input data have high dimension, much training data are required for enough training of the CNN. It is not always possible, however, to get enough number of data for training. If the enough data cannot prepare for learning, it is desirable to decrease the dimension of input data. This paper shows the results that the smaller dimensional word vectors are applied to sentence classifications by CNNs. The results have shown that some dimensionality reduction does not effect too much to the accuracy of the sentence classifications by CNNs. © 2019 The Institute of Electrical Engineers of Japan.
引用
收藏
页码:1066 / 1079
页数:13
相关论文
共 50 条
[31]   A Radiolarian Classifier using Convolutional Neural Networks [J].
Dionisio, Abigaile ;
Solano, Geoffrey ;
Quisote, Micah ;
Marquez, Edanjarlo .
2020 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND SIGNAL PROCESSING (AISP), 2020,
[32]   Personality Recognition Using Convolutional Neural Networks [J].
Gimenez, Maite ;
Paredes, Roberto ;
Rosso, Paolo .
COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, CICLING 2017, PT II, 2018, 10762 :313-323
[33]   Using Convolutional Neural Networks for Plant Classification [J].
Razavi, Salar ;
Yalcin, Hulya .
2017 25TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2017,
[34]   Wheel Classification Using Convolutional Neural Networks [J].
Nie, Yuncong ;
Xia, Siyu ;
Wu, Yu .
PROCEEDINGS 2018 33RD YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION (YAC), 2018, :515-520
[35]   Using Convolutional Neural Networks for Emoticon Classification [J].
Burnik, K. ;
Knezevic, D. Bjelobrk .
2019 42ND INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2019, :1614-1618
[36]   Recognition of flowers using convolutional neural networks [J].
Alkhonin, Abdulrahman ;
Almutairi, Abdulelah ;
Alburaidi, Abdulmajeed ;
Saudagar, Abdul Khader Jilani .
INTERNATIONAL JOURNAL OF INTELLIGENT ENGINEERING INFORMATICS, 2020, 8 (03) :186-197
[37]   Optimizing convolutional neural networks on multi-core vector accelerator [J].
Liu, Zhong ;
Xiao, Xin ;
Li, Chen ;
Ma, Sheng ;
Rangyu, Deng .
PARALLEL COMPUTING, 2022, 112
[38]   Homogeneous vector bundles and G-equivariant convolutional neural networks [J].
Aronsson, Jimmy .
SAMPLING THEORY SIGNAL PROCESSING AND DATA ANALYSIS, 2022, 20 (02)
[39]   Sentiment Analysis System using Hybrid Word Embeddings with Convolutional Recurrent Neural Network [J].
Alotaibi, Fahd ;
Gupta, Vishal .
INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2022, 19 (03) :330-335
[40]   Masked Neural Style Transfer using Convolutional Neural Networks [J].
Handa, Arushi ;
Garg, Prerna ;
Khare, Vijay .
2018 INTERNATIONAL CONFERENCE ON RECENT INNOVATIONS IN ELECTRICAL, ELECTRONICS & COMMUNICATION ENGINEERING (ICRIEECE 2018), 2018, :2099-2104