Telugu Lyrics Based Classification By Using Naive Bayes

被引:2
作者
Aziz, Abdul Md [1 ]
Ravula, Ganga Raju [1 ]
Shaik, Mobeen Taj [1 ]
Potharaju, Sravanthi [1 ]
机构
[1] RGUKT APIIIT RK Valley, CSE Dept, Kadapa, India
来源
2020 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND SIGNAL PROCESSING (AISP) | 2020年
关键词
Text classification; Lyrics classification; Naive Bayes classifier; Feature selection; Text mining; Machine Learning;
D O I
10.1109/aisp48273.2020.9073597
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The main objective of this research is to predict the song category by using a Naive Bayes classifier. This type of research was previously done for Telugu songs using both audio and lyrical features. In that research they have addressed the lyrics as a whole, beginning, and the ending of the song. When they have given the whole song to model the accuracies are low, while they used both Naive Bayes and SVM for classification. Now this paper describing how Naive Bayes itself only holds the classification results more accurate with the whole song given as input to the model. This paper mainly holds the concepts of data pre-processing, feature extraction, and text classification to evaluate the model. The dataset consists of lyrics collected from four different genres, such as Melody, Sad, Rainy, and Pelli (marriage). This proposed method performs classification and calculates accuracies for the given dataset. The final accuracy obtained for this model is 92.3% using the Naive Bayes classifier.
引用
收藏
页数:6
相关论文
共 17 条
  • [1] Abburi Harika, 2016, Multimodal sentiment analysis of telugu songs, P48
  • [2] Adam Sadovsky, 2006, PREVIOUS CS224N FINA
  • [3] [Anonymous], 2018, AUTOMATED TEXT CLASS
  • [4] Buzic D, 2018, 2018 41ST INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), P1011, DOI 10.23919/MIPRO.2018.8400185
  • [5] Caraffini F., 2019, NAIVE BAYES LEARNING
  • [6] Choi K, 2014, ACM-IEEE J CONF DIG, P453, DOI 10.1109/JCDL.2014.6970221
  • [7] Natural language processing
    Chowdhury, GG
    [J]. ANNUAL REVIEW OF INFORMATION SCIENCE AND TECHNOLOGY, 2003, 37 : 51 - 89
  • [8] Dokkara S. R. S., INT J COMPUTER APPL, V165
  • [9] Han J, 2012, MOR KAUF D, P1
  • [10] THE MEANING AND USE OF THE AREA UNDER A RECEIVER OPERATING CHARACTERISTIC (ROC) CURVE
    HANLEY, JA
    MCNEIL, BJ
    [J]. RADIOLOGY, 1982, 143 (01) : 29 - 36