Deep Learning Speech Synthesis Model for Word/Character-Level Recognition in the Tamil Language

Cited by: 0

Authors:

Rajendran, Sukumar [1]

Raja, Kiruba Thangam [2]

Nagarajan, G. [3]

Dass, A. Stephen [2]

Kumar, M. Sandeep [2]

Jayagopal, Prabhu [2]

Affiliations:

[1] VIT Bhopal Univ, Sch Comp Sci & Engn, Indore Highway Kothrikalan, Bhopal, India

[2] Vellore Inst Technol, Sch Informat Technol & Engn, Vellore, India

[3] Panimalar Engn Coll, Dept Math, Chennai, India

Source:

INTERNATIONAL JOURNAL OF E-COLLABORATION | 2023 / Vol. 19 / Issue 04

Keywords:

Deep Learning; Language; Modeling; Tamil Speech; Visualization;

DOI:

10.4018/IJeC.316824

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812 ;

Abstract:

With the widespread use of electronic devices and the growing popularity of social media, text data is being created at unprecedented rates. Humans cannot read all of this data to find out what is being discussed in their areas of interest. Topic modeling is a way to identify subjects in a large collection of texts. It has been studied extensively for English. By contrast, there has been little progress for resource-scarce languages such as Tamil, even though Tamil is spoken by millions of people worldwide. The results of deep learning models are usually difficult for the typical user to interpret. The authors use various visualization techniques to represent the outcomes of deep learning in a meaningful way, and then evaluate the deep learning models with metrics such as similarity, correlation, perplexity, and coherence.

References

21 entries in total

[1] Speech emotion recognition: Emotional models, databases, features, preprocessing methods, supporting modalities, and classifiers [J].
Akcay, Mehmet Berkehan; Oguz, Kaya.
SPEECH COMMUNICATION, 2020, 116 (116): 56-76

[2] Recognition of emotion from speech using evolutionary cepstral coefficients [J].
Bakhshi, Ali; Chalup, Stephan; Harimi, Ali; Mirhassani, Seyed Mostafa.
MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (47-48): 35739-35759

[3] WordSeg: Standardizing unsupervised word form segmentation from text [J].
Bernard, Mathieu; Thiolliere, Roland; Saksida, Amanda; Loukatou, Georgia R.; Larsen, Elin; Johnson, Mark; Fibla, Laia; Dupoux, Emmanuel; Daland, Robert; Cao, Xuan Nga; Cristia, Alejandrina.
BEHAVIOR RESEARCH METHODS, 2020, 52 (01): 264-278

[4] Multimodal speech emotion recognition and classification using convolutional neural network techniques [J].
Christy, A.; Vaithyasubramanian, S.; Jesudoss, A.; Praveena, M. D. Anto.
INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2020, 23 (02): 381-388

[5] Gaonkar Radhika Shamsunder, 2019, Ph.D. Dissertation

[6] Grave E, 2018, arXiv, DOI 10.48550/arXiv.1802.06893 (arXiv:1802.06893)

[7] He F, 2020, PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), P6494

[8] Pattern recognition and features selection for speech emotion recognition model using deep learning [J].
Jermsittiparsert, Kittisak; Abdurrahman, Abdurrahman; Siriattakul, Parinya; Sundeeva, Ludmila A.; Hashim, Wahidah; Rahim, Robbi; Maseleno, Andino.
INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2020, 23 (04): 799-806

[9] Joulin A, 2016, arXiv, DOI 10.48550/arXiv.1612.03651 (arXiv:1612.03651)

[10] Joulin A, 2016, arXiv, arXiv:1607.01759
