Development of the Speech-to-Text Chatbot Interface Based on Google API

被引：0

作者：

Shakhovska, Nataliya ^{[1
]}

Basystiuk, Oleh ^{[1
]}

Shakhovska, Khrystyna ^{[1
]}

机构：

[1] Lviv Polytech Natl Univ, UA-79013 Lvov, Ukraine

来源：

MOMLET&DS-2019: MODERN MACHINE LEARNING TECHNOLOGIES AND DATA SCIENCE | 2019年 / 2386卷

关键词：

natural language processing; speech-to-text; Google API; !text type='Python']Python[!/text; Flask; chatbot; hashing; time complexity; prefix-function;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The paper describes possibilities, which are provided by open APIs, and how to use them for creating unified interfaces using the example of our bot based on Google API. In last decade AI technologies became widespread and easy to implement and use. One of the most perspective technology in the AI field is speech recognition as part of natural language processing. New speech recognition technologies and methods will become a central part of future life because they save a lot of communication time, replacing common texting with voice/audio. In addition, this paper explores the advantages and disadvantages of well- known chatbots. The method of their improvement is built. The algorithms of Rabin-Karp and Knut-Pratt are used. The time complexity of proposed algorithm is compared with existed one.

引用

收藏

页码：212 / 221

页数：10

相关论文

共 50 条

[31] Improving End-to-End Speech-to-Text Translation With Document-Level Context [J].

Tian, Xinyu ;

Wei, Haoran ;

Gong, Zhengxian ;

Li, Junhui ;

Xie, Jun .

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2025, 33 :2098-2109

[32] Domain Adaptation Speech-to-Text for Low-Resource European Portuguese Using Deep Learning [J].

Medeiros, Eduardo ;

Corado, Leonel ;

Rato, Luis ;

Quaresma, Paulo ;

Salgueiro, Pedro .

FUTURE INTERNET, 2023, 15 (05)

[33] End-to-end Jordanian dialect speech-to-text self-supervised learning framework [J].

Safieh, Ali A. ;

Abu Alhaol, Ibrahim ;

Ghnemat, Rawan .

FRONTIERS IN ROBOTICS AND AI, 2022, 9

[34] Novel Defense Method against Audio Adversarial Example for Speech-to-Text Transcription Neural Networks [J].

Tamura, Keiichi ;

Omagari, Akitada ;

Hashida, Shuichi .

2019 IEEE 11TH INTERNATIONAL WORKSHOP ON COMPUTATIONAL INTELLIGENCE AND APPLICATIONS (IWCIA 2019), 2019, :115-120

[35] A Friendly Speech User Interface based on Google Cloud Platform to Access a Tourism Semantic Website [J].

Boza-Quispe, Gustavo ;

Montalvan-Figueroa, Juan ;

Puente-Mansilla, Fabricio ;

Rosales-Huamani, Jimmy .

2017 CHILEAN CONFERENCE ON ELECTRICAL, ELECTRONICS ENGINEERING, INFORMATION AND COMMUNICATION TECHNOLOGIES (CHILECON), 2017,

[36] Speech-to-text intervention to support text production among students with writing difficulties: a single-case study in nordic countries [J].

Baeck, Gunilla Almgren ;

Mossige, Margunn ;

Svendsen, Helle Bundgaard ;

Ronneberg, Vibeke ;

Selenius, Heidi ;

Gottsche, Nina Berg ;

Dolmer, Grete ;

Faelth, Linda ;

Nilsson, Staffan ;

Svensson, Idor .

DISABILITY AND REHABILITATION-ASSISTIVE TECHNOLOGY, 2024, 19 (08) :3110-3129

[37] Multimodal Error Correction for Speech-to-Text in a Mobile Office Automated Vehicle: Results From a Remote Study [J].

Schartmueller, Clemens ;

Riener, Andreas .

IUI'22: 27TH INTERNATIONAL CONFERENCE ON INTELLIGENT USER INTERFACES, 2022, :496-505

[38] Use of Speech-to-Text Translation Resources to Address Communication Barriers in Patients With Hearing Loss: A Systematic Review [J].

Ferraro, Tatiana ;

Samaha, Nadia L. ;

Tannan, Utkarsh ;

Sookram, Sebastian ;

Wong, Kevin ;

Hwa, Tiffany Peng .

OTOLOGY & NEUROTOLOGY, 2024, 45 (09) :961-970

[39] MLLP-VRAIN Spanish ASR Systems for the Albayzin-RTVE 2020 Speech-to-Text Challenge: Extension [J].

Baquero-Arnal, Pau ;

Jorge, Javier ;

Gimenez, Adria ;

Iranzo-Sanchez, Javier ;

Perez, Alejandro ;

Garces Diaz-Munio, Goncal Vicent ;

Silvestre-Cerda, Joan Albert ;

Civera, Jorge ;

Sanchis, Albert ;

Juan, Alfons .

APPLIED SCIENCES-BASEL, 2022, 12 (02)

[40] GenEn-MNER: Enhancing Nested Chinese NER With Multimodal Fusion and Alignment via Speech-to-Text Generation [J].

Ning, Jinzhong ;

Sun, Yuanyuan ;

Yang, Zhihao ;

Wang, Zhijun ;

Luo, Ling ;

Lin, Hongfei ;

Zhang, Yijia .

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2025, 33 :1628-1640

← 1 2 3 4 5 →