Transformer-Based Approach for Solving Mathematical Problems Using Automatic Speech Recognition

被引:0
作者
Grgurevic, Ante [1 ]
Babac, Marina Bagic [1 ]
机构
[1] Univ Zagreb, Fac Elect Engn & Comp, Zagreb 10000, Croatia
关键词
Mathematical models; Transformers; Computer architecture; Computational modeling; Training; Symbols; Deep learning; Data models; Natural language processing; Numerical models; Mathematical reasoning; natural language processing; automatic speech recognition; deep learning; transformer models;
D O I
10.1109/ACCESS.2025.3564121
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we introduce Vox Calculi, a system designed to solve mathematical problems using voice transcriptions. By leveraging state-of-the-art pretrained Automatic Speech Recognition (ASR) models, we accurately transcribe users' voice recordings. Additionally, we develop a specialized mathematical parser that converts natural language mathematical expressions into symbolic representations and numerical values while preserving the remainder of the transcription. We utilize Transformer and TP-Transformer models, trained on DeepMind's Mathematics dataset, to generate an appropriate mathematical answer from the provided input sequence. An in-depth evaluation of both the ASR and Transformer models demonstrates highly satisfactory results. Furthermore, we propose potential future improvements to enhance the system's performance.
引用
收藏
页码:79845 / 79859
页数:15
相关论文
共 40 条
[1]  
Alvermann D, 2019, Word Error Rate & Character Error Rate-How to Evaluate a Model
[2]  
[Anonymous], 2023, About us
[3]  
[Anonymous], 2011, P 28 INT C INT C MAC, DOI DOI 10.5555/3104482.3104610
[4]  
Baevski A, 2020, ADV NEUR IN, V33
[5]  
Bahdanau D, 2016, Arxiv, DOI arXiv:1409.0473
[6]  
Becker C., 2020, Introduction to Transfer Learning for NLP
[7]   Detecting Deception Using Natural Language Processing and Machine Learning in Datasets on COVID-19 and Climate Change [J].
Brzic, Barbara ;
Boticki, Ivica ;
Bagic Babac, Marina .
ALGORITHMS, 2023, 16 (05)
[8]  
Cho K., 2014, P 2014 C EMP METH NA, P1724, DOI 10.3115/v1/d14-1179
[9]  
Cobbe K, 2021, Arxiv, DOI [arXiv:2110.14168, DOI 10.48550/ARXIV.2110.14168]
[10]  
Conneau A, 2020, Arxiv, DOI arXiv:2006.13979