Improving Adaptive Learning Models Using Prosodic Speech Features

Cited by: 0

Authors:

Wilschut, Thomas ^{[1]}

Sense, Florian ^{[2]}

Scharenborg, Odette ^{[3]}

van Rijn, Hedderik ^{[1]}

Affiliations:

[1] Univ Groningen, Dept Expt Psychol, Groningen, Netherlands

[2] InfiniteTactics LLC, Beavercreek, OH USA

[3] Delft Univ Technol, Dept Multimedia & Comp, Delft, Netherlands

Source:

ARTIFICIAL INTELLIGENCE IN EDUCATION, AIED 2023 | 2023 / Vol. 13916

Keywords:

Adaptive Learning; Cognitive Modeling; Automatic Speech Recognition; Machine Learning; Speech Prosody; Pitch; Speaking Speed; Intensity

DOI:

10.1007/978-3-031-36272-9_21

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Cognitive models of memory retrieval aim to describe human learning and forgetting over time. Such models have been successfully applied in digital systems that aid in memorizing information by adapting to the needs of individual learners. The memory models used in these systems typically measure the accuracy and latency of typed retrieval attempts. However, recent advances in speech technology have led to the development of learning systems that allow for spoken inputs. Here, we explore the possibility of improving a cognitive model of memory retrieval by using information present in speech signals during spoken retrieval attempts. We asked 44 participants to study vocabulary items by spoken rehearsal, and automatically extracted high-level prosodic speech features (patterns of stress and intonation), such as pitch dynamics, speaking speed, and intensity, from over 7,000 utterances. We demonstrate that some prosodic speech features are associated with accuracy and response latency for retrieval attempts, and that speech-feature-informed memory models make better predictions of future performance than models that use only accuracy and response latency. Our results have theoretical relevance, as they show how memory strength is reflected in a specific speech signature. They also have important practical implications, as they contribute to the development of memory models for spoken retrieval that have numerous real-world applications.

Pages: 255-266

Page count: 12
