OneBitPitch (OBP): Ultra-High-Speed Pitch Detection Algorithm Based on One-Bit Quantization and Modified Autocorrelation

被引:0
作者
Coccoluto, Davide [1 ]
Cesarini, Valerio [1 ]
Costantini, Giovanni [1 ]
机构
[1] Univ Roma Tor Vergata, Dept Elect Engn, I-00133 Rome, Italy
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 14期
关键词
pitch detection; F0; algorithm; auto-tune; audio signal processing; ESTIMATOR;
D O I
10.3390/app13148191
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Featured Application Fast pitch detection algorithm for the real-time estimation of the fundamental frequency, optimized for hardware implementation. This paper presents a novel, high-speed, and low-complexity algorithm for pitch (F0) detection, along with a new dataset for testing and a comparison of some of the most effective existing techniques. The algorithm, called OneBitPitch (OBP), is based on a modified autocorrelation function applied to a single-bit signal for fast computation. The focus is explicitly on speed for real-time pitch detection applications in pitch detection. A testing procedure is proposed using a proprietary synthetic dataset (SYNTHPITCH) against three of the most widely used algorithms: YIN, SWIPE (Sawtooth Inspired Pitch Estimator) and NLS (Nonlinear-Least Squares-based). The results show how OBP is 9 times faster than the fastest of its alternatives, and 50 times faster than a gold standard like SWIPE, with a mean elapsed time of 4.6 ms, or 0.046 x realtime. OBP is slightly less accurate for high-precision landmarks and noisy signals, but its performance in terms of acceptable error (<2%) is comparable to YIN and SWIPE. NLS emerges as the most accurate, but it is not flexible, being dependent on the input and requiring prior setup. OBP shows to be robust to octave errors while providing acceptable accuracies at ultra-high speeds, with a building nature suited for FPGA (Field-Programmable Gate Array) implementations.
引用
收藏
页数:20
相关论文
共 48 条
[1]   Machine learning- and statistical-based voice analysis of Parkinson?s disease patients: A survey [J].
Amato, Federica ;
Saggio, Giovanni ;
Cesarini, Valerio ;
Olmo, Gabriella ;
Costantini, Giovanni .
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 219
[2]  
[Anonymous], 2019, IEEE STD 754 2019, P1, DOI [DOI 10.1109/IEEESTD.2008.4610935, 10.1109/IEEESTD.2019.8766229]
[3]  
[Anonymous], 1983, PITCH DETERMINATION, DOI DOI 10.1007/978-3-642-81926-1
[4]   Fast Fourier Transform and autocorrelation function for the analysis of complex mass spectra [J].
Apicella, B. ;
Bruno, A. ;
Wang, X. ;
Spinelli, N. .
INTERNATIONAL JOURNAL OF MASS SPECTROMETRY, 2013, 338 :30-38
[5]  
Bharathi V., 2011, Proceedings 2011 International Conference on Signal Processing, Communication, Computing and Networking Technologies (ICSCCN 2011), P724, DOI 10.1109/ICSCCN.2011.6024645
[6]   A sawtooth waveform inspired pitch estimator for speech and music [J].
Camacho, Arturo ;
Harris, John G. .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2008, 124 (03) :1638-1652
[7]   Machine Learning-based Study of Dysphonic Voices for the Identification and Differentiation of Vocal Cord Paralysis and Vocal Nodules [J].
Cesarini, Valerio ;
Robotti, Carlo ;
Piromalli, Ylenia ;
Mozzanica, Francesco ;
Schindler, Antonio ;
Saggio, Giovanni ;
Costantini, Giovanni .
BIOSIGNALS: PROCEEDINGS OF THE 15TH INTERNATIONAL JOINT CONFERENCE ON BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES - VOL 4: BIOSIGNALS, 2022, :265-272
[8]   Artificial Intelligence-Based Voice Assessment of Patients with Parkinson's Disease Off and On Treatment: Machine vs. Deep-Learning Comparison [J].
Costantini, Giovanni ;
Cesarini, Valerio ;
Di Leo, Pietro ;
Amato, Federica ;
Suppa, Antonio ;
Asci, Francesco ;
Pisani, Antonio ;
Calculli, Alessandra ;
Saggio, Giovanni .
SENSORS, 2023, 23 (04)
[9]   Deep learning and machine learning-based voice analysis for the detection of COVID-19: A proposal and comparison of architectures [J].
Costantini, Giovanni ;
Cesarini, Valerio ;
Robotti, Carlo ;
Benazzo, Marco ;
Pietrantonio, Filomena ;
Di Girolamo, Stefano ;
Pisani, Antonio ;
Canzi, Pietro ;
Mauramati, Simone ;
Bertino, Giulia ;
Cassaniti, Irene ;
Baldanti, Fausto ;
Saggio, Giovanni .
KNOWLEDGE-BASED SYSTEMS, 2022, 253
[10]   The Emotion Probe: On the Universality of Cross-Linguistic and Cross-Gender Speech Emotion Recognition via Machine Learning [J].
Costantini, Giovanni ;
Parada-Cabaleiro, Emilia ;
Casali, Daniele ;
Cesarini, Valerio .
SENSORS, 2022, 22 (07)