Development of a machine-learning based voice disorder screening tool
被引:16
作者:
Reid, Jonathan
论文数: 0引用数: 0
h-index: 0
机构:
Univ Alberta, Fac Med & Dent, Dept Surg, Div Otolaryngol Head & Neck Surg, Edmonton, AB, CanadaUniv Alberta, Fac Med & Dent, Dept Surg, Div Otolaryngol Head & Neck Surg, Edmonton, AB, Canada
Reid, Jonathan
[1
]
Parmar, Preet
论文数: 0引用数: 0
h-index: 0
机构:
Univ Alberta, Fac Sci, Dept Phys, Edmonton, AB, CanadaUniv Alberta, Fac Med & Dent, Dept Surg, Div Otolaryngol Head & Neck Surg, Edmonton, AB, Canada
Parmar, Preet
[2
]
Lund, Tyler
论文数: 0引用数: 0
h-index: 0
机构:
Univ Alberta, Fac Engn, Edmonton, AB, CanadaUniv Alberta, Fac Med & Dent, Dept Surg, Div Otolaryngol Head & Neck Surg, Edmonton, AB, Canada
Lund, Tyler
[3
]
Aalto, Daniel K.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Alberta, Fac Rehabil Med, Commun Sci & Disorders, Edmonton, AB, CanadaUniv Alberta, Fac Med & Dent, Dept Surg, Div Otolaryngol Head & Neck Surg, Edmonton, AB, Canada
Aalto, Daniel K.
[4
]
Jeffery, Caroline C.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Alberta, Fac Med & Dent, Dept Surg, Div Otolaryngol Head & Neck Surg, Edmonton, AB, Canada
Univ Alberta, Fac Rehabil Med, Commun Sci & Disorders, Edmonton, AB, CanadaUniv Alberta, Fac Med & Dent, Dept Surg, Div Otolaryngol Head & Neck Surg, Edmonton, AB, Canada
Jeffery, Caroline C.
[1
,4
]
机构:
[1] Univ Alberta, Fac Med & Dent, Dept Surg, Div Otolaryngol Head & Neck Surg, Edmonton, AB, Canada
[2] Univ Alberta, Fac Sci, Dept Phys, Edmonton, AB, Canada
Objective: Early recognition and referral are crucial for voice disorder management. Limited availability of subspecialists, poor primary care awareness, and the need for specialized equipment impede effective care. Thus, there is a need for a tool to improve voice pathology screening. Machine learning algorithms (MLAs) have shown promise in analyzing acoustic characteristics of phonation. However, few studies report clinical applications of MLAs for voice pathology detection. The objective of this study was to design and validate a MLA for detecting pathological voices.Methods: A MLA was developed for voice analysis. Audio samples converted into spectrograms were inputted into a pre-existing VGG19 convolutional neural network (CNN) and image-classifier. The resulting feature map was classified as either pathological or healthy using a Support Vector Machine (SVM) binary linear classifier. This combined MLA was "trained" with 950 sustained "/i/" vowel audio samples from the Saarbrucken Voice Database (SVD), which contains subjects with and without voice disorders. The trained MLA was "tested" with 406 SVD samples to determine sensitivity, specificity, and overall accuracy. External validation of the MLA was performed using clinical voice samples collected from patients attending a subspecialty voice clinic.Results: The MLA detected pathologies in SVD samples with 98.5% sensitivity, 97.1% specificity and 97.8% overall accuracy. In 30 samples obtained prospectively from voice clinic patients, the MLA detected pathologies with 100% sensitivity, 96.3% specificity and 96.7% overall accuracy.Conclusions: This study demonstrates that a MLA using a simple audio input can detect diverse vocal pathologies with high sensitivity and specificity. Thus, this algorithm shows promise as a potential screening tool.
机构:
King Saud Univ, Coll Comp & Informat Sci, Dept Comp Engn, Digital Speech Proc Grp, Riyadh 11543, Saudi ArabiaKing Saud Univ, Coll Comp & Informat Sci, Dept Comp Engn, Digital Speech Proc Grp, Riyadh 11543, Saudi Arabia
Muhammad, Ghulam
Alsulaiman, Mansour
论文数: 0引用数: 0
h-index: 0
机构:
King Saud Univ, Coll Comp & Informat Sci, Dept Comp Engn, Digital Speech Proc Grp, Riyadh 11543, Saudi ArabiaKing Saud Univ, Coll Comp & Informat Sci, Dept Comp Engn, Digital Speech Proc Grp, Riyadh 11543, Saudi Arabia
Alsulaiman, Mansour
论文数: 引用数:
h-index:
机构:
Ali, Zulfiqar
Mesallam, Tamer A.
论文数: 0引用数: 0
h-index: 0
机构:
King Saud Univ, ENT Dept, Coll Med, Riyadh, Saudi Arabia
King Saud Univ, Res Chair Voice Swallowing & Commun Disorders, Riyadh, Saudi Arabia
Al Menoufiya Univ, Coll Med, ENT Dept, Shebin Alkoum, EgyptKing Saud Univ, Coll Comp & Informat Sci, Dept Comp Engn, Digital Speech Proc Grp, Riyadh 11543, Saudi Arabia
Mesallam, Tamer A.
论文数: 引用数:
h-index:
机构:
Farahat, Mohamed
Malki, Khalid H.
论文数: 0引用数: 0
h-index: 0
机构:
King Saud Univ, ENT Dept, Coll Med, Riyadh, Saudi Arabia
King Saud Univ, Res Chair Voice Swallowing & Commun Disorders, Riyadh, Saudi ArabiaKing Saud Univ, Coll Comp & Informat Sci, Dept Comp Engn, Digital Speech Proc Grp, Riyadh 11543, Saudi Arabia
机构:
King Saud Univ, Coll Comp & Informat Sci, Digital Speech Proc Grp, Riyadh, Saudi ArabiaKing Saud Univ, Coll Comp & Informat Sci, Digital Speech Proc Grp, Riyadh, Saudi Arabia
Alsulaiman, Mansour
Elamvazuthi, Irraivan
论文数: 0引用数: 0
h-index: 0
机构:
Univ Teknol PETRONAS, CISIR, Dept Elect & Elect Engn, Perak, MalaysiaKing Saud Univ, Coll Comp & Informat Sci, Digital Speech Proc Grp, Riyadh, Saudi Arabia
Elamvazuthi, Irraivan
Muhammad, Ghulam
论文数: 0引用数: 0
h-index: 0
机构:
King Saud Univ, Coll Comp & Informat Sci, Digital Speech Proc Grp, Riyadh, Saudi ArabiaKing Saud Univ, Coll Comp & Informat Sci, Digital Speech Proc Grp, Riyadh, Saudi Arabia
Muhammad, Ghulam
Mesallam, Tamer A.
论文数: 0引用数: 0
h-index: 0
机构:
King Saud Univ, Coll Med, ENT Dept, Riyadh 11461, Saudi ArabiaKing Saud Univ, Coll Comp & Informat Sci, Digital Speech Proc Grp, Riyadh, Saudi Arabia
Mesallam, Tamer A.
Farahat, Mohamed
论文数: 0引用数: 0
h-index: 0
机构:
King Saud Univ, Coll Med, ENT Dept, Riyadh 11461, Saudi ArabiaKing Saud Univ, Coll Comp & Informat Sci, Digital Speech Proc Grp, Riyadh, Saudi Arabia
Farahat, Mohamed
Malki, Khalid H.
论文数: 0引用数: 0
h-index: 0
机构:
King Saud Univ, Coll Med, ENT Dept, Riyadh 11461, Saudi ArabiaKing Saud Univ, Coll Comp & Informat Sci, Digital Speech Proc Grp, Riyadh, Saudi Arabia
机构:
King Saud Univ, Coll Comp & Informat Sci, Dept Comp Engn, Digital Speech Proc Grp, Riyadh 11543, Saudi ArabiaKing Saud Univ, Coll Comp & Informat Sci, Dept Comp Engn, Digital Speech Proc Grp, Riyadh 11543, Saudi Arabia
Muhammad, Ghulam
Alsulaiman, Mansour
论文数: 0引用数: 0
h-index: 0
机构:
King Saud Univ, Coll Comp & Informat Sci, Dept Comp Engn, Digital Speech Proc Grp, Riyadh 11543, Saudi ArabiaKing Saud Univ, Coll Comp & Informat Sci, Dept Comp Engn, Digital Speech Proc Grp, Riyadh 11543, Saudi Arabia
Alsulaiman, Mansour
论文数: 引用数:
h-index:
机构:
Ali, Zulfiqar
Mesallam, Tamer A.
论文数: 0引用数: 0
h-index: 0
机构:
King Saud Univ, ENT Dept, Coll Med, Riyadh, Saudi Arabia
King Saud Univ, Res Chair Voice Swallowing & Commun Disorders, Riyadh, Saudi Arabia
Al Menoufiya Univ, Coll Med, ENT Dept, Shebin Alkoum, EgyptKing Saud Univ, Coll Comp & Informat Sci, Dept Comp Engn, Digital Speech Proc Grp, Riyadh 11543, Saudi Arabia
Mesallam, Tamer A.
论文数: 引用数:
h-index:
机构:
Farahat, Mohamed
Malki, Khalid H.
论文数: 0引用数: 0
h-index: 0
机构:
King Saud Univ, ENT Dept, Coll Med, Riyadh, Saudi Arabia
King Saud Univ, Res Chair Voice Swallowing & Commun Disorders, Riyadh, Saudi ArabiaKing Saud Univ, Coll Comp & Informat Sci, Dept Comp Engn, Digital Speech Proc Grp, Riyadh 11543, Saudi Arabia
机构:
King Saud Univ, Coll Comp & Informat Sci, Digital Speech Proc Grp, Riyadh, Saudi ArabiaKing Saud Univ, Coll Comp & Informat Sci, Digital Speech Proc Grp, Riyadh, Saudi Arabia
Alsulaiman, Mansour
Elamvazuthi, Irraivan
论文数: 0引用数: 0
h-index: 0
机构:
Univ Teknol PETRONAS, CISIR, Dept Elect & Elect Engn, Perak, MalaysiaKing Saud Univ, Coll Comp & Informat Sci, Digital Speech Proc Grp, Riyadh, Saudi Arabia
Elamvazuthi, Irraivan
Muhammad, Ghulam
论文数: 0引用数: 0
h-index: 0
机构:
King Saud Univ, Coll Comp & Informat Sci, Digital Speech Proc Grp, Riyadh, Saudi ArabiaKing Saud Univ, Coll Comp & Informat Sci, Digital Speech Proc Grp, Riyadh, Saudi Arabia
Muhammad, Ghulam
Mesallam, Tamer A.
论文数: 0引用数: 0
h-index: 0
机构:
King Saud Univ, Coll Med, ENT Dept, Riyadh 11461, Saudi ArabiaKing Saud Univ, Coll Comp & Informat Sci, Digital Speech Proc Grp, Riyadh, Saudi Arabia
Mesallam, Tamer A.
Farahat, Mohamed
论文数: 0引用数: 0
h-index: 0
机构:
King Saud Univ, Coll Med, ENT Dept, Riyadh 11461, Saudi ArabiaKing Saud Univ, Coll Comp & Informat Sci, Digital Speech Proc Grp, Riyadh, Saudi Arabia
Farahat, Mohamed
Malki, Khalid H.
论文数: 0引用数: 0
h-index: 0
机构:
King Saud Univ, Coll Med, ENT Dept, Riyadh 11461, Saudi ArabiaKing Saud Univ, Coll Comp & Informat Sci, Digital Speech Proc Grp, Riyadh, Saudi Arabia