共 195 条
[1]
Al-Ghezi R., Getman Y., Voskoboinik E., Singh M., Kurimo M., Automatic rating of spontaneous speech for low-resource languages, 2022 IEEE spoken language technology workshop, pp. 339-345, (2023)
[2]
Alam S., Sushmit A., Abdullah Z., Nakkhatra S., Ansary M., Hossen S.M., Et al., Bengali common voice speech dataset for automatic speech recognition, (2022)
[3]
Aldarmaki H., Ullah A., Ram S., Zaki N., Unsupervised automatic speech recognition: A review, Speech Communication, (2022)
[4]
Alharbi S., Alrazgan M., Alrashed A., Alnomasi T., Almojel R., Alharbi R., Et al., Automatic speech recognition: Systematic literature review, IEEE Access, 9, pp. 131858-131876, (2021)
[5]
Amodei D., Ananthanarayanan S., Anubhai R., Bai J., Battenberg E., Case C., Et al., Deep speech 2: End-to-end speech recognition in English and Mandarin, International conference on machine learning, pp. 173-182, (2016)
[6]
An K., Xiang H., Ou Z., CAT: A CTC-CRF based ASR toolkit bridging the hybrid and the end-to-end approaches towards data efficiency and low latency, (2020)
[7]
Anastasopoulos A., Bojar O., Bremerman J., Et al., (2021)
[8]
Anoop K., Pratik M., Pushpak B., Et al., (2018)
[9]
Ansari E., Axelrod A., Bach N., Bojar O., Cattoni R., Dalvi F., Et al., (2020)
[10]
Baevski A., Hsu W.-N., Xu Q., Babu A., Gu J., Auli M., Data2vec: A general framework for self-supervised learning in speech, vision and language, International conference on machine learning, pp. 1298-1312, (2022)