Knowledge Management Framework Over Low Resource Indian Colloquial Language Audio Contents

被引:0
作者
Srivatsa, Sharath [1 ]
Aparna, M. [1 ]
Madhavan, G. Sai [1 ]
Srinivasa, Srinath [1 ]
机构
[1] Int Inst Informat Technol, Bangalore, Karnataka, India
来源
PROCEEDINGS OF 7TH JOINT INTERNATIONAL CONFERENCE ON DATA SCIENCE AND MANAGEMENT OF DATA, CODS-COMAD 2024 | 2024年
关键词
community knowledge management; low-resource languages; automatic speech recognition; fuzzy n-gram search;
D O I
10.1145/3632410.3632483
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce Graama-Kannada Audio Search, a knowledge management framework for low-resource Indian colloquial language audio content. Rural social communities often contain rich knowledge, but it can often be inaccessible on the internet as it is primarily oral. Organizations such as Namma Halli Radio(1) have collected an audio corpus of a few hours containing community interactions spoken in colloquial language. A simple and efficient platform to store, organize, and query knowledge in such audio can greatly help even for people with low literacy levels. In this work, we fine-tune state-of-the-art Automatic Speech Recognition (ASR) models using limited audio data to reduce the Word Error Rate (WER) for colloquial audio data to acquire transcripts for the audio, followed by creating an interface to search for keywords using simple fuzzy matching technique for n-gram inputs.
引用
收藏
页码:553 / 557
页数:5
相关论文
共 19 条
  • [1] [Anonymous], IARPA Babel Program
  • [2] Baevski A., 2021, Advances in Neural Information Processing Systems, V34, P27826
  • [3] Baevski A, 2020, ADV NEUR IN, V33
  • [4] Baskerville R., 2006, Knowledge Management Research & Practice, V4, P83, DOI 10.1057/palgrave.kmrp.8500090
  • [5] Chan W., 2021, PREPRINT, DOI DOI 10.48550/ARXIV.2104.02133
  • [6] Chen G., 2021, arXiv
  • [7] Galvez Daniel, 2021, P NEURAL INFORM PROC, V1
  • [8] He F, 2020, PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), P6494
  • [9] Hsiao R, 2013, 2013 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), P440, DOI 10.1109/ASRU.2013.6707770
  • [10] LEVENSHT.VI, 1965, DOKL AKAD NAUK SSSR+, V163, P845