Abstract
Development of fully featured Automatic Speech Recognition (ASR) systems for a complete language vocabulary generally requires large data repositories, massive computing power, and a stable digital network infrastructure. These conditions are not met in the case of many indigenous languages. Based on our research for over a decade in West Africa, we present a lightweight and downscaled approach to AI-based ASR and describe a set of associated experiments. The aim is to produce a variety of limited-vocabulary ASRs as a basis for the development of practically useful (mobile and radio) voice-based information services that fit needs, preferences and knowledge of local rural communities.
Original language | English |
---|---|
Title of host publication | WebSci '22 |
Subtitle of host publication | [Proceedings] 14th ACM Web Science Conference 2022 |
Publisher | Association for Computing Machinery |
Pages | 451-458 |
Number of pages | 8 |
ISBN (Electronic) | 9781450391917 |
DOIs | |
Publication status | Published - Jun 2022 |
Event | 14th ACM Web Science Conference, WebSci 2022 - Virtual, Online, Spain Duration: 26 Jun 2022 → 29 Jun 2022 |
Publication series
Name | ACM International Conference Proceeding Series |
---|
Conference
Conference | 14th ACM Web Science Conference, WebSci 2022 |
---|---|
Country/Territory | Spain |
City | Virtual, Online |
Period | 26/06/22 → 29/06/22 |
Bibliographical note
Publisher Copyright:© 2022 ACM.
Keywords
- automatic speech recognition
- low resource environments
- machine learning
- neural networks
- under-resourced/indigenous languages
- voice-based technologies