Projects:2017s1-103 Improving Usability and User Interaction with KALDI Open-Source Speech Recogniser
Project Team
Students
- George Mao
- Vinil Chukkapally
Supervisors
- Said Al-Sarawi
- Ahmad Hashemi-Sakhtsari (DSTG)
Introduction
What is KALDI?
KALDI is a free, open-source software toolkit for automatic speech recognition. It is designed for speech recognition researchers [1], so operating it requires both speech recognition knowledge and familiarity with scripting. This makes it difficult to use for anyone without that background.
Aim of the Project
The aim of this project is to let users access KALDI's functionality without knowledge of scripting in a language such as Bash, and without detailed knowledge of KALDI's internal algorithms. In addition, the project will attempt to transcribe live speech audio continuously.
References
[1] Kaldi, "About the Kaldi project." [Online]. Available: http://kaldi-asr.org/doc/about.html [Accessed: 13 March 2017]