Difference between revisions of "Projects:2017s1-103 Improving Usability and User Interaction with KALDI Open-Source Speech Recogniser"
Revision as of 15:11, 2 October 2017
Project Team
Students
- George Mao
- Vinil Chukkapally
Supervisors
- Said Al-Sarawi
- Ahmad Hashemi-Sakhtsari (DSTG)
Introduction
What is KALDI?
KALDI is a free, open-source software toolkit for automatic speech recognition. It is designed for speech recognition researchers [1], so operating it requires knowledge of speech recognition and familiarity with scripting. This makes it difficult to use for anyone without that background.
Aim of the Project
The aim of this project is to let users access KALDI's functionality without knowledge of scripting in a language such as Bash, or detailed knowledge of KALDI's internal algorithms. In addition, the project will attempt to transcribe live speech audio continuously.
References
[1] Kaldi, "About the Kaldi project." [Online]. Available: http://kaldi-asr.org/doc/about.html [Accessed: 13 March 2017]