Projects:2017s1-103 Improving Usability and User Interaction with KALDI Open- Source Speech Recogniser

From Projects
Revision as of 15:11, 2 October 2017 by A1669391 (talk | contribs) (Introduction)
Jump to: navigation, search

Project Team

Students

  • George Mao
  • Vinil Chukkapally

Supervisors

  • Said Al-Sarawi
  • Ahmad Hashemi-Sakhtsari (DSTG)

Introduction

What is KALDI?

KALDI is a free and open-source software toolkit for automatic speech recognition. It is designed for speech recognition researchers [1], and so requires speech recognition knowledge and familiarity with scripting to operate. As such, it is difficult for those without such knowledge or familiarity to use.

Aim of the Project

The aim of this project is to enable users to access functionalities of KALDI without the knowledge of scripting, a language like Bash, or detailed knowledge of the internal algorithms of KALDI. Furthermore attempts will be made to transcribe live audio speech continuously.

References

[1] Kaldi, "About the Kaldi project." [Online]. Available: http://kaldi-asr.org/doc/about.html [Accessed: 13 March 2017]