Projects:2017s1-103 Improving Usability and User Interaction with KALDI Open- Source Speech Recogniser

Project Team

Students

George Mao
Vinil Chukkapally

Supervisors

Said Al-Sarawi
Ahmad Hashemi-Sakhtsari (DSTG)

Introduction

What is KALDI?

KALDI is a free and open-source software toolkit for automatic speech recognition. It is designed for speech recognition researchers [1], and so requires speech recognition knowledge and familiarity with scripting to operate. As such, it is difficult for those without such knowledge or familiarity to use.

Aim of the Project

The aim of this project is to enable users to access functionalities of KALDI without the knowledge of scripting, a language like Bash, or detailed knowledge of the internal algorithms of KALDI. Furthermore attempts will be made to transcribe live audio speech continuously.

References

[1] Kaldi, "About the Kaldi project." [Online]. Available: http://kaldi-asr.org/doc/about.html [Accessed: 13 March 2017]

Projects:2017s1-103 Improving Usability and User Interaction with KALDI Open- Source Speech Recogniser

Contents

Project Team

Students

Supervisors

Introduction

What is KALDI?

Aim of the Project

References

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools