Difference between revisions of "Projects:2017s1-103 Improving Usability and User Interaction with KALDI Open-Source Speech Recogniser"

Revision as of 15:11, 2 October 2017

Project Team

Students

  • George Mao
  • Vinil Chukkapally

Supervisors

  • Said Al-Sarawi
  • Ahmad Hashemi-Sakhtsari (DSTG)

Introduction

What is KALDI?

KALDI is a free, open-source software toolkit for automatic speech recognition. It is designed for speech recognition researchers [1], so operating it requires speech recognition knowledge and familiarity with scripting. This makes it difficult to use for anyone without that background.

Aim of the Project

The aim of this project is to let users access KALDI's functionality without knowledge of a scripting language such as Bash or detailed knowledge of KALDI's internal algorithms. The project will also attempt to transcribe live speech continuously.

References

[1] Kaldi, "About the Kaldi project." [Online]. Available: http://kaldi-asr.org/doc/about.html [Accessed: 13 March 2017]