Projects - User contributions [en]

Projects:2019s2-24101 Improving Usability and User Interaction with KALDI Open- Source Speech Recogniser

2019-10-06T17:48:53Z

A1758286:

'''Introduction'''
This project aims to reduce the word error rate of audio transcription with KALDI. KALDI transcribes audio using various language and acoustic models.
Previous work on reducing the word error rate focused on improving the language and acoustic models. Our project focuses instead on improving the quality of the input signal through audio processing with the HARK software. HARK is based on missing feature theory and removes noise using various algorithms.
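Word error rate is commonly computed as the word-level edit distance between a reference transcript and the recogniser output, divided by the number of words in the reference. The following is a minimal Python sketch of that definition; the function name and the example sentences are illustrative assumptions and are not taken from KALDI or from this project's code.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # (before the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (ref_word != hyp_word)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # Hypothetical example: one substitution and one deletion over six words.
    wer = word_error_rate("the cat sat on the mat", "the cat sit on mat")
    print(f"{wer:.2f}")  # prints 0.33
```

A lower value means a better transcript, so any gain from cleaning the input signal would show up as a drop in this number for the same reference text.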
----
'''Background'''
Speech transcription is the process by which computer hardware and software transform an audio input into a text output.
Speech transcription makes voice recordings and other audio material more accessible and easier to understand across many different use cases.

----
'''Project students'''
Riya Parth Dube
Pengyue Song
----
'''Supervisors'''
Dr Said Al-Sarawi

Dr Ahmad Hashemi-Sakhtsari (DSTG)
----
'''Method'''
----

'''Results'''
----

'''Conclusion'''
----

'''References'''
1 The HARK Documentation available online at https://www.hark.jp/document/3.0.0/hark-document-en/sect0002.html
2 The KALDI Decoder online available at https://kaldi-asr.org/
3 D. Povey, “KALDI – Home”, KALDI, n.d. [Online] Available at: http://kaldi-asr.org/ [Accessed: September 2019].

Projects:2019s2-24101 Improving Usability and User Interaction with KALDI Open- Source Speech Recogniser

2019-10-06T17:48:10Z

A1758286:


Projects:2019s2-24101 Improving Usability and User Interaction with KALDI Open- Source Speech Recogniser

2019-10-06T17:47:52Z

A1758286:


Projects:2019s2-24101 Improving Usability and User Interaction with KALDI Open- Source Speech Recogniser

2019-10-06T17:47:16Z

A1758286:


Projects:2019s2-24101 Improving Usability and User Interaction with KALDI Open- Source Speech Recogniser

2019-10-06T17:36:50Z

A1758286:


Projects:2019s2-24101 Improving Usability and User Interaction with KALDI Open- Source Speech Recogniser

2019-10-06T17:36:25Z

A1758286:


Projects:2019s2-24101 Improving Usability and User Interaction with KALDI Open- Source Speech Recogniser

2019-10-06T17:34:10Z

A1758286:


Projects:2019s2-24101 Improving Usability and User Interaction with KALDI Open- Source Speech Recogniser

2019-10-06T17:33:25Z

A1758286:


Projects:2019s2-24101 Improving Usability and User Interaction with KALDI Open- Source Speech Recogniser

2019-10-06T17:32:33Z

A1758286:


Projects:2019s2-24101 Improving Usability and User Interaction with KALDI Open- Source Speech Recogniser

2019-10-06T17:31:24Z

A1758286:


Projects talk:2019s2-24101 Improving Usability and User Interaction with KALDI Open- Source Speech Recogniser

2019-10-06T17:13:45Z

A1758286: Blanked the page

Projects talk:2019s2-24101 Improving Usability and User Interaction with KALDI Open- Source Speech Recogniser

2019-10-06T17:13:23Z

A1758286: /* PROJECT AIM AND MOTIVATION. */

To enable users to access the functionality of KALDI (http://kaldi.sourceforge.net/about.html) without knowledge of a scripting language such as Bash or detailed knowledge of KALDI's internal algorithms. Furthermore, attempts will be made to transcribe live audio speech continuously.
Project Proposals: The proposal consists of two parts. The first part focuses on improving usability and user interaction with KALDI through a GUI that has the following features:
• A soft ON/OFF switch for the microphone.
• Minimal scripting knowledge or commands needed to operate.
• The ability for users to select acoustic and language models of their choice, either by selecting one of the pre-trained models or by training their own acoustic and language models for subsequent use.
• A choice between transcribing continuous live speech input and transcribing recorded audio. Recording the speaker's audio during live input allows it to be played back in order to correct errors in the transcript.
• Isolation of Utterance/Speaker ID and Speaker ID/Utterance pairs from the decoded results for later analysis of the recognition performance of each user. This process also allows a plain transcript, free from labels and indices, to be produced for each user.
• A facility whereby users can improve their recognition performance with KALDI through user-adaptive training, i.e. by saving changes to their acoustic model after each decoding session.
The second part is reporting the project outcomes through:
• Documenting the developed graphical user interface design and functionality for KALDI, including the processes for selecting acoustic and language models and incorporating online decoding features.
• Documenting the results of evaluation studies related to the usability of the new GUI design.
• Presenting the work to interested staff in the Intelligence Analytics Branch of DST Group.
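
The feature that isolates utterance/speaker pairs and produces plain per-user transcripts can be sketched roughly as follows. This assumes the decoded results are available as text lines of the form "utterance-id word word ..." and that a Kaldi-style utt2spk mapping ("utterance-id speaker-id" per line) exists; the function name, the "unknown" fallback label, and the sample data are hypothetical and only illustrate the idea.

```python
from collections import defaultdict


def plain_transcripts_by_speaker(decoded_lines, utt2spk_lines):
    """Group decoded utterances by speaker and strip the utterance IDs."""
    # Build the utterance-to-speaker lookup from "utt_id spk_id" lines.
    utt2spk = {}
    for line in utt2spk_lines:
        parts = line.split()
        if len(parts) == 2:
            utt2spk[parts[0]] = parts[1]

    # Collect the label-free text of each utterance under its speaker.
    by_speaker = defaultdict(list)
    for line in decoded_lines:
        parts = line.split()
        if not parts:
            continue  # skip empty lines
        utt_id, words = parts[0], parts[1:]
        speaker = utt2spk.get(utt_id, "unknown")  # assumed fallback label
        by_speaker[speaker].append(" ".join(words))
    return dict(by_speaker)


if __name__ == "__main__":
    # Hypothetical sample data for illustration only.
    decoded = [
        "spk1-utt1 hello world",
        "spk2-utt1 good morning",
        "spk1-utt2 how are you",
    ]
    mapping = [
        "spk1-utt1 spk1",
        "spk2-utt1 spk2",
        "spk1-utt2 spk1",
    ]
    for speaker, sentences in plain_transcripts_by_speaker(decoded, mapping).items():
        print(speaker, sentences)
```

Keeping the grouping step separate from the decoder means the same per-speaker output could feed both the plain transcript view and a later per-user performance analysis.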

Projects talk:2019s2-24101 Improving Usability and User Interaction with KALDI Open- Source Speech Recogniser

2019-10-06T17:13:06Z

A1758286: /* SUPERVISORS: */

Projects:2019s2-24101 Improving Usability and User Interaction with KALDI Open- Source Speech Recogniser

2019-10-06T17:10:56Z

A1758286:

Aim
To improve user interaction with KALDI systems.
To improve audio transcription quality, as measured by the word accuracy rate.
To interface the KALDI decoder with HARK and implement a neural network with the KALDI decoder.
----
Motivation
To create an open-source environment for audio transcription using KALDI.
----
SUPERVISORS:
----
SAS: Dr Said Al-Sarawi

DSTG: Dr Hashemi-Sakhtsari

Projects:2019s2-24101 Improving Usability and User Interaction with KALDI Open- Source Speech Recogniser

2019-10-06T17:09:15Z

A1758286:


Projects:2019s2-24101 Improving Usability and User Interaction with KALDI Open- Source Speech Recogniser

2019-10-06T17:08:04Z

A1758286: Created page with "Aim To improve the user interact ability of KALDI systems. To improve the audio transcription quality by text to word accuracy rate. Interfacing KALDI decoder to implement Neu..."


Projects talk:2019s2-24101 Improving Usability and User Interaction with KALDI Open- Source Speech Recogniser

2019-10-06T17:04:17Z

A1758286: /* PROJECT AIM AND MOTIVATION. */ new section

Projects talk:2019s2-24101 Improving Usability and User Interaction with KALDI Open- Source Speech Recogniser

2019-10-06T17:02:04Z

A1758286: /* SUPERVISORS: */

Projects talk:2019s2-24101 Improving Usability and User Interaction with KALDI Open- Source Speech Recogniser

2019-10-06T16:59:28Z

A1758286: /* SUPERVISORS: */ new section

Projects talk:2019s2-24101 Improving Usability and User Interaction with KALDI Open- Source Speech Recogniser

2019-10-06T16:57:03Z

A1758286: Created page with "To enable users to access functionalities of KALDI (http://kaldi.sourceforge.net/about.html) without the knowledge of scripting, a language like Bash, or detailed knowledge of..."