<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://projectswiki.eleceng.adelaide.edu.au/projects/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=A1758286</id>
	<title>Projects - User contributions [en]</title>
	<link rel="self" type="application/atom+xml" href="https://projectswiki.eleceng.adelaide.edu.au/projects/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=A1758286"/>
	<link rel="alternate" type="text/html" href="https://projectswiki.eleceng.adelaide.edu.au/projects/index.php/Special:Contributions/A1758286"/>
	<updated>2026-04-22T13:52:00Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.31.4</generator>
	<entry>
		<id>https://projectswiki.eleceng.adelaide.edu.au/projects/index.php?title=Projects:2019s2-24101_Improving_Usability_and_User_Interaction_with_KALDI_Open-_Source_Speech_Recogniser&amp;diff=13065</id>
		<title>Projects:2019s2-24101 Improving Usability and User Interaction with KALDI Open- Source Speech Recogniser</title>
		<link rel="alternate" type="text/html" href="https://projectswiki.eleceng.adelaide.edu.au/projects/index.php?title=Projects:2019s2-24101_Improving_Usability_and_User_Interaction_with_KALDI_Open-_Source_Speech_Recogniser&amp;diff=13065"/>
		<updated>2019-10-06T17:48:53Z</updated>

		<summary type="html">&lt;p&gt;A1758286: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&amp;#039;&amp;#039;&amp;#039;Introduction&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
This project aims to reduce the word error rate of audio transcription performed by KALDI. KALDI transcribes audio using a choice of language and acoustic models. &lt;br /&gt;
Previous work reduced the word error rate by improving the language and acoustic models. Our project instead focuses on improving the quality of the input signal through audio pre-processing with the HARK software. HARK is based on missing feature theory and reduces noise using several algorithms.&lt;br /&gt;
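As a rough illustration of the metric discussed above, word error rate can be computed as the word-level edit distance between a reference transcript and the recogniser output, divided by the number of reference words. The sketch below is not taken from KALDI or HARK; the function name word_error_rate and the example sentences are assumptions made purely for illustration.&lt;br /&gt;

```python
def word_error_rate(reference, hypothesis):
    """Return (substitutions + deletions + insertions) / number of reference words.

    Hypothetical helper for illustration only; it is not part of KALDI or HARK.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # previous[j] is the edit distance between the reference words seen so far
    # (before the current one) and the first j hypothesis words.
    previous = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        current = [i]  # deleting all i reference words matches an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = previous[j - 1] + (ref_word != hyp_word)
            deletion = previous[j] + 1
            insertion = current[j - 1] + 1
            current.append(min(substitution, deletion, insertion))
        previous = current

    return previous[-1] / len(ref)
```

For example, word_error_rate("the cat sat on the mat", "the cat sit on mat") counts one substitution and one deletion against six reference words, giving 2/6.&lt;br /&gt;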
----&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Background&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
Speech transcription is the process by which computer hardware and software convert audio input into text output.&lt;br /&gt;
It makes voice recordings and other audio material more accessible and easier to understand across many use cases.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Project students&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
Riya Parth Dube&lt;br /&gt;
Pengyue Song&lt;br /&gt;
----&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Supervisors&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
Dr. Said Al-Sarawi&lt;br /&gt;
&lt;br /&gt;
DSTG Dr Ahmad Hashemi-Sakhtsari&lt;br /&gt;
----&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Method&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Results&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Conclusion&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;References&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
1 The HARK Documentation available online at https://www.hark.jp/document/3.0.0/hark-document-en/sect0002.html&lt;br /&gt;
2 The KALDI Decoder online available at https://kaldi-asr.org/&lt;br /&gt;
3 D. Povey, “KALDI – Home”, KALDI, n.d. [Online] Available at: http://kaldi-asr.org/ [Accessed: September 2019].&lt;/div&gt;</summary>
		<author><name>A1758286</name></author>
		
	</entry>
	<entry>
		<id>https://projectswiki.eleceng.adelaide.edu.au/projects/index.php?title=Projects:2019s2-24101_Improving_Usability_and_User_Interaction_with_KALDI_Open-_Source_Speech_Recogniser&amp;diff=13064</id>
		<title>Projects:2019s2-24101 Improving Usability and User Interaction with KALDI Open- Source Speech Recogniser</title>
		<link rel="alternate" type="text/html" href="https://projectswiki.eleceng.adelaide.edu.au/projects/index.php?title=Projects:2019s2-24101_Improving_Usability_and_User_Interaction_with_KALDI_Open-_Source_Speech_Recogniser&amp;diff=13064"/>
		<updated>2019-10-06T17:48:10Z</updated>

		<summary type="html">&lt;p&gt;A1758286: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&amp;#039;&amp;#039;&amp;#039;Introduction&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
This project aims to reduce the word error rate of audio transcription performed by KALDI. KALDI transcribes audio using a choice of language and acoustic models. &lt;br /&gt;
Previous work reduced the word error rate by improving the language and acoustic models. Our project instead focuses on improving the quality of the input signal through audio pre-processing with the HARK software. HARK is based on missing feature theory and reduces noise using several algorithms.&lt;br /&gt;
----&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Background&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
Speech transcription is the process by which computer hardware and software convert audio input into text output.&lt;br /&gt;
It makes voice recordings and other audio material more accessible and easier to understand across many use cases.&lt;br /&gt;
----&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Project students&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
Riya Parth Dube&lt;br /&gt;
Pengyue Song&lt;br /&gt;
----&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Supervisors&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
Dr. Said Al-Sarawi&lt;br /&gt;
&lt;br /&gt;
DSTG Dr Ahmad Hashemi-Sakhtsari&lt;br /&gt;
----&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Method&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Results&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Conclusion&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;References&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
1 The HARK Documentation available online at https://www.hark.jp/document/3.0.0/hark-document-en/sect0002.html&lt;br /&gt;
2 The KALDI Decoder online available at https://kaldi-asr.org/&lt;br /&gt;
3 D. Povey, “KALDI – Home”, KALDI, n.d. [Online] Available at: http://kaldi-asr.org/ [Accessed: September 2019].&lt;/div&gt;</summary>
		<author><name>A1758286</name></author>
		
	</entry>
	<entry>
		<id>https://projectswiki.eleceng.adelaide.edu.au/projects/index.php?title=Projects:2019s2-24101_Improving_Usability_and_User_Interaction_with_KALDI_Open-_Source_Speech_Recogniser&amp;diff=13063</id>
		<title>Projects:2019s2-24101 Improving Usability and User Interaction with KALDI Open- Source Speech Recogniser</title>
		<link rel="alternate" type="text/html" href="https://projectswiki.eleceng.adelaide.edu.au/projects/index.php?title=Projects:2019s2-24101_Improving_Usability_and_User_Interaction_with_KALDI_Open-_Source_Speech_Recogniser&amp;diff=13063"/>
		<updated>2019-10-06T17:47:52Z</updated>

		<summary type="html">&lt;p&gt;A1758286: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&amp;#039;&amp;#039;&amp;#039;Introduction&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
This project aims to reduce the word error rate of audio transcription performed by KALDI. KALDI transcribes audio using a choice of language and acoustic models. &lt;br /&gt;
Previous work reduced the word error rate by improving the language and acoustic models. Our project instead focuses on improving the quality of the input signal through audio pre-processing with the HARK software. HARK is based on missing feature theory and reduces noise using several algorithms.&lt;br /&gt;
----&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Background&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
Speech transcription is the process by which computer hardware and software convert audio input into text output.&lt;br /&gt;
It makes voice recordings and other audio material more accessible and easier to understand across many use cases.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Project students&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
Riya Parth Dube&lt;br /&gt;
Pengyue Song&lt;br /&gt;
----&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Supervisors&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
Dr. Said Al-Sarawi&lt;br /&gt;
&lt;br /&gt;
DSTG Dr Ahmad Hashemi-Sakhtsari&lt;br /&gt;
----&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Method&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Results&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Conclusion&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;References&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
1 The HARK Documentation available online at https://www.hark.jp/document/3.0.0/hark-document-en/sect0002.html&lt;br /&gt;
2 The KALDI Decoder online available at https://kaldi-asr.org/&lt;br /&gt;
3 D. Povey, “KALDI – Home”, KALDI, n.d. [Online] Available at: http://kaldi-asr.org/ [Accessed: September 2019].&lt;/div&gt;</summary>
		<author><name>A1758286</name></author>
		
	</entry>
	<entry>
		<id>https://projectswiki.eleceng.adelaide.edu.au/projects/index.php?title=Projects:2019s2-24101_Improving_Usability_and_User_Interaction_with_KALDI_Open-_Source_Speech_Recogniser&amp;diff=13062</id>
		<title>Projects:2019s2-24101 Improving Usability and User Interaction with KALDI Open- Source Speech Recogniser</title>
		<link rel="alternate" type="text/html" href="https://projectswiki.eleceng.adelaide.edu.au/projects/index.php?title=Projects:2019s2-24101_Improving_Usability_and_User_Interaction_with_KALDI_Open-_Source_Speech_Recogniser&amp;diff=13062"/>
		<updated>2019-10-06T17:47:16Z</updated>

		<summary type="html">&lt;p&gt;A1758286: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&amp;#039;&amp;#039;&amp;#039;Introduction&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
This project aims to reduce the word error rate of audio transcription performed by KALDI. KALDI transcribes audio using a choice of language and acoustic models. &lt;br /&gt;
Previous work reduced the word error rate by improving the language and acoustic models. Our project instead focuses on improving the quality of the input signal through audio pre-processing with the HARK software. HARK is based on missing feature theory and reduces noise using several algorithms.&lt;br /&gt;
----&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Background&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
Speech transcription is the process by which computer hardware and software convert audio input into text output.&lt;br /&gt;
It makes voice recordings and other audio material more accessible and easier to understand across many use cases.&lt;br /&gt;
----&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Project team&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Project students&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
Riya Parth Dube&lt;br /&gt;
Pengyue Song&lt;br /&gt;
----&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Supervisors&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
Dr. Said Al-Sarawi&lt;br /&gt;
&lt;br /&gt;
DSTG Dr Ahmad Hashemi-Sakhtsari&lt;br /&gt;
----&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Method&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Results&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Conclusion&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;References&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
1 The HARK Documentation available online at https://www.hark.jp/document/3.0.0/hark-document-en/sect0002.html&lt;br /&gt;
2 The KALDI Decoder online available at https://kaldi-asr.org/&lt;br /&gt;
3 D. Povey, “KALDI – Home”, KALDI, n.d. [Online] Available at: http://kaldi-asr.org/ [Accessed: September 2019].&lt;/div&gt;</summary>
		<author><name>A1758286</name></author>
		
	</entry>
	<entry>
		<id>https://projectswiki.eleceng.adelaide.edu.au/projects/index.php?title=Projects:2019s2-24101_Improving_Usability_and_User_Interaction_with_KALDI_Open-_Source_Speech_Recogniser&amp;diff=13061</id>
		<title>Projects:2019s2-24101 Improving Usability and User Interaction with KALDI Open- Source Speech Recogniser</title>
		<link rel="alternate" type="text/html" href="https://projectswiki.eleceng.adelaide.edu.au/projects/index.php?title=Projects:2019s2-24101_Improving_Usability_and_User_Interaction_with_KALDI_Open-_Source_Speech_Recogniser&amp;diff=13061"/>
		<updated>2019-10-06T17:36:50Z</updated>

		<summary type="html">&lt;p&gt;A1758286: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&amp;#039;&amp;#039;&amp;#039;Introduction&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
This project aims to reduce the word error rate of audio transcription performed by KALDI. KALDI transcribes audio using a choice of language and acoustic models. &lt;br /&gt;
Previous work reduced the word error rate by improving the language and acoustic models. Our project instead focuses on improving the quality of the input signal through audio pre-processing with the HARK software. HARK is based on missing feature theory and reduces noise using several algorithms.&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Project team&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Project students&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
Riya Parth Dube&lt;br /&gt;
Pengyue Song&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Supervisors&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
Dr. Said Al-Sarawi&lt;br /&gt;
&lt;br /&gt;
DSTG Dr Ahmad Hashemi-Sakhtsari&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Method&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Results&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Conclusion&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;References&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
1 The HARK Documentation available online at https://www.hark.jp/document/3.0.0/hark-document-en/sect0002.html&lt;br /&gt;
2 The KALDI Decoder online available at https://kaldi-asr.org/&lt;br /&gt;
3 D. Povey, “KALDI – Home”, KALDI, n.d. [Online] Available at: http://kaldi-asr.org/ [Accessed: September 2019].&lt;/div&gt;</summary>
		<author><name>A1758286</name></author>
		
	</entry>
	<entry>
		<id>https://projectswiki.eleceng.adelaide.edu.au/projects/index.php?title=Projects:2019s2-24101_Improving_Usability_and_User_Interaction_with_KALDI_Open-_Source_Speech_Recogniser&amp;diff=13060</id>
		<title>Projects:2019s2-24101 Improving Usability and User Interaction with KALDI Open- Source Speech Recogniser</title>
		<link rel="alternate" type="text/html" href="https://projectswiki.eleceng.adelaide.edu.au/projects/index.php?title=Projects:2019s2-24101_Improving_Usability_and_User_Interaction_with_KALDI_Open-_Source_Speech_Recogniser&amp;diff=13060"/>
		<updated>2019-10-06T17:36:25Z</updated>

		<summary type="html">&lt;p&gt;A1758286: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&amp;#039;&amp;#039;&amp;#039;Introduction&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
This project aims to reduce the word error rate of audio transcription performed by KALDI. KALDI transcribes audio using a choice of language and acoustic models. &lt;br /&gt;
Previous work reduced the word error rate by improving the language and acoustic models. Our project instead focuses on improving the quality of the input signal through audio pre-processing with the HARK software. HARK is based on missing feature theory and reduces noise using several algorithms.&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Project team&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Project students&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
Riya Parth Dube&lt;br /&gt;
Pengyue Song&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Supervisors&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
Dr. Said Al-Sarawi&lt;br /&gt;
&lt;br /&gt;
DSTG Dr Ahmad Hashemi-Sakhtsari&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Method&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Results&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Conclusion&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;References&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
1 The HARK Documentation available online at https://www.hark.jp/document/3.0.0/hark-document-en/sect0002.html&lt;br /&gt;
2 The KALDI Decoder online available at https://kaldi-asr.org/&lt;br /&gt;
3 D. Povey, “KALDI – Home”, KALDI, n.d. [Online] Available at: http://kaldi-asr.org/ [Accessed: September 2019].&lt;/div&gt;</summary>
		<author><name>A1758286</name></author>
		
	</entry>
	<entry>
		<id>https://projectswiki.eleceng.adelaide.edu.au/projects/index.php?title=Projects:2019s2-24101_Improving_Usability_and_User_Interaction_with_KALDI_Open-_Source_Speech_Recogniser&amp;diff=13059</id>
		<title>Projects:2019s2-24101 Improving Usability and User Interaction with KALDI Open- Source Speech Recogniser</title>
		<link rel="alternate" type="text/html" href="https://projectswiki.eleceng.adelaide.edu.au/projects/index.php?title=Projects:2019s2-24101_Improving_Usability_and_User_Interaction_with_KALDI_Open-_Source_Speech_Recogniser&amp;diff=13059"/>
		<updated>2019-10-06T17:34:10Z</updated>

		<summary type="html">&lt;p&gt;A1758286: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&amp;#039;&amp;#039;&amp;#039;Introduction&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
This project aims to reduce the word error rate of audio transcription performed by KALDI. KALDI transcribes audio using a choice of language and acoustic models. &lt;br /&gt;
Previous work reduced the word error rate by improving the language and acoustic models. Our project instead focuses on improving the quality of the input signal through audio pre-processing with the HARK software. HARK is based on missing feature theory and reduces noise using several algorithms.&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Project team&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Project students&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
Riya Parth Dube&lt;br /&gt;
Pengyue Song&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Supervisors&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
Dr. Said Al-Sarawi&lt;br /&gt;
DSTG&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Method&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Results&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Conclusion&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;References&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
1 The HARK Documentation available online at https://www.hark.jp/document/3.0.0/hark-document-en/sect0002.html&lt;br /&gt;
2 The KALDI Decoder online available at https://kaldi-asr.org/&lt;br /&gt;
3 D. Povey, “KALDI – Home”, KALDI, n.d. [Online] Available at: http://kaldi-asr.org/ [Accessed: September 2019].&lt;/div&gt;</summary>
		<author><name>A1758286</name></author>
		
	</entry>
	<entry>
		<id>https://projectswiki.eleceng.adelaide.edu.au/projects/index.php?title=Projects:2019s2-24101_Improving_Usability_and_User_Interaction_with_KALDI_Open-_Source_Speech_Recogniser&amp;diff=13058</id>
		<title>Projects:2019s2-24101 Improving Usability and User Interaction with KALDI Open- Source Speech Recogniser</title>
		<link rel="alternate" type="text/html" href="https://projectswiki.eleceng.adelaide.edu.au/projects/index.php?title=Projects:2019s2-24101_Improving_Usability_and_User_Interaction_with_KALDI_Open-_Source_Speech_Recogniser&amp;diff=13058"/>
		<updated>2019-10-06T17:33:25Z</updated>

		<summary type="html">&lt;p&gt;A1758286: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&amp;#039;&amp;#039;&amp;#039;Introduction&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
This project aims to reduce the word error rate of audio transcription performed by KALDI. KALDI transcribes audio using a choice of language and acoustic models. &lt;br /&gt;
Previous work reduced the word error rate by improving the language and acoustic models. Our project instead focuses on improving the quality of the input signal through audio pre-processing with the HARK software. HARK is based on missing feature theory and reduces noise using several algorithms.&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Project team&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Project students&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
Riya Parth Dube&lt;br /&gt;
Pengyue Song&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Supervisors&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
Dr. Said Al-Sarawi&lt;br /&gt;
DSTG&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Method&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Results&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Conclusion&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;References&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
1 The HARK Documentation available online at https://www.hark.jp/document/3.0.0/hark-document-en/sect0002.html&lt;br /&gt;
2 The KALDI Decoder online available at https://kaldi-asr.org/&lt;br /&gt;
3 D. Povey, “KALDI – Home”, KALDI, n.d. [Online] Available at: http://kaldi-asr.org/ [Accessed: September 2019].&lt;/div&gt;</summary>
		<author><name>A1758286</name></author>
		
	</entry>
	<entry>
		<id>https://projectswiki.eleceng.adelaide.edu.au/projects/index.php?title=Projects:2019s2-24101_Improving_Usability_and_User_Interaction_with_KALDI_Open-_Source_Speech_Recogniser&amp;diff=13057</id>
		<title>Projects:2019s2-24101 Improving Usability and User Interaction with KALDI Open- Source Speech Recogniser</title>
		<link rel="alternate" type="text/html" href="https://projectswiki.eleceng.adelaide.edu.au/projects/index.php?title=Projects:2019s2-24101_Improving_Usability_and_User_Interaction_with_KALDI_Open-_Source_Speech_Recogniser&amp;diff=13057"/>
		<updated>2019-10-06T17:32:33Z</updated>

		<summary type="html">&lt;p&gt;A1758286: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&amp;#039;&amp;#039;&amp;#039;Introduction&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
This project aims to reduce the word error rate of audio transcription performed by KALDI. KALDI transcribes audio using a choice of language and acoustic models. &lt;br /&gt;
Previous work reduced the word error rate by improving the language and acoustic models. Our project instead focuses on improving the quality of the input signal through audio pre-processing with the HARK software. HARK is based on missing feature theory and reduces noise using several algorithms.&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Project team&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Project students&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
Riya Parth Dube&lt;br /&gt;
Pengyue Song&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Supervisors&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
Dr. Said Al-Sarawi&lt;br /&gt;
DSTG&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Method&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Results&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Conclusion&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;References&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
1 The HARK Documentation available online at https://www.hark.jp/document/3.0.0/hark-document-en/sect0002.html&lt;br /&gt;
2 The KALDI Decoder online available at https://kaldi-asr.org/&lt;br /&gt;
3 D. Povey, “KALDI – Home”, KALDI, n.d. [Online] Available at: http://kaldi-asr.org/ [Accessed: September 2019].&lt;/div&gt;</summary>
		<author><name>A1758286</name></author>
		
	</entry>
	<entry>
		<id>https://projectswiki.eleceng.adelaide.edu.au/projects/index.php?title=Projects:2019s2-24101_Improving_Usability_and_User_Interaction_with_KALDI_Open-_Source_Speech_Recogniser&amp;diff=13056</id>
		<title>Projects:2019s2-24101 Improving Usability and User Interaction with KALDI Open- Source Speech Recogniser</title>
		<link rel="alternate" type="text/html" href="https://projectswiki.eleceng.adelaide.edu.au/projects/index.php?title=Projects:2019s2-24101_Improving_Usability_and_User_Interaction_with_KALDI_Open-_Source_Speech_Recogniser&amp;diff=13056"/>
		<updated>2019-10-06T17:31:24Z</updated>

		<summary type="html">&lt;p&gt;A1758286: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Introduction&lt;br /&gt;
This project aims to reduce the word error rate of audio transcription performed by KALDI. KALDI transcribes audio using a choice of language and acoustic models. &lt;br /&gt;
Previous work reduced the word error rate by improving the language and acoustic models. Our project instead focuses on improving the quality of the input signal through audio pre-processing with the HARK software. HARK is based on missing feature theory and reduces noise using several algorithms.&lt;br /&gt;
&lt;br /&gt;
Project team&lt;br /&gt;
Project students&lt;br /&gt;
Riya Parth Dube&lt;br /&gt;
Pengyue Song&lt;br /&gt;
Supervisors&lt;br /&gt;
Dr. Said Al-Sarawi&lt;br /&gt;
DSTG&lt;br /&gt;
&lt;br /&gt;
Method&lt;br /&gt;
Results&lt;br /&gt;
Conclusion&lt;br /&gt;
References&lt;br /&gt;
1 The HARK Documentation available online at https://www.hark.jp/document/3.0.0/hark-document-en/sect0002.html&lt;br /&gt;
2 The KALDI Decoder online available at https://kaldi-asr.org/&lt;br /&gt;
3 D. Povey, “KALDI – Home”, KALDI, n.d. [Online] Available at: http://kaldi-asr.org/ [Accessed: September 2019].&lt;/div&gt;</summary>
		<author><name>A1758286</name></author>
		
	</entry>
	<entry>
		<id>https://projectswiki.eleceng.adelaide.edu.au/projects/index.php?title=Projects_talk:2019s2-24101_Improving_Usability_and_User_Interaction_with_KALDI_Open-_Source_Speech_Recogniser&amp;diff=13055</id>
		<title>Projects talk:2019s2-24101 Improving Usability and User Interaction with KALDI Open- Source Speech Recogniser</title>
		<link rel="alternate" type="text/html" href="https://projectswiki.eleceng.adelaide.edu.au/projects/index.php?title=Projects_talk:2019s2-24101_Improving_Usability_and_User_Interaction_with_KALDI_Open-_Source_Speech_Recogniser&amp;diff=13055"/>
		<updated>2019-10-06T17:13:45Z</updated>

		<summary type="html">&lt;p&gt;A1758286: Blanked the page&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>A1758286</name></author>
		
	</entry>
	<entry>
		<id>https://projectswiki.eleceng.adelaide.edu.au/projects/index.php?title=Projects_talk:2019s2-24101_Improving_Usability_and_User_Interaction_with_KALDI_Open-_Source_Speech_Recogniser&amp;diff=13054</id>
		<title>Projects talk:2019s2-24101 Improving Usability and User Interaction with KALDI Open- Source Speech Recogniser</title>
		<link rel="alternate" type="text/html" href="https://projectswiki.eleceng.adelaide.edu.au/projects/index.php?title=Projects_talk:2019s2-24101_Improving_Usability_and_User_Interaction_with_KALDI_Open-_Source_Speech_Recogniser&amp;diff=13054"/>
		<updated>2019-10-06T17:13:23Z</updated>

		<summary type="html">&lt;p&gt;A1758286: /* PROJECT AIM AND MOTIVATION. */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;To enable users to access the functionality of KALDI (http://kaldi.sourceforge.net/about.html) without&lt;br /&gt;
knowledge of a scripting language such as Bash or detailed knowledge of the internal algorithms of KALDI. Furthermore,&lt;br /&gt;
attempts will be made to transcribe live speech continuously.&lt;br /&gt;
Project Proposals: The proposal consists of two parts. The first part focuses on improving usability and user&lt;br /&gt;
interaction with KALDI through a GUI that has the following features:&lt;br /&gt;
• A soft ON/OFF switch for the microphone.&lt;br /&gt;
• Operation with minimal scripting knowledge or commands.&lt;br /&gt;
• The ability for users to select acoustic and language models of their choice, either by selecting one of&lt;br /&gt;
the pre-trained models or by training their own acoustic and language models for&lt;br /&gt;
subsequent use.&lt;br /&gt;
• The option to transcribe from continuous live speech input or from recorded audio. Recording&lt;br /&gt;
audio from the speaker during live input allows the audio to be played back in order to correct errors in the&lt;br /&gt;
transcript.&lt;br /&gt;
• Isolation of Utterance/Speaker ID and Speaker ID/Utterance pairs from decoded results for later analysis of&lt;br /&gt;
the recognition performance of each user. This also allows a plain transcript, free from labels and indices,&lt;br /&gt;
to be produced for each user.&lt;br /&gt;
• A facility whereby a user can improve her/his recognition performance with KALDI through user-adaptive&lt;br /&gt;
training, i.e. by saving changes to her/his acoustic model after each decoding session.&lt;br /&gt;
The second part is reporting the project outcomes by:&lt;br /&gt;
• Documenting the developed graphical user interface design and functionality for KALDI, including the&lt;br /&gt;
processes for selecting acoustic and language models and incorporating online decoding features.&lt;br /&gt;
• Documenting the results of evaluation studies related to the usability of the new GUI design.&lt;br /&gt;
• Presenting the work to interested staff in the Intelligence Analytics Branch of DST Group.&lt;/div&gt;</summary>
		<author><name>A1758286</name></author>
		
	</entry>
	<entry>
		<id>https://projectswiki.eleceng.adelaide.edu.au/projects/index.php?title=Projects_talk:2019s2-24101_Improving_Usability_and_User_Interaction_with_KALDI_Open-_Source_Speech_Recogniser&amp;diff=13053</id>
		<title>Projects talk:2019s2-24101 Improving Usability and User Interaction with KALDI Open- Source Speech Recogniser</title>
		<link rel="alternate" type="text/html" href="https://projectswiki.eleceng.adelaide.edu.au/projects/index.php?title=Projects_talk:2019s2-24101_Improving_Usability_and_User_Interaction_with_KALDI_Open-_Source_Speech_Recogniser&amp;diff=13053"/>
		<updated>2019-10-06T17:13:06Z</updated>

		<summary type="html">&lt;p&gt;A1758286: /* SUPERVISORS: */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;To enable users to access functionalities of KALDI (http://kaldi.sourceforge.net/about.html) without the&lt;br /&gt;
knowledge of scripting, a language like Bash, or detailed knowledge of the internal algorithms of KALDI. Furthermore&lt;br /&gt;
attempts will be made to transcribe live audio speech continuously.&lt;br /&gt;
Project Proposals: The proposal consists of two parts. For the first part is focused on improving usability and User&lt;br /&gt;
Interaction with KALDI through a GUI that has the following features:&lt;br /&gt;
• Availability of a microphone soft ON and OFF switch&lt;br /&gt;
• Minimal scripting knowledge or commands to operate.&lt;br /&gt;
• Provide users the ability to select acoustic and language models of their choice. This can be done by allowing&lt;br /&gt;
the users either to select one of the pre-trained models or to perform their own acoustic and language model&lt;br /&gt;
training in order to subsequently use those models.&lt;br /&gt;
• Allow the user to select transcribing from continuous live speech input or from recorded audio. Recording&lt;br /&gt;
audio from the speaker during live input allows the audio to be played back in order to correct errors in the&lt;br /&gt;
transcript.&lt;br /&gt;
• Isolating Utterance/Speaker ID and Speaker ID/Utterance pairs from decoded results for later analysis of&lt;br /&gt;
the recognition performance of each user. This process also allows a plain transcript, free from labels and indices,&lt;br /&gt;
to be produced for each user.&lt;br /&gt;
• A facility whereby a user can improve her/his recognition performance with KALDI through user-adaptive&lt;br /&gt;
training, i.e. by saving changes to her/his acoustic model after each decoding session.&lt;br /&gt;
The second part is reporting the project outcomes through:&lt;br /&gt;
• Documenting the developed graphical user interface design and functionality for KALDI, including the&lt;br /&gt;
processes for selecting acoustic and language models and incorporating online decoding features.&lt;br /&gt;
• Documenting the results of evaluation studies related to the usability of the new GUI design.&lt;br /&gt;
• Presenting the work to interested staff in the Intelligence Analytics Branch of DST Group.&lt;br /&gt;
&lt;br /&gt;
== PROJECT AIM AND MOTIVATION. ==&lt;br /&gt;
&lt;br /&gt;
Aim&lt;br /&gt;
To improve user interaction with KALDI systems.&lt;br /&gt;
To improve audio transcription quality, measured by word accuracy rate.&lt;br /&gt;
To interface the KALDI decoder with a neural network and HARK.&lt;br /&gt;
----&lt;br /&gt;
Motivation&lt;br /&gt;
To create an open-source environment for audio transcription using KALDI.&lt;/div&gt;</summary>
		<author><name>A1758286</name></author>
		
	</entry>
	<entry>
		<id>https://projectswiki.eleceng.adelaide.edu.au/projects/index.php?title=Projects:2019s2-24101_Improving_Usability_and_User_Interaction_with_KALDI_Open-_Source_Speech_Recogniser&amp;diff=13052</id>
		<title>Projects:2019s2-24101 Improving Usability and User Interaction with KALDI Open- Source Speech Recogniser</title>
		<link rel="alternate" type="text/html" href="https://projectswiki.eleceng.adelaide.edu.au/projects/index.php?title=Projects:2019s2-24101_Improving_Usability_and_User_Interaction_with_KALDI_Open-_Source_Speech_Recogniser&amp;diff=13052"/>
		<updated>2019-10-06T17:10:56Z</updated>

		<summary type="html">&lt;p&gt;A1758286: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Aim&lt;br /&gt;
To improve user interaction with KALDI systems.&lt;br /&gt;
To improve audio transcription quality, measured by word accuracy rate.&lt;br /&gt;
To interface the KALDI decoder with a neural network and HARK.&lt;br /&gt;
----&lt;br /&gt;
Motivation&lt;br /&gt;
To create an open-source environment for audio transcription using KALDI.&lt;br /&gt;
----&lt;br /&gt;
SUPERVISORS:&lt;br /&gt;
----&lt;br /&gt;
SAS:Dr Said Al-Sarawi&lt;br /&gt;
&lt;br /&gt;
DSTG (Dr Hashemi-Sakhtsari)&lt;/div&gt;</summary>
		<author><name>A1758286</name></author>
		
	</entry>
	<entry>
		<id>https://projectswiki.eleceng.adelaide.edu.au/projects/index.php?title=Projects:2019s2-24101_Improving_Usability_and_User_Interaction_with_KALDI_Open-_Source_Speech_Recogniser&amp;diff=13051</id>
		<title>Projects:2019s2-24101 Improving Usability and User Interaction with KALDI Open- Source Speech Recogniser</title>
		<link rel="alternate" type="text/html" href="https://projectswiki.eleceng.adelaide.edu.au/projects/index.php?title=Projects:2019s2-24101_Improving_Usability_and_User_Interaction_with_KALDI_Open-_Source_Speech_Recogniser&amp;diff=13051"/>
		<updated>2019-10-06T17:09:15Z</updated>

		<summary type="html">&lt;p&gt;A1758286: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Aim&lt;br /&gt;
To improve user interaction with KALDI systems.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
To improve audio transcription quality, measured by word accuracy rate.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
To interface the KALDI decoder with a neural network and HARK.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Motivation&lt;br /&gt;
To create an open-source environment for audio transcription using KALDI.&lt;br /&gt;
&lt;br /&gt;
SUPERVISORS:&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
SAS:Dr Said Al-Sarawi&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
DSTG (Dr Hashemi-Sakhtsari)&lt;/div&gt;</summary>
		<author><name>A1758286</name></author>
		
	</entry>
	<entry>
		<id>https://projectswiki.eleceng.adelaide.edu.au/projects/index.php?title=Projects:2019s2-24101_Improving_Usability_and_User_Interaction_with_KALDI_Open-_Source_Speech_Recogniser&amp;diff=13050</id>
		<title>Projects:2019s2-24101 Improving Usability and User Interaction with KALDI Open- Source Speech Recogniser</title>
		<link rel="alternate" type="text/html" href="https://projectswiki.eleceng.adelaide.edu.au/projects/index.php?title=Projects:2019s2-24101_Improving_Usability_and_User_Interaction_with_KALDI_Open-_Source_Speech_Recogniser&amp;diff=13050"/>
		<updated>2019-10-06T17:08:04Z</updated>

		<summary type="html">&lt;p&gt;A1758286: Created page with &amp;quot;Aim To improve the user interact ability of KALDI systems. To improve the audio transcription quality by text to word accuracy rate. Interfacing KALDI decoder to implement Neu...&amp;quot;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Aim&lt;br /&gt;
To improve user interaction with KALDI systems.&lt;br /&gt;
To improve audio transcription quality, measured by word accuracy rate.&lt;br /&gt;
To interface the KALDI decoder with a neural network and HARK.&lt;br /&gt;
Motivation&lt;br /&gt;
To create an open-source environment for audio transcription using KALDI.&lt;/div&gt;</summary>
		<author><name>A1758286</name></author>
		
	</entry>
	<entry>
		<id>https://projectswiki.eleceng.adelaide.edu.au/projects/index.php?title=Projects_talk:2019s2-24101_Improving_Usability_and_User_Interaction_with_KALDI_Open-_Source_Speech_Recogniser&amp;diff=13049</id>
		<title>Projects talk:2019s2-24101 Improving Usability and User Interaction with KALDI Open- Source Speech Recogniser</title>
		<link rel="alternate" type="text/html" href="https://projectswiki.eleceng.adelaide.edu.au/projects/index.php?title=Projects_talk:2019s2-24101_Improving_Usability_and_User_Interaction_with_KALDI_Open-_Source_Speech_Recogniser&amp;diff=13049"/>
		<updated>2019-10-06T17:04:17Z</updated>

		<summary type="html">&lt;p&gt;A1758286: /* PROJECT AIM AND MOTIVATION. */ new section&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;The project aims to enable users to access the functionality of KALDI (http://kaldi.sourceforge.net/about.html) without&lt;br /&gt;
knowledge of a scripting language such as Bash or detailed knowledge of the internal algorithms of KALDI. Furthermore,&lt;br /&gt;
attempts will be made to transcribe live speech continuously.&lt;br /&gt;
Project Proposal: The proposal consists of two parts. The first part focuses on improving usability and user&lt;br /&gt;
interaction with KALDI through a GUI that has the following features:&lt;br /&gt;
• Availability of a microphone soft ON and OFF switch&lt;br /&gt;
• Minimal scripting knowledge or commands to operate.&lt;br /&gt;
• Provide users the ability to select acoustic and language models of their choice. This can be done by allowing&lt;br /&gt;
the users either to select one of the pre-trained models or to perform their own acoustic and language model&lt;br /&gt;
training in order to subsequently use those models.&lt;br /&gt;
• Allow the user to select transcribing from continuous live speech input or from recorded audio. Recording&lt;br /&gt;
audio from the speaker during live input allows the audio to be played back in order to correct errors in the&lt;br /&gt;
transcript.&lt;br /&gt;
• Isolating Utterance/Speaker ID and Speaker ID/Utterance pairs from decoded results for later analysis of&lt;br /&gt;
the recognition performance of each user. This process also allows a plain transcript, free from labels and indices,&lt;br /&gt;
to be produced for each user.&lt;br /&gt;
• A facility whereby a user can improve her/his recognition performance with KALDI through user-adaptive&lt;br /&gt;
training, i.e. by saving changes to her/his acoustic model after each decoding session.&lt;br /&gt;
The second part is reporting the project outcomes through:&lt;br /&gt;
• Documenting the developed graphical user interface design and functionality for KALDI, including the&lt;br /&gt;
processes for selecting acoustic and language models and incorporating online decoding features.&lt;br /&gt;
• Documenting the results of evaluation studies related to the usability of the new GUI design.&lt;br /&gt;
• Presenting the work to interested staff in the Intelligence Analytics Branch of DST Group.&lt;br /&gt;
&lt;br /&gt;
== SUPERVISORS: ==&lt;br /&gt;
&lt;br /&gt;
SAS:Dr Said Al-Sarawi&lt;br /&gt;
----&lt;br /&gt;
DSTG (Dr Hashemi-Sakhtsari)&lt;br /&gt;
&lt;br /&gt;
== PROJECT AIM AND MOTIVATION. ==&lt;br /&gt;
&lt;br /&gt;
Aim&lt;br /&gt;
To improve user interaction with KALDI systems.&lt;br /&gt;
To improve audio transcription quality, measured by word accuracy rate.&lt;br /&gt;
To interface the KALDI decoder with a neural network and HARK.&lt;br /&gt;
----&lt;br /&gt;
Motivation&lt;br /&gt;
To create an open-source environment for audio transcription using KALDI.&lt;/div&gt;</summary>
		<author><name>A1758286</name></author>
		
	</entry>
	<entry>
		<id>https://projectswiki.eleceng.adelaide.edu.au/projects/index.php?title=Projects_talk:2019s2-24101_Improving_Usability_and_User_Interaction_with_KALDI_Open-_Source_Speech_Recogniser&amp;diff=13048</id>
		<title>Projects talk:2019s2-24101 Improving Usability and User Interaction with KALDI Open- Source Speech Recogniser</title>
		<link rel="alternate" type="text/html" href="https://projectswiki.eleceng.adelaide.edu.au/projects/index.php?title=Projects_talk:2019s2-24101_Improving_Usability_and_User_Interaction_with_KALDI_Open-_Source_Speech_Recogniser&amp;diff=13048"/>
		<updated>2019-10-06T17:02:04Z</updated>

		<summary type="html">&lt;p&gt;A1758286: /* SUPERVISORS: */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;The project aims to enable users to access the functionality of KALDI (http://kaldi.sourceforge.net/about.html) without&lt;br /&gt;
knowledge of a scripting language such as Bash or detailed knowledge of the internal algorithms of KALDI. Furthermore,&lt;br /&gt;
attempts will be made to transcribe live speech continuously.&lt;br /&gt;
Project Proposal: The proposal consists of two parts. The first part focuses on improving usability and user&lt;br /&gt;
interaction with KALDI through a GUI that has the following features:&lt;br /&gt;
• Availability of a microphone soft ON and OFF switch&lt;br /&gt;
• Minimal scripting knowledge or commands to operate.&lt;br /&gt;
• Provide users the ability to select acoustic and language models of their choice. This can be done by allowing&lt;br /&gt;
the users either to select one of the pre-trained models or to perform their own acoustic and language model&lt;br /&gt;
training in order to subsequently use those models.&lt;br /&gt;
• Allow the user to select transcribing from continuous live speech input or from recorded audio. Recording&lt;br /&gt;
audio from the speaker during live input allows the audio to be played back in order to correct errors in the&lt;br /&gt;
transcript.&lt;br /&gt;
• Isolating Utterance/Speaker ID and Speaker ID/Utterance pairs from decoded results for later analysis of&lt;br /&gt;
the recognition performance of each user. This process also allows a plain transcript, free from labels and indices,&lt;br /&gt;
to be produced for each user.&lt;br /&gt;
• A facility whereby a user can improve her/his recognition performance with KALDI through user-adaptive&lt;br /&gt;
training, i.e. by saving changes to her/his acoustic model after each decoding session.&lt;br /&gt;
The second part is reporting the project outcomes through:&lt;br /&gt;
• Documenting the developed graphical user interface design and functionality for KALDI, including the&lt;br /&gt;
processes for selecting acoustic and language models and incorporating online decoding features.&lt;br /&gt;
• Documenting the results of evaluation studies related to the usability of the new GUI design.&lt;br /&gt;
• Presenting the work to interested staff in the Intelligence Analytics Branch of DST Group.&lt;br /&gt;
&lt;br /&gt;
== SUPERVISORS: ==&lt;br /&gt;
&lt;br /&gt;
SAS:Dr Said Al-Sarawi&lt;br /&gt;
----&lt;br /&gt;
DSTG (Dr Hashemi-Sakhtsari)&lt;/div&gt;</summary>
		<author><name>A1758286</name></author>
		
	</entry>
	<entry>
		<id>https://projectswiki.eleceng.adelaide.edu.au/projects/index.php?title=Projects_talk:2019s2-24101_Improving_Usability_and_User_Interaction_with_KALDI_Open-_Source_Speech_Recogniser&amp;diff=13047</id>
		<title>Projects talk:2019s2-24101 Improving Usability and User Interaction with KALDI Open- Source Speech Recogniser</title>
		<link rel="alternate" type="text/html" href="https://projectswiki.eleceng.adelaide.edu.au/projects/index.php?title=Projects_talk:2019s2-24101_Improving_Usability_and_User_Interaction_with_KALDI_Open-_Source_Speech_Recogniser&amp;diff=13047"/>
		<updated>2019-10-06T16:59:28Z</updated>

		<summary type="html">&lt;p&gt;A1758286: /* SUPERVISORS: */ new section&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;The project aims to enable users to access the functionality of KALDI (http://kaldi.sourceforge.net/about.html) without&lt;br /&gt;
knowledge of a scripting language such as Bash or detailed knowledge of the internal algorithms of KALDI. Furthermore,&lt;br /&gt;
attempts will be made to transcribe live speech continuously.&lt;br /&gt;
Project Proposal: The proposal consists of two parts. The first part focuses on improving usability and user&lt;br /&gt;
interaction with KALDI through a GUI that has the following features:&lt;br /&gt;
• Availability of a microphone soft ON and OFF switch&lt;br /&gt;
• Minimal scripting knowledge or commands to operate.&lt;br /&gt;
• Provide users the ability to select acoustic and language models of their choice. This can be done by allowing&lt;br /&gt;
the users either to select one of the pre-trained models or to perform their own acoustic and language model&lt;br /&gt;
training in order to subsequently use those models.&lt;br /&gt;
• Allow the user to select transcribing from continuous live speech input or from recorded audio. Recording&lt;br /&gt;
audio from the speaker during live input allows the audio to be played back in order to correct errors in the&lt;br /&gt;
transcript.&lt;br /&gt;
• Isolating Utterance/Speaker ID and Speaker ID/Utterance pairs from decoded results for later analysis of&lt;br /&gt;
the recognition performance of each user. This process also allows a plain transcript, free from labels and indices,&lt;br /&gt;
to be produced for each user.&lt;br /&gt;
• A facility whereby a user can improve her/his recognition performance with KALDI through user-adaptive&lt;br /&gt;
training, i.e. by saving changes to her/his acoustic model after each decoding session.&lt;br /&gt;
The second part is reporting the project outcomes through:&lt;br /&gt;
• Documenting the developed graphical user interface design and functionality for KALDI, including the&lt;br /&gt;
processes for selecting acoustic and language models and incorporating online decoding features.&lt;br /&gt;
• Documenting the results of evaluation studies related to the usability of the new GUI design.&lt;br /&gt;
• Presenting the work to interested staff in the Intelligence Analytics Branch of DST Group.&lt;br /&gt;
&lt;br /&gt;
== SUPERVISORS: ==&lt;br /&gt;
&lt;br /&gt;
SAS:Dr Said Al-Sarawi&lt;br /&gt;
DSTG (Dr Hashemi-Sakhtsari)&lt;/div&gt;</summary>
		<author><name>A1758286</name></author>
		
	</entry>
	<entry>
		<id>https://projectswiki.eleceng.adelaide.edu.au/projects/index.php?title=Projects_talk:2019s2-24101_Improving_Usability_and_User_Interaction_with_KALDI_Open-_Source_Speech_Recogniser&amp;diff=13046</id>
		<title>Projects talk:2019s2-24101 Improving Usability and User Interaction with KALDI Open- Source Speech Recogniser</title>
		<link rel="alternate" type="text/html" href="https://projectswiki.eleceng.adelaide.edu.au/projects/index.php?title=Projects_talk:2019s2-24101_Improving_Usability_and_User_Interaction_with_KALDI_Open-_Source_Speech_Recogniser&amp;diff=13046"/>
		<updated>2019-10-06T16:57:03Z</updated>

		<summary type="html">&lt;p&gt;A1758286: Created page with &amp;quot;To enable users to access functionalities of KALDI (http://kaldi.sourceforge.net/about.html) without the knowledge of scripting, a language like Bash, or detailed knowledge of...&amp;quot;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;The project aims to enable users to access the functionality of KALDI (http://kaldi.sourceforge.net/about.html) without&lt;br /&gt;
knowledge of a scripting language such as Bash or detailed knowledge of the internal algorithms of KALDI. Furthermore,&lt;br /&gt;
attempts will be made to transcribe live speech continuously.&lt;br /&gt;
Project Proposal: The proposal consists of two parts. The first part focuses on improving usability and user&lt;br /&gt;
interaction with KALDI through a GUI that has the following features:&lt;br /&gt;
• Availability of a microphone soft ON and OFF switch&lt;br /&gt;
• Minimal scripting knowledge or commands to operate.&lt;br /&gt;
• Provide users the ability to select acoustic and language models of their choice. This can be done by allowing&lt;br /&gt;
the users either to select one of the pre-trained models or to perform their own acoustic and language model&lt;br /&gt;
training in order to subsequently use those models.&lt;br /&gt;
• Allow the user to select transcribing from continuous live speech input or from recorded audio. Recording&lt;br /&gt;
audio from the speaker during live input allows the audio to be played back in order to correct errors in the&lt;br /&gt;
transcript.&lt;br /&gt;
• Isolating Utterance/Speaker ID and Speaker ID/Utterance pairs from decoded results for later analysis of&lt;br /&gt;
the recognition performance of each user. This process also allows a plain transcript, free from labels and indices,&lt;br /&gt;
to be produced for each user.&lt;br /&gt;
• A facility whereby a user can improve her/his recognition performance with KALDI through user-adaptive&lt;br /&gt;
training, i.e. by saving changes to her/his acoustic model after each decoding session.&lt;br /&gt;
The second part is reporting the project outcomes through:&lt;br /&gt;
• Documenting the developed graphical user interface design and functionality for KALDI, including the&lt;br /&gt;
processes for selecting acoustic and language models and incorporating online decoding features.&lt;br /&gt;
• Documenting the results of evaluation studies related to the usability of the new GUI design.&lt;br /&gt;
• Presenting the work to interested staff in the Intelligence Analytics Branch of DST Group.&lt;/div&gt;</summary>
		<author><name>A1758286</name></author>
		
	</entry>
</feed>