Projects:2020s2-7410 Speech Enhancement for Automatic Speech Recognition

From Projects
Revision as of 14:49, 20 September 2020 by A1789638 (talk | contribs)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search


An increasing number of applications require the joint use of signal processing and AI techniques on time series and sensor data. These techniques can be used for the reduction of noises such as air conditioning, computer fan, or environmentally generated noises such as in a street, an airport, in a metro station, or in an airplane cockpit. Developing AI models for signal obtained from a variety of situations as exemplified above is not trivial, but these have been attempted using Recurrent and Convolutional Networks such as Speech Enhanced Generative Adversarial Neural networks (SEGAN).

Introduction

Project Aims

  • Use HARK with PyKALDI on the High-Performance Computer.
  • Develop an algorithm using HARK for noise processing on HPC.
  • Evaluate the performance of HARK relative to a number of noise types.
  • Perform speaker identification using PyKALDI on HPC.

Project Team

Students

  • Muhammad Haniff Derani
  • Shuyang Shen

Supervisors

  • Dr. Said Al-Sarawi (Shool of EEE, University of Adelaide)
  • Dr. Ahmad Hashemi-Sakhtsari (DST Group)

Abstract

Method

Results

Conclusion

Referrences