phy696 Advanced Topics Speech and Audio Processing (Complete module description)

phy696 Advanced Topics Speech and Audio Processing (Complete module description)

Deutsch English PDF Download
Module label Advanced Topics Speech and Audio Processing
Modulkürzel phy696
Credit points 6.0 KP
Workload 180 h
(

Attendance: 56 hrs, Self study: 124 hrs

)
Verwendbarkeit des Moduls
  • Master Engineering Physics > Schwerpunkt: Acoustics
Zuständige Personen
  • Doclo, Simon (module responsibility)
  • Doclo, Simon (Prüfungsberechtigt)
  • Enzner, Gerald (Prüfungsberechtigt)
  • Meyer, Bernd (Prüfungsberechtigt)
Prerequisites

Basic principles of discrete-time signal processing (preferably completed the course Digital Signal Processing). In addition, Matlab programming skills are required.

Skills to be acquired in this module

The students will acquire in-depth knowledge in the field of speech and audio processing. The practical component of the course provides insight into the key properties of the covered methods through a self-study approach, while the implementation of algorithms on a computer supports the application and transfer of theoretical concepts to practical problems.

Module contents

After reviewing the basic principles of speech processing and statistical signal processing, including adaptive filtering and estimation theory, this course covers techniques and underlying algorithms that are essential for many modern speech communication and audio processing systems, such as mobile phones, smart speakers, hearing aids, and headphones. These include acoustic echo and feedback cancellation, noise reduction, dereverberation, microphone and loudspeaker array processing, and active noise control. The course addresses both algorithms based on statistical signal processing and approaches based on deep learning. During the exercises, a typical hands-free speech communication or audio processing system is implemented in Matlab or Python.

Literaturempfehlungen
  • J. Benesty, M. M. Sondhi, Y. Huang: Handbook of Speech Processing, Springer, 2008;
  • P. Vary, R. Martin: Digital Speech Transmission, Wiley, 2006;
  • P. Loizou: Speech Enhancement: Theory and Practice, CRC Press, 2017;
  • S. Haykin: Adaptive Filter Theory, Prentice Hall, 2013,
  • E. Vincent, T. Virtanen, S. Gannot: Audio source separation and speech Enhancement, Wiley, 2018.
Links
Language of instruction English
Duration (semesters) 1 Semester
Module frequency jährlich
Module capacity unrestricted
Modulart Wahlpflicht / Elective
Modullevel MM (Mastermodul / Master module)
Lehr-/Lernform Lecture: 2hrs/week, Exercise: 2hrs/week
Vorkenntnisse Basic principles of discrete-time signal processing (preferably completed the course Digital Signal Processing). In addition, Matlab programming skills are required
Examination Prüfungszeiten Type of examination
Final exam of module

oral exam (30 minutes) or homework or practical report

Form of instruction Lecture
SWS 4
Frequency SoSe oder WiSe
Workload Präsenzzeit 56 h