Keynote Speech II | Speech Processing at Cocktail Party |
Date/Time | 10 Nov 2020 / 10:00 – 11:00 hrs |
Speaker | Professor Haizhou Li, Department of Electrical and Computer Engineering, National University of Singapore, Singapore; Bremen Excellence Chair Professor, University of Bremen, Germany |
Chair | Prof. Wenjian Cai |
Biography
Haizhou Li is a Professor at the Department of Electrical and Computer Engineering, National
University of Singapore, and a Bremen Excellence Chair Professor at the University of Bremen, Germany. His research interests include
speech information processing, natural language processing, and neuromorphic computing. Professor Li has served as the Editor-in-Chief of
IEEE/ACM Transactions on Audio, Speech and Language Processing (2015-2018), the President of the International Speech Communication
Association (ISCA, 2015-2017), and the President of the Asia Pacific Signal and Information Processing Association (APSIPA, 2015-2016). He is
a Fellow of the IEEE and the ISCA.
Abstract
Humans have a remarkable ability to focus their auditory attention on a single sound source of interest in a multi-talker environment, or cocktail party; we call this selective listening. However, signal processing approaches to speech separation and speaker extraction from multi-talker speech remain a challenge for machines. In this talk, we study deep learning solutions to monaural speech separation and speaker extraction that enable selective listening, speech recognition, and speaker recognition at a cocktail party. We discuss computational auditory models, technical challenges, and recent advances in the field.