With the success of voice assistants such as Alexa/Siri/Google, deep learning has brought sound and speech processing into the mainstream. Topics include speech recognition, denoising, classification, audio tagging and audio separation (speech & music)…
Theoretical courses mixed with examples and case studies. This training course aims to present the main problems encountered
LSTM, U-Net, CNN, Fourier, Wiener filter, ngram, language model, acoustic model, state-space model, Kaldi, PyTorch, deep clustering, TASnet, tacotron, wavenet