Research Areas and Projects

Artificial neural networks as a powerful modelling tool
Barriers to depth: the recalcitrance of convergence and computational complexity

References

Y. Bengio, "Learning Deep Architectures for AI," Foundations and Trends in Machine Learning, 2(1): 1-127, 2009.
A. Fischer & C. Igel, "An Introduction to Restricted Boltzmann Machines," 2012.
H. Zen, "Deep Learning in Speech Synthesis," Google, 2013.
Z. Ling et al., "Deep Learning for Acoustic Modeling in Parametric Speech Generation: A systematic review of existing techniques and future trends," IEEE Signal Processing Magazine, 35-52, May 2015.
A. van den Oord et al., "WaveNet: A Generative Model for Raw Audio," Google, 2016.
A. Tamamori et al., "Speaker-dependent WaveNet vocoder," Interspeech 2017.
S. Kim et al., "FloWaveNet: A Generative Flow for Raw Audio," PMLR 2019.
R. Prenger et al., "WaveGlow: A Flow-based Generative Network for Speech Synthesis," ICASSP 2019.
J. Shen et al., "Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions," ICASSP 2018.
N. Li et al., "Neural Speech Synthesis with Transformer Network," AAAI 2019.
Y. Ren et al., "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech," ICLR 2021.
X. Tan et al., "A Survey on Neural Speech Synthesis," Microsoft Research Asia, 2021.

Links