Module overview
Speech is humanity's most natural interface. Speech is all around us, but have you ever wondered what a voice really is, and what it takes to build one? In this module, you will develop expertise across the full speech AI pipeline: from speech signal fundamentals and data preparation to synthesis architectures that generate remarkably human voices. You will explore the theory and AI architectures that underlie state of the art speech technologies.
Building on your knowledge of machine learning, this module dives deeper into open and unsolved problems in the field, driven by cutting edge research. You will learn why speech processing is challenging and you will use creative problem solving to explore both the limitations and promises of speech AI.
Linked modules
prerequisites: COMP6246 Machine Learning Technologies