Carahsoft, in conjunction with its vendor partners, sponsors hundreds of events each year, ranging from webcasts and tradeshows to executive roundtables and technology forums.
Fill out the form below to view this archived event.
Text-to-speech (also known as speech synthesis), the task of generating spoken audio from written text, is a common use case for deep learning and neural networks. Speech synthesis is usually tackled by combining multiple deep learning models, which are tuned frequently to improve the intonation and inflection of the synthesized speech.
In this recorded workshop, you will learn how to use NVIDIA’s NeMo toolkit to train an end-to-end TTS system and how to use Weights & Biases to track experiments and performance metrics. We will walk you through setting up the environment, explain the code blocks and tools as we execute the Jupyter Notebook, and then deploy the model to test its performance on specific blocks of text.