Text-to-speech has evolved with deep learning. It is now possible to produce very natural-sounding speech that includes changes to pitch, rate, pronunciation, and inflection. Today, computer-generated speech is used in a wide variety of applications and is becoming a ubiquitous element of user interfaces.