GettyImages 996731458
Image Credits:tomazl / Getty Images

AWS’ new text-to-speech engine sounds like a newscaster

Thanks to modern machine learning techniques, text-to-speech engines have made massive strides over the last few years. It used to be incredibly easy to know that it was a computer that was reading a text and not a human being. But that’s changing quickly. Amazon’s AWS cloud computing arm today launched a number of new neural text-to-speech models, as well as a new newscaster style that is meant to mimic the way… you guessed it… newscasters sound.

“Speech quality is certainly important, but more can be done to make a synthetic voice sound even more realistic and engaging,” the company notes in today’s announcement. “What about style? For sure, human ears can tell the difference between a newscast, a sportscast, a university class and so on; indeed, most humans adopt the right style of speech for the right context, and this certainly helps in getting their message across.”

The new newscaster style is now available in two U.S. voices (Joanna and Matthew) and Amazon is already working with USA Today and Canada’s The Globe and Mail, among a number of other companies, to help them voice their texts.

Have a listen for yourself:

Topics

, , , , , , , , , , , ,
Loading the next article
Error loading the next article