Google Text-to-Speech

Google Text-to-Speech is a screen reader application developed by Google for its Android operating system. It powers applications to read aloud the text on the screen with support for many languages. Text-to-Speech may be used by apps such as Google Play Books for reading books aloud, by Google Translate for reading aloud translations providing useful insight to the pronunciation of words, by Google Talkback and other spoken feedback accessibility-based applications, as well as by third-party apps. Users must install voice data for each language.

Supported languages

Google Text-To-Speech Android application

Bengali, Bengali, Cantonese, Chinese, Chinese, Czech, Danish, Dutch, English, English, English, English, English, Estonian, Filipino, Finnish, French, French, German, Greek, Gujarati, Hindi, Hungarian, Indonesian, Italian, Japanese, Javanese, Kannada, Khmer, Korean, Malayalam, Marathi, Nepali, Norwegian Bomkål, Polish, Portuguese, Portuguese, Romanian, Russian, Sinhala, Slovak, Spanish, Spanish, Sundanese, Swedish, Tamil, Telugu, Thai, Turkish, Ukranian, Urdu, Vietnamese

Google Cloud Text-To-Speech

Arabic, Bengali, Burmese, Czech, Danish, Dutch, English, English, English, English, Filipino, Finnish, French, French, German, Greek, Hindi, Hungarian, Indonesian, Italian, Japanese, Korean, Mandarin Chinese, Norwegian, Polish, Portuguese, Portuguese, Russian, Slovak, Spanish, Swedish, Turkish, Ukrainian and Vietnamese

Evolution

Some app developers have started adapting and tweaking their Android Auto apps to include Text-to-Speech, such as Hyundai in 2015. Apps such as textPlus and WhatsApp use Text-to-Speech to read notifications aloud and provide voice-reply functionality.
Cloud Text-to-Speech is powered by WaveNet, software created by Google's UK-based AI subsidiary DeepMind. Since Google bought DeepMind in 2014, it's been exploring ways to turn the company's AI talent into tangible products. Integrating WaveNet into its cloud service is significant as Google tries to win the cloud business away from Amazon and Microsoft, presenting its AI skills as its differentiating factor.
DeepMind's AI voice synthesis tech is notably advanced and realistic. Most voice synthesizers use concatenative synthesis, in which a program stores individual syllables — sounds such as “ba,” “sht,” and “oo” — and pieces them together to form words and sentences. WaveNet instead uses machine learning to generate speech. It then waveforms from a database of human speech and re-creates them at a rate of 24,000 samples per second. The end result includes voices with subtleties like lip smacks and accents. When Google first unveiled WaveNet in 2016, it was too computationally intensive to work outside of research environments, but it's since been slimmed down significantly, showing a clear pipeline from research to product. Google Cloud Text-to-Speech converts text into human-like speech in more than 180 voices across 30+ languages and variants. It applies groundbreaking research in speech synthesis and Google's powerful neural networks to deliver high-fidelity audio.
Includes exclusive access to WaveNet technology DeepMind has done groundbreaking research in machine learning models to generate speech that mimics human voices and sounds more natural, reducing the gap with human performance by 70%. Cloud Text-to-Speech offers exclusive access to 90+ WaveNet voices and will continue to add more over time.

Version history

November 2013

Korean now supported.
March 2014
Google announced that Arabic would never be supported despite having more than 467 million native speakers.
Version 3.0 added support for natural high-quality voices. High-quality voices were now featured in English as Female whilst English also now featured three new high-quality voices; Male, Female and Male. These new high-quality voices are much larger than the prior versions in terms of file size with 244MB for the English US female voice compared to just 6.8MB for the regular female voice version. These high-quality voices were added to ensure higher quality pronunciation and enunciation with intonations that are more natural.
Support for Brazilian, Portuguese and Spanish brought the total number of languages supported to nine at this point., English, Spanish, Spanish, French, Italian, Korean, and Portuguese. Only English and English German, English UK, English US, Spanish ES, Spanish US, French, Italian, Korean, and Portuguese. Only English US and English UK had high-quality voice packs for now.
User Interface tweaks: Due to having multiple voices for some languages a toggle was added to voices with 2 or more voice packs.
May 2014
Russian, Dutch, Polish and English were added to the currently supported list of languages.
September 2014
Support for Japanese output added.
December 2014
Version 4 available
Support for Hindi and Indonesian output.
Improved output quality. Standard-quality voices now surpass the quality of the high-quality voices from previous releases.
July 2015
Four new languages now supported: Cantonese, Mandarin, Thai and Turkish.
Bug fixes and other improvements.
February 2016
Improved voice quality.
Added support for Bengali, Danish, English, Finnish, Hungarian, Norwegian, and Mandarin and Swedish.
The offline voices can now speak at a faster rate.
Plus lots of bug fixes and performance improvements.
June 2016
Added support for Swedish and Vietnamese.
Bug fixes and improvements.
October 2016
Alternative voice variations now available on every device.
Added support to amplify speech volume over other audio.
Extended support for emoji verbalisation in Chinese, Dutch, Danish, English, French, German, Italian, Japanese, Korean, Polish, Portuguese, Russian and Spanish.
Bug fixes and improvements.
April 2017
Added support for Bengali, Czech, Khmer, Nepali, Sinhala and Ukrainian.
Number processing can now be turned off in settings. This produces a more literal pronunciation of the text. For example, 09/10/2017 will be pronounced as oh nine slash ten... Only available for English voices.
Intonation control is now available for more voices.
Various other improvements to various voices.
October 2017
Added support for Filipino and Greek.
January 2018
Added support for Estonian, Romanian and Slovak.
Various other improvements to our voices.
July 2018
Added support for French, Javanese and Sundanese.
More voices to choose from: English, English and French
All voices for a language are now downloaded together, saving storage space on a device.
Performance improvements for 64-bit devices.
Various other improvements to voices.
August 2019
Added support for English, Gujarati, Kannada, Malayalam, Marathi, Portuguese, Tamil, Telugu and Urdu.
New app icon.and many more feataures

Popular movies

The Hunger Games (film) - 2012 American dystopian action thriller science fiction-adventure film directed by Gary Ross and based on Suzanne Collins’s 2008 novel of the same name. It is the first insta...
untitled Captain Marvel sequel - part of Marvel Cinematic Universe....
Killers of the Flower Moon (film project) - Killers of the Flower Moon - film project in United States of America. It was presented as drama, detective fiction, thriller. The film project starred Leonardo Dicaprio, Robert De Niro. Director of...
Five Nights at Freddy's (film) - Five Nights at Freddy's - film published in 2017 in United States of America. Scenarist of the film - Scott Cawthon....

Popular books

Book of Revelation - The Book of Revelation is the final book of the New Testament, and consequently is also the final book of the Christian Bible. Its title is derived from the first word of the Koine Greek text: apok...
Book of Genesis - account of the creation of the world, the early history of humanity, Israel's ancestors and the origins...
Gospel of Matthew - The Gospel According to Matthew is the first book of the New Testament and one of the three synoptic gospels. It tells how Israel's Messiah, rejected and executed in Israel, pronounces judgement on ...
Michelin Guide - Michelin Guides are a series of guide books published by the French tyre company Michelin for more than a century. The term normally refers to the annually published Michelin Red Guide , the oldest...
Psalms - The Book of Psalms , commonly referred to simply as Psalms , the Psalter or "the Psalms", is the first book of the Ketuvim , the third section of the Hebrew Bible, and thus a book of th...
Ecclesiastes - Ecclesiastes is one of 24 books of the Tanakh , where it is classified as one of the Ketuvim . Originally written c. 450–200 BCE, it is also among the canonical Wisdom literature of the Old Tes...
The 48 Laws of Power - non-fiction book by American author Robert Greene. The book...

Popular television series

The Crown (TV series) - historical drama web television series about the reign of Queen Elizabeth II, created and principally written by Peter Morgan, and produced by Left Bank Pictures and Sony Pictures Tel...
Friends - American sitcom television series, created by David Crane and Marta Kauffman, which aired on NBC from September 22, 1994, to May 6, 2004, lasting ten seasons. With an ensemble cast sta...
Young Sheldon - spin-off prequel to The Big Bang Theory and begins with the character Sheldon...
Modern Family - American television mockumentary family sitcom created by Christopher Lloyd and Steven Levitan for the American Broadcasting Company. It ran for eleven seasons, from September 23...
Loki (TV series) - upcoming American web television miniseries created for Disney+ by Michael Waldron, based on the Marvel Comics character of the same name. It is set in the Marvel Cinematic Universe, shar...
Game of Thrones - American fantasy drama television series created by David Benioff and D. B. Weiss for HBO. It...
Shameless (American TV series) - American comedy-drama television series developed by John Wells which debuted on Showtime on January 9, 2011. It...

Google Text-to-Speech

Supported languages