Amazon Polly is a Text-to-Speech service that turns text into lifelike speech, so you can create applications that talk and build entirely new categories of speech-enabled products. It uses advanced deep learning technologies to synthesize speech that sounds like a human voice.
With dozens of lifelike voices across a variety of languages, you can select the ideal voice and build speech-enabled applications that work in many different countries.
| Language | Female voice | Male voice | Sample text |
|---|---|---|---|
| English | Joanna | Matthew | Hello. Do you speak a foreign language? One language is never enough. |
| Brazilian Portuguese | Vitória | Ricardo | Oi. Você fala algum idioma estrangeiro? Somente um idioma nunca é bastante. |
| Danish | Naja | Mads | Hej. Taler du et fremmed sprog? Et sprog er aldrig nok. |
| French | Léa | Mathieu | Bonjour. Parlez-vous une autre langue que le français? Une langue n'est jamais assez. |
| Korean | Seoyeon | | 안녕하세요? 외국어를 구사하십니까? 이 세상에는 수많은 언어들이 있답니다. |
| Spanish | Penélope | Miguel | Hola. ¿Hablas algún idioma extranjero? Un solo idioma no es suficiente. |
Natural Sounding Voices
Amazon Polly provides dozens of languages and a wide selection of natural-sounding male and female voices. Amazon Polly's fluid pronunciation of text enables you to deliver high-quality voice output for a global audience.
Store & Redistribute Speech
Amazon Polly allows for unlimited replays of generated speech without any additional fees. You can create speech files in standard formats like MP3 and OGG, and serve them from the cloud or locally with apps or devices for offline playback.
Delivering lifelike voices and conversational user experiences requires consistently fast response times. When you send text to Amazon Polly’s API, it returns the audio to your application as a stream so you can play the voices immediately.
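As a rough illustration of the streaming behavior described above, the following Python sketch reads the audio in chunks as it arrives and can also write it to a file for later replay. It assumes a client object exposing boto3's `synthesize_speech` call, which returns the audio under the `AudioStream` key. The helper names (`synthesize_chunks`, `save_speech`, `default_client`), the default voice, and the chunk size are illustrative choices, not part of the service.

```python
from typing import Any, Iterator


def synthesize_chunks(client: Any, text: str, voice_id: str = "Joanna",
                      output_format: str = "mp3",
                      chunk_size: int = 4096) -> Iterator[bytes]:
    """Yield audio bytes chunk by chunk as they are read from the response stream."""
    response = client.synthesize_speech(
        Text=text, VoiceId=voice_id, OutputFormat=output_format
    )
    stream = response["AudioStream"]  # file-like streaming body
    try:
        while True:
            chunk = stream.read(chunk_size)
            if not chunk:
                break
            yield chunk
    finally:
        stream.close()


def save_speech(client: Any, text: str, path: str, **kwargs: Any) -> int:
    """Write the synthesized audio to `path` and return the number of bytes written."""
    total = 0
    with open(path, "wb") as audio_file:
        for chunk in synthesize_chunks(client, text, **kwargs):
            audio_file.write(chunk)
            total += len(chunk)
    return total


def default_client() -> Any:
    """Hypothetical helper: build a real Polly client (needs boto3 and AWS credentials)."""
    import boto3  # imported lazily so the functions above work with any compatible client
    return boto3.client("polly")
```

With credentials configured, `save_speech(default_client(), "Hello. Do you speak a foreign language?", "hello.mp3")` would store an MP3 file; passing `output_format="ogg_vorbis"` requests Ogg Vorbis instead. A player could consume `synthesize_chunks` directly to start playback before the full audio has been received.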
Customize & Control Speech Output
Modify Amazon Polly voices to best suit your needs. Amazon Polly supports lexicons and SSML tags, which let you control aspects of speech such as pronunciation, volume, pitch, and speech rate.
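To make the SSML controls concrete, here is a minimal Python sketch that wraps plain text in a `<prosody>` element setting rate, volume, and pitch. The function name `prosody_ssml` and its defaults are assumptions for illustration. Which attribute values a given voice accepts depends on the voice and engine, so treat the values below as examples rather than a complete list.

```python
from xml.sax.saxutils import escape, quoteattr


def prosody_ssml(text: str, rate: str = "medium", volume: str = "medium",
                 pitch: str = "medium") -> str:
    """Return an SSML document that applies prosody settings to `text`."""
    body = escape(text)  # escape &, <, > so the text cannot break the markup
    return (
        "<speak>"
        f"<prosody rate={quoteattr(rate)} volume={quoteattr(volume)} "
        f"pitch={quoteattr(pitch)}>{body}</prosody>"
        "</speak>"
    )


ssml = prosody_ssml("One language is never enough.", rate="slow", volume="loud")
```

The resulting string would be sent with `TextType="ssml"` in the synthesis request. Lexicons are applied separately, by naming previously uploaded lexicons in the request's `LexiconNames` parameter.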
Amazon Polly’s pay-as-you-go pricing, low cost per character converted, and unlimited replays make it a cost-effective way to voice your applications.
Audio can be used as a complementary medium to written and visual communication. By voicing your content, you can give your audience an alternative way to consume information and meet the needs of a larger pool of readers. Amazon Polly can generate speech in dozens of languages, making it easy to add speech to applications with a global audience, such as RSS feeds, websites, or videos.
Amazon Polly enables developers to give their applications an enhanced visual experience, such as speech-synchronized facial animation or karaoke-style word highlighting. Amazon Polly makes it easy to request an additional stream of metadata with information about when particular sentences, words, and sounds are being pronounced. Using this metadata stream alongside the synthesized speech audio stream, customers can animate avatars and highlight text in their app as it is spoken.
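The metadata stream arrives as one JSON object per line, each carrying a `time` offset in milliseconds, a `type`, and the `value` being spoken. The Python sketch below shows one way karaoke-style highlighting could look up the word for the current playback position. The function names and the `SAMPLE` payload are hand-written for illustration, and the timings are made up rather than real service output.

```python
import json
from bisect import bisect_right
from typing import Dict, List, Optional


def parse_speech_marks(payload: str, mark_type: str = "word") -> List[Dict]:
    """Parse line-delimited JSON marks and keep one type, sorted by time."""
    marks = [json.loads(line) for line in payload.splitlines() if line.strip()]
    selected = [mark for mark in marks if mark["type"] == mark_type]
    return sorted(selected, key=lambda mark: mark["time"])


def mark_at(marks: List[Dict], playback_ms: int) -> Optional[Dict]:
    """Return the latest mark that has started by `playback_ms`, or None if none has."""
    times = [mark["time"] for mark in marks]
    index = bisect_right(times, playback_ms) - 1
    return marks[index] if index >= 0 else None


# Hand-written sample with invented timings, shaped like speech-mark output.
SAMPLE = "\n".join([
    '{"time": 0, "type": "sentence", "start": 0, "end": 6, "value": "Hello."}',
    '{"time": 6, "type": "word", "start": 0, "end": 5, "value": "Hello"}',
    '{"time": 730, "type": "word", "start": 7, "end": 9, "value": "Do"}',
    '{"time": 850, "type": "word", "start": 10, "end": 13, "value": "you"}',
])

words = parse_speech_marks(SAMPLE)
current = mark_at(words, 800)  # the word to highlight 800 ms into playback
```

Speech marks are requested with `OutputFormat="json"` and a `SpeechMarkTypes` list such as `["sentence", "word"]`, in a call separate from the one that returns the audio. The `start` and `end` fields give offsets into the input text, which an app can use to locate the span to highlight.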
With Amazon Polly, your contact centers can engage customers with natural sounding voices. You can cache and replay Amazon Polly’s speech output to prompt callers through interactive voice response (IVR) systems, such as Amazon Connect. Additionally, you can leverage Amazon Polly’s API to deliver automated real-time information such as service status, account and billing inquiries, addresses, and contact information.
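One possible way to cache prompts for replay, sketched in Python: synthesize each distinct prompt once, store the MP3 on disk under a hash of the voice and text, and serve later requests from the file. The function name `cached_prompt` and the cache layout are assumptions for illustration. The client is expected to expose boto3's `synthesize_speech` call.

```python
import hashlib
from pathlib import Path
from typing import Any


def cached_prompt(client: Any, text: str, cache_dir: Path,
                  voice_id: str = "Joanna") -> Path:
    """Return the path to an MP3 of `text`, synthesizing only on a cache miss."""
    key = hashlib.sha256(f"{voice_id}|{text}".encode("utf-8")).hexdigest()
    path = cache_dir / f"{key}.mp3"
    if not path.exists():
        response = client.synthesize_speech(
            Text=text, VoiceId=voice_id, OutputFormat="mp3"
        )
        cache_dir.mkdir(parents=True, exist_ok=True)
        path.write_bytes(response["AudioStream"].read())
    return path
```

Static prompts such as menu options fit this pattern well, while real-time information such as account balances would still be synthesized on demand.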