Hi!
The programming language of Amazon Polly (TTS) is Python. However, I'm not very familiar with it yet.
I have a list with words and want Amazon Polly to pronounce them and save every word as a new mp3 file. (name could be the word's name)
My list is like:
"
apple
banana
peach
orange
...
"
On the following website there is a code provided by Amazon which result is more mp3 file so seems promising: (I inserted it at the end of my post)
https://aws.amazon.com/blogs/machine-lea...on-script/
My question is, how can I modify that code, so I can use it to generate the pronouncation for each word as a new mp3 file?
Thank you for your help! It means to me a lot!
Have a very nice day!
____
The programming language of Amazon Polly (TTS) is Python. However, I'm not very familiar with it yet.
I have a list with words and want Amazon Polly to pronounce them and save every word as a new mp3 file. (name could be the word's name)
My list is like:
"
apple
banana
peach
orange
...
"
On the following website there is a code provided by Amazon which result is more mp3 file so seems promising: (I inserted it at the end of my post)
https://aws.amazon.com/blogs/machine-lea...on-script/
My question is, how can I modify that code, so I can use it to generate the pronouncation for each word as a new mp3 file?
Thank you for your help! It means to me a lot!
Have a very nice day!
____
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 |
# coding: utf-8 import subprocess import codecs f = codecs. open ( "story.txt" , encoding = 'utf-8' ) cnt = 0 file_names = '' for line in f: rendered = '' line = line.replace( '"' , '\\"' ) command = 'aws polly synthesize-speech --text-type ssml --output-format "mp3" --voice-id "Salli" --text "{0}" {1}' if '\r\n' = = line: #A pause after a paragraph rendered = '<speak><break time= "2s"/></speak>' else : #A pause after a sentence rendered = '<speak><amazon:effect name=\\"drc\\">' + line.strip() + '<break time=\\"1s\\"/></amazon:effect></speak>' file_name = ' polly_out{0}.mp3' . format (u' '.join(str(cnt)).encode(' utf - 8 ')) cnt + = 1 command = command. format (rendered.encode( 'utf-8' ), file_name) file_names + = file_name print command subprocess.call(command, shell = True ) print file_names execute_command = 'cat ' + file_names + '>result.mp3' subprocess.call(execute_command, shell = True ) execute_command = 'rm ' + file_names print 'Removing temporary files: ' + execute_command subprocess.call(execute_command, shell = True ) |