Unit selection synthesis for audiobooks
| Name of the audiobook | Author | Read by | Running time |
|---|---|---|---|
| Olive (Voice 1) | Dinah Maria Mulock CRAIK | Arielle Lipshaw | 14:03:13 |
Table 1: Details of the audiobook downloaded from LibriVox for building the unit selection voice.
| Name of the course | Instructor | Running time |
|---|---|---|
| Introduction to Public Speaking (Voice 2) | Dr. Matt McGarrity | ~12 hrs |
Table 2: Details of the course downloaded from Coursera for building the unit selection voice.
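| Percentage units used | Voice 1: ASR trained on OLIVE data | Voice 1: ASR trained on Librispeech | Voice 2: ASR trained on Lecture data | Voice 2: ASR trained on Librispeech |
|---|---|---|---|---|
| 100 | [audio] | [audio] | [audio] | [audio] |
| 70 | [audio] | [audio] | [audio] | [audio] |
| 50 | [audio] | [audio] | [audio] | [audio] |
| 30 | [audio] | [audio] | [audio] | [audio] |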
Table 3: Synthesized versions of a semantically unpredictable sentence (SUS) taken from the Blizzard 2013 test corpus.
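| Percentage units used | Voice 1: ASR trained on OLIVE data | Voice 1: ASR trained on Librispeech | Voice 2: ASR trained on Lecture data | Voice 2: ASR trained on Librispeech |
|---|---|---|---|---|
| 100 | [audio] | [audio] | [audio] | [audio] |
| 70 | [audio] | [audio] | [audio] | [audio] |
| 50 | [audio] | [audio] | [audio] | [audio] |
| 30 | [audio] | [audio] | [audio] | [audio] |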
Table 4: Synthesized versions of a "news" sentence taken from the Blizzard 2013 test corpus.