INFORMATICA

Informatica

0868-4952 0868-4952

INFO1039

10.15388/Informatica.2014.29

Article

Building Text Corpus for Unit Selection Synthesis

Kasparaitis

Pijus

pkasparaitis@yahoo.com * Anbinderis

Tomas

tomas@anbinderis.lt Department of Computer Science II, Faculty of Mathematics and Informatics, Vilnius University, Naugarduko 24, LT-03225 Vilnius, Lithuania

*Corresponding author.

01012014

254551562 01022012 01102014

Vilnius University

2014

Abstract

The present paper deals with building the text corpus for unit selection text-to-speech synthesis. During synthesis the target and concatenation costs are calculated and these costs are usually based on the prosodic and acoustic features of sounds. If the cost calculation is moved to the phonological level, it is possible to simulate unit selection synthesis without any real recordings; in this case text transcriptions are sufficient. We propose to use the cost calculated during the test data synthesis simulation to evaluate the text corpus quality. The greedy algorithm that maximizes coverage of certain phonetic units will be used to build the corpus. In this work the corpora optimized to cover phonetic units of different size and weight are evaluated.

Keywords text-to-speech synthesis unit selection greedy algorithm