Text Normalization for Croatian Speech Synthesis (CROSBI ID 573255)
Conference contribution in proceedings | original scientific paper | international peer review
Statement of responsibility
Beliga, Slobodan ; Martinčić-Ipšić, Sanda
English
Text Normalization for Croatian Speech Synthesis
This paper presents text normalization, an integral part of any text-to-speech (TTS) synthesis system. Text normalization is a set of methods for writing non-standard words (NSWs) in their fully expanded form. The paper presents algorithms that transform NSWs (numbers, dates, times, abbreviations, acronyms and the most common symbols) into expanded Croatian text. It proposes a complete taxonomy for classifying non-standard words in Croatian, together with rule-based normalization methods combined with a lookup dictionary. The paper concludes with a discussion of the possible integration of the proposed text normalization into the existing text-to-speech synthesis system.
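As an illustration only, the combination of rule-based expansion and a lookup dictionary described in the abstract might be sketched as follows. This is not the authors' implementation: the category labels, the dictionary entries and the function names (`classify`, `expand_number`, `normalize`) are assumptions made for the example. The sketch covers only integers from 0 to 99, a few abbreviations and two symbols, and it ignores Croatian gender and case agreement, ordinals, dates and times.

```python
import re

# Hypothetical lookup dictionaries (illustrative entries, not from the paper).
ABBREVIATIONS = {
    "npr.": "na primjer",
    "itd.": "i tako dalje",
    "tj.": "to jest",
    "dr.": "doktor",
}
SYMBOLS = {"%": "posto", "&": "i"}

# Croatian cardinal numerals in their basic (masculine nominative) form.
ONES = [
    "nula", "jedan", "dva", "tri", "četiri", "pet", "šest", "sedam",
    "osam", "devet", "deset", "jedanaest", "dvanaest", "trinaest",
    "četrnaest", "petnaest", "šesnaest", "sedamnaest", "osamnaest",
    "devetnaest",
]
TENS = {
    2: "dvadeset", 3: "trideset", 4: "četrdeset", 5: "pedeset",
    6: "šezdeset", 7: "sedamdeset", 8: "osamdeset", 9: "devedeset",
}


def expand_number(n: int) -> str:
    """Rule-based expansion of an integer in the range 0..99."""
    if not 0 <= n <= 99:
        raise ValueError("this sketch only covers 0..99")
    if n < 20:
        return ONES[n]
    tens, ones = divmod(n, 10)
    return TENS[tens] if ones == 0 else f"{TENS[tens]} {ONES[ones]}"


def classify(token: str) -> str:
    """Assign a token to one of a few assumed NSW categories."""
    if token.lower() in ABBREVIATIONS:
        return "ABBREVIATION"
    if token in SYMBOLS:
        return "SYMBOL"
    if re.fullmatch(r"\d{1,2}", token):
        return "NUMBER"
    if re.fullmatch(r"\d{1,2}%", token):
        return "PERCENT"
    return "PLAIN"  # standard word, or an NSW this sketch does not handle


def normalize_token(token: str) -> str:
    """Expand one token according to its category; leave others unchanged."""
    kind = classify(token)
    if kind == "ABBREVIATION":
        return ABBREVIATIONS[token.lower()]
    if kind == "SYMBOL":
        return SYMBOLS[token]
    if kind == "NUMBER":
        return expand_number(int(token))
    if kind == "PERCENT":
        return f"{expand_number(int(token[:-1]))} {SYMBOLS['%']}"
    return token


def normalize(text: str) -> str:
    """Normalize whitespace-separated tokens and rejoin them with spaces."""
    return " ".join(normalize_token(tok) for tok in text.split())


print(normalize("npr. 25% itd."))
```

Classifying each token before expanding it keeps the taxonomy and the expansion rules separate, so new categories can be added without touching the existing rules. Tokens that match no category, such as multi-digit years, pass through unchanged.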
text normalization; text-to-speech; speech synthesis system
Contribution details
382-387.
2011.
published
Host publication details
D. Čišić, Ž. Hutinski, M. Baranović, M. Mauher, L. Ordanić
Opatija: Hrvatska udruga za informacijsku i komunikacijsku tehnologiju, elektroniku i mikroelektroniku - MIPRO
978-953-233-064-9