Title: An Overview of Prosodic Modelling for Croatian Speech Synthesis
Abstract: To include prosody in text-to-speech (TTS) systems, prosodic knowledge needs to be acquired, represented and incorporated. The two prosodic features most important for TTS modelling are duration and the F0 contour. Approaches to modelling these features fall into three main groups: rule-based, statistical and minimalistic. Among the best-known approaches to duration modelling are Klatt's model, classification and regression trees (CART) and neural networks; for F0 modelling, the best known are ToBI, the Fujisaki model and the Tilt model. A procedure for automatic intonation event detection on Croatian texts, based on the Tilt model, was evaluated in terms of root mean square error (RMSE) values for the generated F0 contours.
Publication Year: 2013
Publication Date: 2013-01-01
Language: en
Type: article
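
The abstract reports evaluation by RMSE over generated F0 contours but does not spell out the computation. The following is a minimal sketch of one common way to compute it, not the authors' procedure. The function name `f0_rmse`, the frame-by-frame alignment of the two contours, and the convention that unvoiced frames are stored as 0 Hz and skipped are all assumptions made for illustration.

```python
import math


def f0_rmse(reference_hz, generated_hz):
    """Return the RMSE in Hz between two frame-aligned F0 contours.

    Only frames that are voiced in both contours contribute.
    Assumption: unvoiced frames are encoded as 0 Hz.
    """
    if len(reference_hz) != len(generated_hz):
        raise ValueError("contours must have the same number of frames")

    # Keep only the frame pairs where both contours carry an F0 value.
    pairs = [(r, g) for r, g in zip(reference_hz, generated_hz) if r > 0 and g > 0]
    if not pairs:
        raise ValueError("no frames are voiced in both contours")

    squared_error = sum((r - g) ** 2 for r, g in pairs)
    return math.sqrt(squared_error / len(pairs))
```

A lower value means the generated contour follows the reference more closely. Whether unvoiced frames are skipped or interpolated changes the result, so the convention should be stated alongside any reported figure.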