The Power of Special Characters in Prosodic Word Prediction for Chinese TTS

Page view(s)

Checked on Aug 20, 2025

Please use this identifier to cite or link to this item: https://oar.a-star.edu.sg/communities-collections/articles/12072

Title:

The Power of Special Characters in Prosodic Word Prediction for Chinese TTS

Journal Title:

Chinese Spoken Language Processing (ISCSLP), 2014 9th International Symposium on

DOI:

10.1109/ISCSLP.2014.6936693

Publication URL:

http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=6936693

Authors:

Zhengchen Zhang, Minghui Dong

Keywords:

Publication Date:

14 September 2014

Citation:

Zhengchen Zhang; Minghui Dong, "The power of special characters in prosodicword prediction for Chinese TTS," Chinese Spoken Language Processing (ISCSLP), 2014 9th International Symposium on , vol., no., pp.280,283, 12-14 Sept. 2014 doi: 10.1109/ISCSLP.2014.6936693

Abstract:

Prosodic word (PW) prediction in Chinese Text-To-Speech (TTS) can be formulated as a classification problem that one predicts the tag of every character boundary in a sentence is the PW boundary or not. In this paper, a set of new features called special characters are introduced and put into classifiers to address the PW prediction problem. Some characters often appear at the beginning or at the end of a PW, which make them a strong clue of a PW boundary. Besides, quite a lot of PWs have only one character, which makes such characters special. We select a set of special single characters, special starting characters, and special ending characters to help predict PW boundaries. Some special lexical words are often taken as PWs, and we collect a list of such words for PW boundary prediction. Decision tree, Supporting Vector Machine (SVM), MultiLayer Perceptron, and Random Forests are employed as the classifiers. Other features like part-of-speech (POS) of characters, word length, etc. are also used for PW prediction. In our experiments, we got 90.5% and 91.3% accuracies on two corpora containing 8, 000 and 1, 349 sentences respectively, which proved the efficiency of the method.

License type:

PublisherCopyrights

Funding Info:

Description:

URI:

https://oar.a-star.edu.sg/communities-collections/articles/12072

ISBN:

Collections:

Institute for Infocomm Research

Files uploaded:

Manuscripts in This Item:

File	Size	Format	Action
prosodicword.pdf	80.54 KB	PDF	Open