Automatic feature template generation for intonational phrase prediction (语调短语预测中的特征模板自动生成)


Abstract: In text-to-speech (TTS) systems, automatic intonational phrase prediction is one of the key factors affecting the naturalness and intelligibility of synthetic speech. This paper uses a Maximum Entropy (ME) model to predict intonational phrases from unrestricted text and proposes a hierarchical clustering algorithm that generates feature templates automatically, reducing the human involvement needed during ME model training. Comparative experiments show that, for intonational phrase prediction, the ME model clearly outperforms Classification And Regression Trees (CART). Compared with manually written templates, the automatically generated templates improve the F-score of ME-based intonational phrase prediction by 3.18 and reduce the size of the ME model by up to 78.38.
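The abstract does not spell out the feature set or the clustering procedure, so the following is only a minimal sketch of two ideas it relies on: how a feature template can be instantiated into a concrete feature at a word boundary, and how an F-score is computed for break prediction. The template representation, the attribute names (`word`, `pos`), the padding symbol, and the choice of the balanced F1 measure are all assumptions made for illustration, not the authors' implementation.

```python
from typing import Dict, List, Sequence, Tuple

# A template is a tuple of (relative offset, attribute name) pairs.
# This representation and the attribute names are hypothetical.
Template = Tuple[Tuple[int, str], ...]
Token = Dict[str, str]


def instantiate(template: Template, tokens: Sequence[Token], i: int) -> str:
    """Build one feature string for the boundary after tokens[i]."""
    parts = []
    for offset, attr in template:
        j = i + offset
        # Positions outside the sentence get a padding symbol (an assumption).
        value = tokens[j][attr] if 0 <= j < len(tokens) else "<PAD>"
        parts.append(f"{attr}[{offset:+d}]={value}")
    return "|".join(parts)


def f_score(gold: Sequence[int], pred: Sequence[int]) -> float:
    """Balanced F-score (F1) for the positive class (1 = phrase break)."""
    tp = sum(1 for g, p in zip(gold, pred) if g == 1 and p == 1)
    fp = sum(1 for g, p in zip(gold, pred) if g == 0 and p == 1)
    fn = sum(1 for g, p in zip(gold, pred) if g == 1 and p == 0)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Toy sentence with made-up part-of-speech tags.
tokens = [
    {"word": "we", "pos": "PRON"},
    {"word": "predict", "pos": "VERB"},
    {"word": "breaks", "pos": "NOUN"},
]

# Two toy templates: current POS, and current POS combined with next POS.
templates: List[Template] = [
    ((0, "pos"),),
    ((0, "pos"), (1, "pos")),
]

# Features for the boundary after "predict" (index 1).
features = [instantiate(t, tokens, 1) for t in templates]
```

In a setup like this, each instantiated string would become one binary feature of the ME model, so the number and shape of the templates directly determine the model size, which is why choosing templates automatically can shrink the model.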

Authors: Liu Fangzhou (刘方舟), Tao Jianhua (陶建华)

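Affiliations: College of Mathematics and Computer Science, Hunan Normal University; National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences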

Source: Computer Engineering and Applications (《计算机工程与应用》), indexed in CSCD and the Peking University Core Journals list, 2011, No. 16, pp. 19-21, 34 (4 pages in total)

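Funding: National Natural Science Foundation of China (No. 61011140075); Hunan Provincial Science and Technology Program (No. 2010FJ4131); Scientific Research Project of the Hunan Provincial Department of Education (No. 10C0955)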

Keywords: intonational phrase; feature template; Maximum Entropy (ME); Classification And Regression Tree (CART)

Classification number: TP391 [Automation and Computer Technology: Computer Application Technology]



