期刊文献+

基于协方差描述子和黎曼流形的语音情感识别 被引量:5

Speech Emotion Recognition Based on Covariance Descriptor and Riemannian Manifold
原文传递
导出
摘要 提出一种基于协方差描述子和黎曼流形的语音情感识别方法.根据提取的语音声学特征,计算协方差矩阵用于表征语句的情感信息.考虑到非奇异协方差矩阵所构成空间的高维特性,引入一种仿射不变度量使得该空间满足黎曼流形的要求.进而根据微分几何,建立基于黎曼流形的算法架构.实验证明,该方法在语音情感识别中获得较好的识别效果,尤其在噪声环境下能更有效地提高识别准确率. An algorithm for speech emotion recognition is proposed based on covariance descriptor and Riemannian manifold. According to the extracted acoustic features, covariance matrices are computed as the emotion descriptors of sentences. With the consideration of high dimensional characteristic of the space constructed by non-singular covariance matrices, an affine invariance metric is adopted to make the space meet the requirement of Riemannian manifold. With differential geometry, the speech emotion recognition is performed on the manifold. The experimental results show a significant improvement in recognition accuracy, especially under noisy environments.
出处 《模式识别与人工智能》 EI CSCD 北大核心 2009年第5期673-677,共5页 Pattern Recognition and Artificial Intelligence
基金 国家自然科学基金项目(No.60873124) 国家科技支撑计划项目(No.2008BAH26B02)资助
关键词 语音情感识别 协方差描述子 黎曼流形 噪声环境 支持向量机(SVM) Speech Emotion Recognition, Covariance Descriptor, Riemannian Manifold, Noisy Environment, Support Vector Machine (SVM)
  • 相关文献

参考文献10

  • 1林奕琳,韦岗,杨康才.语音情感识别的研究进展[J].电路与系统学报,2007,12(1):90-98. 被引量:33
  • 2王志良,陈锋军,薛为民.人脸表情识别方法综述[J].计算机应用与软件,2003,20(12):63-66. 被引量:39
  • 3刘丹,张乃尧,朱汉城.基于曲式和情感识别的音乐动画CAD系统[J].模式识别与人工智能,2003,16(3):283-287. 被引量:2
  • 4赵力,将春辉,邹采荣,吴镇扬.语音信号中的情感特征分析和识别的研究[J].电子学报,2004,32(4):606-609. 被引量:49
  • 5Pao T L, Chen Y T, Yeh J H. Emotion Recognition from Mandarin Speech Signals// Proc of the International Symposium on Chinese Spoken Language Processing. Hongkong, China, 2004 : 301 - 304. 被引量:1
  • 6Nicholson J, Takahashi K, Nakatsu R. Emotion Recognition in Speech Using Neural Networks. Neural Computing and Applications, 2000, 2(2) : 495 -501. 被引量:1
  • 7Yu Feng, Chang E, Xu Yingqing, et al. Emotion Detection from Speech to Enrich Multimedia Content// Proc of the IEEE Pacific Rim Conference on Multimedia. Beijing, China, 2001:550 -557. 被引量:1
  • 8Tuzel O, Porikli F, Meer P. Region Covariance : A Fast Descriptor for Detection and Classification// Proc of the 9th European Conference on Computer Vision. Graz, Austria, 2006, Ⅱ : 589 -600. 被引量:1
  • 9Fletcher P T, Joshi S. Riemannian Geometry for the Statistical Analysis of Diffusion Tensor Data. Signal Processing, 2007, 87 ( 2 ) : 250 - 262. 被引量:1
  • 10Pennec X, Fillard P, Ayache N. A Riemannian Framework for Tensor Computing. International Journal of Computer Vision, 2006, 66(1) : 41 -66. 被引量:1

二级参考文献68

  • 1陶建华,谭铁牛.数字化人类情感——和谐人机交互环境中的情感计算[J].微电脑世界,2004(1):29-32. 被引量:12
  • 2詹永照,曹鹏.语音情感特征提取和识别的研究与实现[J].江苏大学学报(自然科学版),2005,26(1):72-75. 被引量:16
  • 3周迪伟 高东杰(译).计算机语音处理[M].国防工业出版社,1987.. 被引量:3
  • 4斯托曼(张燕云译).情绪心理学[M].辽宁出版社,.. 被引量:1
  • 5刘丹 张乃尧.模糊数学在音乐特征描述中的应用[A]..见:2001 年中国智能自动化会议论文集[C].昆明,2000,I.304—309. 被引量:1
  • 6伊·杜波夫斯基 等 著陈敏 译.和声学教程(增订重译本)[M].北京:人民音乐出版社,1998.. 被引量:1
  • 7Radocy R E, Boyle J D. Psychological Foundations of Musical Behavior. Illinois: Charles C Thomas Publisher, 1988. 被引量:1
  • 8Liu D, Zhang N Y, Zhu H C. A Fuzzy Expert System to Recognize Sentiment of Music and Its Application in Music Fountains.In: Proc of the Imemational Conference on Control and Automation, Xiamen, China, 2002, 1854- 1857. 被引量:1
  • 9Roth R. Music and Animation Tool Kit: Modules for Computer Multimedia Composition.Computer Applications,1996, 32( 1 ):137- 144. 被引量:1
  • 10Fels S, Mase K. Iamascope: A Graphical Musical Instrument.Computers & Graphics, 1999, 23:277-286. 被引量:1

共引文献115

同被引文献123

引证文献5

二级引证文献44

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部