期刊文献+

一种结合支持向量机训练的锚模型语种识别方法 被引量:1

Research on Language Identification Method Based on Anchor Model Combined With Support Vector Machine
下载PDF
导出
摘要 在针对电话语音的语种识别系统中,训练语音和测试语音之间存在不同说话人的个性差异带来的干扰,是影响系统识别性能的一个重要因素.基于此,本文首先对当前语种识别系统中消除此影响的方法进行研究,对比分析它们各自的优缺点,选择将锚模型方法引入语种识别系统中,该方法将语料映射至说话人无关的锚超矩阵进而消除说话人相关信息.针对锚超矩阵的选择存在语种混淆和信息冗余等问题,本文并提出一种结合支持向量机的锚模型训练算法,该方法下得到的锚超矩阵更具语种区分性,并去除了混淆信息的影响,增强了矩阵的紧致性.实验结果表明,新方法下的锚模型映射方法能有效提高基线系统的识别性能,并降低了语种识别系统训练和识别时的计算量. In language identification on the telephone conversation speech, there is a disturbance in the recognition rate caused by speaker variability between test and train utterances, which is a key factor in increasing the system's performance. To tackle this problem, firstly we study the different ways to eliminate this influence, contrast and analyze the advantages and disadvantages of them, then introduce the anchor model into the language identification system, in which the influence is eliminated by a projection of utterance into the speaker independent anchor super matrix. Then in order to solve the problems of languages mixture and information waste, a training algorithm of anchor model based on support vector machine is proposed, by which a language discriminative and information compact anchor super matrix is constructed, then we choose the projection again to achieve the goal of avoiding the impartibility sample in high dimensionality. The results indicate that new language discriminative anchor model trained by support vector machine is better for improving the performance and reducing the system's calculation of training and testing than the original baseline model.
出处 《小型微型计算机系统》 CSCD 北大核心 2013年第4期837-842,共6页 Journal of Chinese Computer Systems
基金 国家"八六三"高技术研究发展计划重点项目(2008AA011002)资助
关键词 语种识别 锚模型 支持向量机 区分性 language identification anchor model support vector machine discriminative
  • 相关文献

参考文献16

  • 1Torres-Carrasquillo P A, Singer E, Kohler M A, et al. Approaches to language identification using Gaussian mixture models and shif- ted delta cepstral features[ C]. In Proc. ICSLP,2002:89-92. 被引量:1
  • 2Reynolds D A, Quatieri T F, Dunn R B. Speaker verification using adapted Gaussian mixture models [ J ]. Digital Signal Processing, 2000,10( 1-3 ) : 19-41. 被引量:1
  • 3Deng Nai-yang, Tian Ying-jie. Support vector machine: theory arith- metic and expansion [ M ]. Beijing: Science Press,2009. 被引量:1
  • 4Campbell W M, Sturim D E, Reynolds D A, et al. SVM basedspeaker verification using a GMM supervector kernel and NAP vari- ability compensation[ C]. In Proc. ICASSP, 2006:97-100. 被引量:1
  • 5Elad N, Aronowitz H. Efficient language identification using anchor model and support vector machines [ C ]. Proc of the Speaker and Language Recognition Workshop, IEEE Press, 2006 : 145. 被引量:1
  • 6Elad N, Aronowitz H. Efficient language identification using anchor model and support vector machines [ C ]. Proc of the Speaker and Language Recognition Workshop, IEEE Press, 2006 : 145. 被引量:1
  • 7Lin C. LIBSVM: a library for support vector machines [M]. ht- tp ://www. csic. ntu. tw/cjlin/libsvrn/index, htm1,2010. 被引量:1
  • 8Burger L ,Matejka P,Cemocky J. Discriminative training techniques for acoustic language identification[ C]. Proc of the IEEE Interna- tional Conference on Acoustics, Speech and Signal Processing, Tou- louse, France, 2006:209 -212. 被引量:1
  • 9Aronowitz H, Burshtein D, Amir A. Speaker indexing in audio ar- chives using test utterance Ganssian mixture modeling [ C ]. In Proc. ICSLP, 2004. 被引量:1
  • 10Castaldo F, Colibro D,Dalmasso E. Acoustic language identification using fast discriminative training [ C ]. Inter Speech' 07, 2007,8 ( 2 ) : 346 -349. 被引量:1

二级参考文献13

  • 1Torres-Carrasquillo P A,Singer E,Kchler M A,et.al.Approaches to Language Identification Using Gaussian Mixture Models and Shifted Delta Cepstral Features// Proc of the International Conference on Spoken Language Processing.Denver,USA,2002:89 -92. 被引量:1
  • 2Pelecanos J,Sridharan S.Feature Warping for Robust Speaker Verification// Proc of the Speaker and Language Recognition Workshop.Crete,Greece,2001:213 -218. 被引量:1
  • 3Burget L,Matejka P,Cernocky J.Discriminative Training Techniques for Acoustic Language Identification//Proc of the IEEE International Conference on Acoustics,Speech and Signal Processing.Toulouse,France,2006:209-212. 被引量:1
  • 4Qu Dan,Wen Bingxi.Discriminative Training of GMM for Language Identification//Proc of the ICSA and IEEE Workshop on Spontaneous Speech Processing and Recognition.Tokyo,Japan,2003:108 -110. 被引量:1
  • 5Campbell W M,Sturim D E,Reynolds D A.SVM Based Speaker Verification Using a GMM Supervector Kernel and NAP Variability //Proc of the IEEE International Conference on Acoustics,Speech and Signal Processing.Toulouse,France,2006:97-100. 被引量:1
  • 6Smith N,Gales M.Data-Dependent Kernels in SVM Classification of Speech Patterns// Proc of the 6th International Conference on Spoken Language Processing.Beijing,China,2000,Ⅰ:297 -300. 被引量:1
  • 7Campbell M,Campbell J P,Reynolds D A,et al.Support Vector Machines for Speaker and Language Recognition.Computer Speech and Language,2006,20(2/3):210-229. 被引量:1
  • 8Bishop C M.Pattern Recognition and Machine Learning.New York,USA:Springer,2006. 被引量:1
  • 9Chang C C,Lin C J.LIBSVM:A Library for Support Vector Machines[DB/OL].[2008-10-20].http://www.csie.ntu.edu.tw/_cjlin/libsvm. 被引量:1
  • 10Hatch A O,Stolcke A.Generalized Linear Kernels for One Versus All Classification:Application to Speaker Recognition//Proc of the IEEE International Conference on Acoustics,Speech and Signal Processing.Toulouse,France,2006,Ⅴ:585-588. 被引量:1

共引文献3

同被引文献5

引证文献1

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部