一种结合支持向量机训练的锚模型语种识别方法被引量：1

Research on Language Identification Method Based on Anchor Model Combined With Support Vector Machine

下载PDF

导出

摘要在针对电话语音的语种识别系统中,训练语音和测试语音之间存在不同说话人的个性差异带来的干扰,是影响系统识别性能的一个重要因素.基于此,本文首先对当前语种识别系统中消除此影响的方法进行研究,对比分析它们各自的优缺点,选择将锚模型方法引入语种识别系统中,该方法将语料映射至说话人无关的锚超矩阵进而消除说话人相关信息.针对锚超矩阵的选择存在语种混淆和信息冗余等问题,本文并提出一种结合支持向量机的锚模型训练算法,该方法下得到的锚超矩阵更具语种区分性,并去除了混淆信息的影响,增强了矩阵的紧致性.实验结果表明,新方法下的锚模型映射方法能有效提高基线系统的识别性能,并降低了语种识别系统训练和识别时的计算量. In language identification on the telephone conversation speech, there is a disturbance in the recognition rate caused by speaker variability between test and train utterances, which is a key factor in increasing the system＇s performance. To tackle this problem, firstly we study the different ways to eliminate this influence, contrast and analyze the advantages and disadvantages of them, then introduce the anchor model into the language identification system, in which the influence is eliminated by a projection of utterance into the speaker independent anchor super matrix. Then in order to solve the problems of languages mixture and information waste, a training algorithm of anchor model based on support vector machine is proposed, by which a language discriminative and information compact anchor super matrix is constructed, then we choose the projection again to achieve the goal of avoiding the impartibility sample in high dimensionality. The results indicate that new language discriminative anchor model trained by support vector machine is better for improving the performance and reducing the system＇s calculation of training and testing than the original baseline model.

作者常振超张兴明杨镇西张丽

机构地区国家数字交换系统工程技术研究中心

出处《小型微型计算机系统》 CSCD 北大核心 2013年第4期837-842,共6页 Journal of Chinese Computer Systems

基金国家"八六三"高技术研究发展计划重点项目(2008AA011002)资助

关键词语种识别锚模型支持向量机区分性 language identification anchor model support vector machine discriminative

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献16

1Torres-Carrasquillo P A, Singer E, Kohler M A, et al. Approaches to language identification using Gaussian mixture models and shif- ted delta cepstral features[ C]. In Proc. ICSLP,2002:89-92. 被引量：1
2Reynolds D A, Quatieri T F, Dunn R B. Speaker verification using adapted Gaussian mixture models [ J ]. Digital Signal Processing, 2000,10( 1-3 ) : 19-41. 被引量：1
3Deng Nai-yang, Tian Ying-jie. Support vector machine: theory arith- metic and expansion [ M ]. Beijing: Science Press,2009. 被引量：1
4Campbell W M, Sturim D E, Reynolds D A, et al. SVM basedspeaker verification using a GMM supervector kernel and NAP vari- ability compensation[ C]. In Proc. ICASSP, 2006:97-100. 被引量：1
5Elad N, Aronowitz H. Efficient language identification using anchor model and support vector machines [ C ]. Proc of the Speaker and Language Recognition Workshop, IEEE Press, 2006 : 145. 被引量：1
6Elad N, Aronowitz H. Efficient language identification using anchor model and support vector machines [ C ]. Proc of the Speaker and Language Recognition Workshop, IEEE Press, 2006 : 145. 被引量：1
7Lin C. LIBSVM: a library for support vector machines [M]. ht- tp ://www. csic. ntu. tw/cjlin/libsvrn/index, htm1,2010. 被引量：1
8Burger L ,Matejka P,Cemocky J. Discriminative training techniques for acoustic language identification[ C]. Proc of the IEEE Interna- tional Conference on Acoustics, Speech and Signal Processing, Tou- louse, France, 2006:209 -212. 被引量：1
9Aronowitz H, Burshtein D, Amir A. Speaker indexing in audio ar- chives using test utterance Ganssian mixture modeling [ C ]. In Proc. ICSLP, 2004. 被引量：1
10Castaldo F, Colibro D,Dalmasso E. Acoustic language identification using fast discriminative training [ C ]. Inter Speech' 07, 2007,8 ( 2 ) : 346 -349. 被引量：1

二级参考文献13

1Torres-Carrasquillo P A,Singer E,Kchler M A,et.al.Approaches to Language Identification Using Gaussian Mixture Models and Shifted Delta Cepstral Features// Proc of the International Conference on Spoken Language Processing.Denver,USA,2002:89 -92. 被引量：1
2Pelecanos J,Sridharan S.Feature Warping for Robust Speaker Verification// Proc of the Speaker and Language Recognition Workshop.Crete,Greece,2001:213 -218. 被引量：1
3Burget L,Matejka P,Cernocky J.Discriminative Training Techniques for Acoustic Language Identification//Proc of the IEEE International Conference on Acoustics,Speech and Signal Processing.Toulouse,France,2006:209-212. 被引量：1
4Qu Dan,Wen Bingxi.Discriminative Training of GMM for Language Identification//Proc of the ICSA and IEEE Workshop on Spontaneous Speech Processing and Recognition.Tokyo,Japan,2003:108 -110. 被引量：1
5Campbell W M,Sturim D E,Reynolds D A.SVM Based Speaker Verification Using a GMM Supervector Kernel and NAP Variability //Proc of the IEEE International Conference on Acoustics,Speech and Signal Processing.Toulouse,France,2006:97-100. 被引量：1
6Smith N,Gales M.Data-Dependent Kernels in SVM Classification of Speech Patterns// Proc of the 6th International Conference on Spoken Language Processing.Beijing,China,2000,Ⅰ:297 -300. 被引量：1
7Campbell M,Campbell J P,Reynolds D A,et al.Support Vector Machines for Speaker and Language Recognition.Computer Speech and Language,2006,20(2/3):210-229. 被引量：1
8Bishop C M.Pattern Recognition and Machine Learning.New York,USA:Springer,2006. 被引量：1
9Chang C C,Lin C J.LIBSVM:A Library for Support Vector Machines[DB/OL].[2008-10-20].http://www.csie.ntu.edu.tw/_cjlin/libsvm. 被引量：1
10Hatch A O,Stolcke A.Generalized Linear Kernels for One Versus All Classification:Application to Speaker Recognition//Proc of the IEEE International Conference on Acoustics,Speech and Signal Processing.Toulouse,France,2006,Ⅴ:585-588. 被引量：1

共引文献3

1张丽,杨镇西,吉立新.语种识别算法中GSV计算的定点仿真与实现[J].计算机工程与设计,2012,33(2):679-683. 被引量：1
2聂智良,张兴明,杨镇西,张丽.区分性锚模型应用于语种识别的研究[J].计算机工程,2012,38(3):172-175. 被引量：1
3常振超,刘斌,石远超,张兴明,杨镇西,张丽.一种层次化空间分析方法在语种识别系统中的应用[J].计算机应用研究,2012,29(10):3651-3654.

同被引文献5

1陈世雄,宫琴,金慧君.用Gammatone滤波器组仿真人耳基底膜的特性[J].清华大学学报（自然科学版）,2008,48(6):1044-1048. 被引量：33
2宋鹏,王浩,赵力.基于混合Gauss归一化的语音转换方法[J].清华大学学报（自然科学版）,2013,53(6):757-761. 被引量：3
3花城,李辉.小训练语料下基于均值超矢量聚类的说话人确认方法[J].数据采集与处理,2014,29(2):238-242. 被引量：4
4詹海峰,田红心,牛博,李从林.基于多分辨率高斯滤波器组的时频分析方法[J].中国电子科学研究院学报,2017,12(6):654-661. 被引量：5
5倪纪伟,彭妙颜.基于Fisher比的Bark倒谱系数混合特征参数提取方法[J].电声技术,2019,43(1):30-33. 被引量：3

引证文献1

1陈旭,蒋晔.基于高斯滤波器组混合特征的录音回放攻击检测研究[J].计算机工程,2021,47(3):291-297. 被引量：2

二级引证文献2

1陈露,周欣,陈洪刚,何小海,王正勇,卿粼波.基于特征压缩和残差网络的语音重放检测[J].智能计算机与应用,2021,11(11):54-58.
2王军华,李富强,胡晓华,吴健军.变电站电气设备运行状态自动化预测方法[J].自动化与仪表,2023,38(11):17-20. 被引量：1

1李贵华.获取个性化文件列表[J].网络运维与管理,2014(6):63-64.
2陈学梁,常振超,杨镇西,张丽.一种多路实时语种识别系统设计与实现[J].计算机应用研究,2013,30(7):2041-2043.
3朱义鑫,闵东.基于网络的HMM异常检测方法研究[J].计算机工程与应用,2006,42(24):145-148. 被引量：1
4杨宏伟,王焕坤,李庆全.网络结构的软件系统可靠性分配方法[J].微型机与应用,2012,31(19):8-11.
5梁春燕,安茂波,刘振业,索宏彬,汪俊杰.高斯超向量-支持向量机鉴别性语种识别系统[J].计算机工程与应用,2013,49(2):174-176.
6周娟,杨鼎才.基于GA-SVM的说话人辨认的参数优化[J].电子技术（上海）,2008,0(2):52-53. 被引量：2
7南楚.锁屏就要锁出特色[J].电脑爱好者,2015,0(18):46-47.
8毕振威.网络分析法在物流服务供应商选择中的应用[J].电子世界,2012(17):88-89. 被引量：1
9徐嘉明,张卫强,杨登舟,刘加,夏善红.基于流形正则化极限学习机的语种识别系统[J].自动化学报,2015,41(9):1680-1685. 被引量：12
10董艳艳,章桥新,赵颖.基于技术层面的供应商选择与评价研究[J].机械制造,2009,47(7):15-18. 被引量：2

小型微型计算机系统

2013年第4期

浏览历史

内容加载中请稍等...

一种结合支持向量机训练的锚模型语种识别方法被引量：1

参考文献16

二级参考文献13

共引文献3

同被引文献5

引证文献1

二级引证文献2

相关作者

相关机构

相关主题

浏览历史

一种结合支持向量机训练的锚模型语种识别方法 被引量：1

参考文献16

二级参考文献13

共引文献3

同被引文献5

引证文献1

二级引证文献2

相关作者

相关机构

相关主题

浏览历史

一种结合支持向量机训练的锚模型语种识别方法被引量：1