
Fast speaker recognition based on hierarchical recognition (基于分层识别的快速说话人识别研究)

Cited by: 3
Abstract: As the number of enrolled speaker models grows, the recognition speed of a speaker recognition system drops and can no longer meet real-time requirements. To address this, the paper proposes a fast speaker recognition method based on a hierarchical recognition model. An approximation of the KL divergence, obtained by the variational method, serves as the similarity measure between speaker models, and a speaker model clustering method is designed on top of it. Experimental results show that the method yields valid speaker model clusters and substantially increases recognition speed at the cost of only a small loss in recognition rate.
Authors: MAO Zheng-chong (茅正冲), TU Wen-hui (涂文辉). Key Laboratory of Advanced Process Control for Light Industry, Ministry of Education, Jiangnan University, Wuxi 214122, China
Source: Computer Engineering & Science (《计算机工程与科学》), 2018, No. 7, pp. 1244-1249 (6 pages). Indexed in CSCD and the Peking University Core Journals list (北大核心).
Funding: National Natural Science Foundation of China (60973095); Natural Science Foundation of Jiangsu Province (BK20131107)
Keywords: Gaussian mixture model; speaker recognition; KL divergence; model clustering
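The abstract names a variational approximation of the KL divergence between speaker models as the similarity measure. The paper's exact formulation is not reproduced on this page, so the following is only a minimal sketch of one widely used variational approximation for Gaussian mixture models, assuming diagonal covariances and strictly positive mixture weights. The function names, the `(weight, mean, variance)` tuple layout, the symmetrized distance, and the nearest-centroid assignment step are all illustrative assumptions, not the authors' implementation.

```python
import math


def kl_diag_gauss(mu0, var0, mu1, var1):
    """Closed-form KL(N0 || N1) for two diagonal-covariance Gaussians."""
    return 0.5 * sum(
        math.log(v1 / v0) + (v0 + (m0 - m1) ** 2) / v1 - 1.0
        for m0, v0, m1, v1 in zip(mu0, var0, mu1, var1)
    )


def _log_sum_exp(xs):
    """Numerically stable log(sum(exp(x) for x in xs))."""
    top = max(xs)
    return top + math.log(sum(math.exp(x - top) for x in xs))


def kl_variational(f, g):
    """Variational approximation of KL(f || g) between two GMMs.

    Each GMM is a list of (weight, mean, variance) tuples (assumed layout),
    with weight > 0 and per-dimension variances.
    """
    total = 0.0
    for w_a, mu_a, var_a in f:
        # log of sum_a' w_a' * exp(-KL(f_a || f_a')), within the same mixture
        log_num = _log_sum_exp(
            [math.log(w) - kl_diag_gauss(mu_a, var_a, mu, var) for w, mu, var in f]
        )
        # log of sum_b w_b * exp(-KL(f_a || g_b)), against the other mixture
        log_den = _log_sum_exp(
            [math.log(w) - kl_diag_gauss(mu_a, var_a, mu, var) for w, mu, var in g]
        )
        total += w_a * (log_num - log_den)
    return total


def symmetric_distance(f, g):
    """Symmetrized divergence (an assumption; KL itself is asymmetric)."""
    return kl_variational(f, g) + kl_variational(g, f)


def assign_to_clusters(models, centroids):
    """Index of the nearest centroid model for each speaker model."""
    return [
        min(range(len(centroids)), key=lambda c: symmetric_distance(m, centroids[c]))
        for m in models
    ]


# Toy one-component, one-dimensional models (made-up values).
speaker_a = [(1.0, [0.1], [1.0])]
speaker_b = [(1.0, [4.9], [1.0])]
centroid_0 = [(1.0, [0.0], [1.0])]
centroid_1 = [(1.0, [5.0], [1.0])]
labels = assign_to_clusters([speaker_a, speaker_b], [centroid_0, centroid_1])
```

In a hierarchical setup of this kind, a test utterance would first be matched to a cluster and then scored only against the speaker models in that cluster, which is where the speed-up described in the abstract would come from. How the paper builds and updates its cluster representatives is not stated here, so that part is left out of the sketch.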

