期刊文献+

基于GMM的说话人识别技术研究 被引量:6

Research on GMM based speaker recognition technology
下载PDF
导出
摘要 为了探讨高斯混合模型在说话人识别中的作用,设计了一个基于GMM的说话人识别系统。整个系统由音频信号预处理,语音活动检测,说话人模型建立以及音频信号识别4个模块组成。前三个模块构成了系统的模型训练部分,最后一个模块构成了系统的语音识别部分。包含在第二个模块中的由GMM模型搭建的语音活动检测器是研究的创新之处。利用增强的多方互动会议语料库中的视听会议对系统中的部分可调参数以及系统的识别错误率进行了测试。仿真结果表明,在语音活动检测器和若干滤波算法的帮助下,系统对包含重叠语音的音频信号的识别准确率可以达到83.02%。 In order to investigate the function of Gaussian Mixture Model(GMM) in speaker recognition,a GMM based speaker recognition system is designed.The system consists of four modules that are audio signal pre-processing,speech activity detection,speaker modeling as well as audio signal recognition.The first three modules constitute the model training segment of the system and the last module constitutes the speech recognition segment of the system.A speech activity detector which is built by GMM in the second module is the innovation of the research.Some tunable parameters and recognition error rate of the system are tested using audio-visual meetings in the Augmented Multi-party Interaction(AMI) corpus.Simulations show that with the help of the speech activity detector and several filter algorithms,recognition accuracy rate of the system for audio signal with overlap speech can reach 83.02%.
作者 曹洁 潘鹏
出处 《计算机工程与应用》 CSCD 北大核心 2011年第11期114-117,共4页 Computer Engineering and Applications
基金 甘肃省自然科学基金No.1010RJZA046 甘肃省教育厅研究生导师基金项目(No.0914ZTB003)~~
关键词 高斯混合模型 语音活动检测 识别错误率 Gaussian Mixture Mode(lGMM) speech activity detection recognition error rate
  • 相关文献

参考文献11

  • 1Wooters C, Ftmg J, Peskin B, et al.Towards robust speaker seg- mentation: The ICSI-SRI fall 2004 diarization system[C]//Proc of Fall 2004 Rich Transcription Workshop,New York,Palisades, 2004:315-320. 被引量:1
  • 2Anguera X, Wooters C, Peskin B, et al.Robust speaker segmentation for meetings: The ICSI-SRI spring 2005 diarization system[J].Machine Learning for Multimodal Interaction,2006,3869:402-414. 被引量:1
  • 3Anguera X, Wooters C, Pardo J M.Robust speaker diarization for meetings:ICSI RT06s evaluation system[J].Lecture Notes in Computer Science,2006,4299 : 346-358. 被引量:1
  • 4Wooters C, Huijbregts M.The ICSI RT07s speaker diarization system[J].Multimodal Technologies for Perception of Humans, 2008,4625 : 509-519. 被引量:1
  • 5蒋晔,唐振民.GMM文本无关的说话人识别系统研究[J].计算机工程与应用,2010,46(11):179-182. 被引量:27
  • 6彭竹苗,张正道.主机型异常检测的隐半马尔可夫模型方法[J].计算机工程与应用,2008,44(33):115-118. 被引量:2
  • 7李杰,刘贺平.高斯序列核支持向量机用于说话人识别[J].计算机工程与应用,2010,46(18):183-185. 被引量:5
  • 8李邵梅,郭云飞,卫红权.基于分布特征统计的说话人识别[J].计算机工程与应用,2009,45(34):118-120. 被引量:2
  • 9Carletta J,Ashby S,Bourban S,et al.The AMI meeting corpus: A preannouncement[C]//Proc of the Workshop on Machine Learning for Multimodal Interaction(MLMI), Edinburgh,2005 : 325-336. 被引量:1
  • 10Chen S, Gopalakrishnan P.Speaker, environment, and channel change detection and clustering via the bayesian information criterion[C]//Proc of Broadcast News Transcription and Under-standing Workshop, San Mateo, 1998 : 127-132. 被引量:1

二级参考文献21

共引文献31

同被引文献55

  • 1王伟,邓辉文.基于MFCC参数和VQ的说话人识别系统[J].仪器仪表学报,2006,27(z3):2253-2255. 被引量:30
  • 2陈芬菲.基于GMM的说话人识别系统[J].微处理机,2006,27(4):76-77. 被引量:3
  • 3吴礼福,解焱陆,戴蓓蒨,李辉.基于CGMM-UBM的电话短语音说话人确认[J].电路与系统学报,2007,12(5):131-136. 被引量:2
  • 4李圆.基于GMM说话人分类的说话人识别系统研究[D].保定:华北电力大学,2008. 被引量:1
  • 5E.C.Lin and R.A.Rutenbar.A multi-FPGA 10x-real-time high-speed search engine for a 5000-word vocabulary speech recognizer[J].ACM SIGDA International SymposiumonFPGA,2009,12:83-92. 被引量:1
  • 6Phaklen EhKan,Timothy Allen and Steven F.FPGA Implementation for GMM-Based Speaker Identification[J].International Journal of Reconfigurable Computing,2011,01:1-8. 被引量:1
  • 7R.Ramos-Lara,M.Lopez Garcia.SVM speaker verification system based on a low-cost FPGA[J].FPL.2009.In proceeding of Field Programmable Logic and Applications,2011,09:582-586. 被引量:1
  • 8J.N.Holmes and W.Holmes.Speech Synthesis and Recognition[M].CRC press,2011. 被引量:1
  • 9M.shi and A.Bermak.An efficient digital VLSI implementation of Gaussian mixture models-based classifier[J].IEEE Transactions on Very Large Scale Integration (VLSI) Systems,2006,14:962-974. 被引量:1
  • 10http://china.xilinx.com/support/documentation/virtex-ii.htm[OL]. 被引量:1

引证文献6

二级引证文献17

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部