期刊文献+

复杂环境下基于深度学习的声音识别研究

Research on Deep Learning Based Sound Recognition in Complex Environments
下载PDF
导出
摘要 针对复杂环境下的声音识别问题,提出一种基于深度学习的声音识别方法。首先,通过自适应滤波降噪和梅尔频率倒谱系数(Mel-Frequency Cepstral Coefficient,MFCC)提取等方法提取声音特征。其次,采用L2正则化的卷积神经网络(Convolutional Neural Network,CNN)识别声音,以提高模型的泛化能力和准确性。最后,使用ESC-50数据集对所提方法进行验证和测试。实验结果表明,该方法的精确率、准确率及召回率均优于对比方法。 A deep learning based sound recognition method is proposed for the problem of sound recognition in complex environments.Firstly,sound features are extracted through methods such as adaptive filtering noise reduction and Mel-Frequency Cepstral Coefficient(MFCC)extraction.Secondly,L2 regularized Convolutional Neural Network(CNN)are used to recognize sounds,in order to improve the model’s generalization ability and accuracy.Finally,validate and test the proposed method using the ESC-50 dataset.The experimental results show that the accuracy,precision,and recall of this method are superior to the comparison methods.
作者 付兆婷 FU Zhaoting(Baiyin College,Baiyin Open University,Baiyin 730900,China)
出处 《电声技术》 2024年第5期40-42,共3页 Audio Engineering
关键词 复杂环境 卷积神经网络(CNN) 声音识别 complex environment Convolutional Neural Network(CNN) voice recognition
  • 相关文献

参考文献10

二级参考文献148

  • 1徐静,李卫红,孙懋珩,魏捷,陈圆,李昕.基于麦克风阵列的车辆鸣笛嗅探器[J].数据采集与处理,2012,27(S2):262-266. 被引量:2
  • 2傅初黎,李洪芳,熊向团.不适定问题的迭代Tikhonov正则化方法[J].计算数学,2006,28(3):237-246. 被引量:33
  • 3姜洪臣,郑榕,张树武,徐波.基于SDC特征和GMM-UBM模型的自动语种识别[J].中文信息学报,2007,21(1):49-53. 被引量:14
  • 4王书诏,邱天爽.说话人识别研究综述[J].电声技术,2007,31(1):51-55. 被引量:9
  • 5赵力.语音信号处理[M].北京:机械工业出版社,2008. 被引量:10
  • 6Temko A,Malkin R,Zieger C,et al.CLEAR Evaluation of Acoustic Event Detection and Classification Systems[C]//Proc.of the 1st International Evaluation Conference on Classification of Events,Activities and Relationships.Heidelberg,Germany: Springer-Verlag,2007: 311-322. 被引量:1
  • 7Heittola T,Klapuri A.TUT Acoustic Event Detection System[C]// Proc.of the 2nd International Evaluation Conference on Classification of Events,Activities and Relationships.Heidelberg,Germany: Springer-Verlag,2008: 364-370. 被引量:1
  • 8常西畅,周艳玲,陈进.机械设备噪声故障诊断的新进展[C].北京:全国振动(诊断、模态、噪声与结构动力学) 工程及应用学术会议论文集,2002:140-143. 被引量:5
  • 9ZAMPARO M,STRAMAGLIA S,BANAVAR J,et al.Inverse Problem for Multivariate Time Series Using Dynamical Latent Variables[J].Physica A:Statistical Mechanics and its Applications,2012,391(11):3159-3169. 被引量:1
  • 10WERBOS P.Beyond regression:New Tools for Prediction and Analysis in the Behavioral Sciences[D].Massachusetts:Harvard University,1974. 被引量:1

共引文献2290

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部