
A separation method of singing and accompaniment combining discriminative training deep neural network (Cited by: 2)

Abstract: To address the difficulty of separating singing from accompaniment in musical signals, an improved music separation method based on a discriminatively trained deep neural network (DNN) was proposed. First, based on the DNN model, an improved objective function was presented for discriminative training; it considers both the reconstruction errors and the discrimination information between singing and accompaniment. Then, an additional layer was added to the DNN model, introducing time-frequency masking to refine the estimated accompaniment of the song, and the corresponding time-domain signal was obtained by the inverse Fourier transform. Finally, the influence of different parameters on the separation performance was examined, and the method was compared with existing music separation methods. The experimental results showed that the improved objective function and the introduction of time-frequency masking significantly improved the separation performance of the DNN, and that the separation performance improved by about 4 dB compared with other existing music separation methods, indicating that the proposed method is an effective music separation algorithm.
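The two ideas named in the abstract, a discriminative objective and a time-frequency masking layer, can be illustrated with a minimal NumPy sketch. The abstract does not give the paper's exact formulas, so the forms below are assumptions: the loss is a commonly used discriminative variant (reconstruction error minus a weighted cross-source error), the mask is a standard soft ratio mask, and the names `discriminative_loss`, `soft_mask`, `apply_masks`, and the weight `gamma` are hypothetical.

```python
import numpy as np


def discriminative_loss(voice_est, accomp_est, voice_ref, accomp_ref, gamma=0.05):
    """Assumed discriminative objective (not taken from the paper).

    Rewards estimates that are close to their own target and far from the
    other source's target. `gamma` is a hypothetical trade-off weight.
    """
    recon = np.sum((voice_est - voice_ref) ** 2) + np.sum((accomp_est - accomp_ref) ** 2)
    cross = np.sum((voice_est - accomp_ref) ** 2) + np.sum((accomp_est - voice_ref) ** 2)
    return recon - gamma * cross


def soft_mask(voice_est, accomp_est, eps=1e-8):
    """Soft ratio mask for the singing voice, with values in [0, 1]."""
    v = np.abs(voice_est)
    a = np.abs(accomp_est)
    return v / (v + a + eps)  # eps avoids division by zero in silent bins


def apply_masks(mix_mag, voice_est, accomp_est):
    """Split the mixture magnitude spectrogram so the two parts sum to the mixture."""
    mask = soft_mask(voice_est, accomp_est)
    return mask * mix_mag, (1.0 - mask) * mix_mag


# Toy magnitude "spectrograms" with one frame and two frequency bins.
mix = np.array([[2.0, 4.0]])
voice = np.array([[1.0, 3.0]])
accomp = np.array([[1.0, 1.0]])
voice_masked, accomp_masked = apply_masks(mix, voice, accomp)
```

In a full pipeline the masked magnitudes would be combined with the phase of the mixture and passed through an inverse short-time Fourier transform to obtain the time-domain signals; that step is omitted here.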
Source: Chinese Journal of Acoustics (声学学报, English edition), CSCD, 2019, No. 2, pp. 227-239 (13 pages)
Funding: Supported by the National Natural Science Foundation of China (61671095, 61371164, 61702065, 61701067, 61771085), the Project of Key Laboratory of Signal and Information Processing of Chongqing (CSTC2009CA2003), the Chongqing Graduate Research and Innovation Project (CYS17219), and the Research Project of Chongqing Educational Commission (KJ130524, KJ1600427, KJ1600429).