期刊文献+

时频分区扰动实现音频分类对抗样本生成

Adversarial Example Generation for Audio Classification Based on Time-Frequency Partitioned Perturbatio
下载PDF
导出
摘要 现有方法生成的音频分类对抗样本(adversarial example, AE)攻击成功率低,易被感知。鉴于此,设计了一种基于时频分区扰动(time-frequency partitioned perturbation, TFPP)的音频AE生成框架。音频信号的幅度谱根据时频特性被划分为关键和非关键区域,并生成相应的对抗扰动。在TFPP基础上,提出了一种基于生成对抗网络(generative adversarial network, GAN)的AE生成方法TFPPGAN,以分区幅度谱为输入,通过对抗训练自适应调整扰动约束系数,同时优化关键和非关键区域的扰动。3个典型音频分类数据集上的实验表明,与基线方法相比,TFPPGAN可将AE的攻击成功率、信噪比分别提高4.7%和5.5 dB,将生成的语音对抗样本的质量感知评价得分提高0.15。此外,理论分析了TFPP框架与其他攻击方法相结合的可行性,并通过实验验证了这种结合的有效性。 The adversarial examples generated by the existing methods generally suffer from a low attack success rate and are easy to perceive.To address these problems,this paper first designs an audio adversarial example generation framework based on Time-Frequency Partitioned Perturbation(TFPP).Leveraging the time-spectral characteristics of the audio signal,the framework divides the magnitude spectrum of the input audio signal into critical regions and non-critical regions,and generates the corresponding perturbations.Building upon this framework,this paper further proposes a Generative Adversarial Network(GAN)-based adversarial example generation method named TFPPGAN.TFPPGAN takes magnitude spectra as inputs and uses adversarial training to simultaneously optimize the adversarial perturbations in critical and non-critical regions by adaptively adjusting the partitioned perturbation constraint coefficients.Exhaustive comparison experiments are conducted on three typical audio classification datasets.The experimental results show that,compared with baseline methods,TFPPGAN can improve the attack success rate and signal-to-noise ratio by 4.7%and 5.5 dB respectively.The perceptual evaluation score of generated adversarial speech quality also improves by 0.15.Besides,this paper theoretically analyzes the feasibility of the combination of TFPP with other attack methods,and experimentally verify the effectiveness of this combination.
作者 张雄伟 张强 杨吉斌 孙蒙 李毅豪 ZHANG Xiongwei;ZHANG Qiang;YANG Jibin;SUN Meng;LI Yihao(College of Command&Control Engineering,Army Engineering University of PLA,Nanjing 210007,China)
出处 《陆军工程大学学报》 2024年第1期1-11,共11页 Journal of Army Engineering University of PLA
基金 国家自然科学基金(62071484)。
关键词 音频分类 对抗样本 生成对抗网络 分区扰动 audio classification adversarial example generative adversarial network partitioned perturbation
  • 相关文献

参考文献2

二级参考文献21

  • 1Lin C S,Wang D R.Spectrogram image encoding based on dynamic Hilbert curve routing[C]//International Confer- ence on Image Processing Theory Tools and Applications.IEEE,2010:107-111. 被引量:1
  • 2Ke Y,Hoiem D,Sukthankar R.Computer vision for music i- dentification[C]//IEEE Computer Society Conference on Computer Vision and Pattern Recognition.IEEE,2005:597-604. 被引量:1
  • 3Schutte K,Glass J.Speech recognition with localized time- frequency pattern detectors[C]//lEEE Workshop on Auto- matic Speech Recognition and Understanding.IEEE,2007:341-346. 被引量:1
  • 4Koch C,Ullman S.Shifts in selective visual attention:to- wards the underlying neural circuitry[J],Human Neurobi- ology,1985,4(4):219-245. 被引量:1
  • 5Itti L, Koch C,Niebur E.A model of saliency-based visual attention for rapid scene analysis[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,1998,20(11):1254-1259. 被引量:1
  • 6Dick M,Ullman S,Sagi D.Parallel and serial processes in motion detection.[J].Science,1987,237(4813):400-402. 被引量:1
  • 7Achanta R,Hemami S,Estrada F,et al.Frequency-tuned salient region detection[C]//International Conference on Computer Vision and Pattern Recognition.IEEE,2009:1597-1604. 被引量:1
  • 8Harel J,Koch C,Perona P.Graph-based visual saliency[J].Advances in Neural Information Processing Systems,2006,19:545-552. 被引量:1
  • 9Bruce N D B,Tsotsos J K.Saliency based on information maximization[J].Advances in Neural Information Process- ing Systems,2005,18(3):298-308. 被引量:1
  • 10Rahtu E,Kannala J,Salo M,et al.Segmenting salient ob- jects from images and videos[C]//Proceedings of the Ilth European Conference on Computer Vision:Part V.Spring- er-Verlag,2010:366-379. 被引量:1

共引文献8

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部