Abstract
Environmental Sound Classification (ESC) is one of the most important topics in non-speech audio classification. In recent years, deep neural network (DNN) methods have made considerable progress in ESC. However, DNNs are computationally and memory-intensive and cannot be directly deployed on IoT devices based on microcontroller units (MCUs). To address this problem, this paper proposes a DNN compression method for highly resource-constrained devices. Since a DNN model has too many parameters to be deployed directly, a pruning method is used for substantial compression; to counter the accuracy loss caused by pruning, a knowledge distillation method based on the feature information of intermediate layers of the model is designed. Experiments are carried out on the public UrbanSound8K and ESC-50 datasets using an STM32F746ZG device. The results demonstrate that the proposed method achieves a compression rate of up to 97% while maintaining good inference accuracy and speed.
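The paper itself does not include code; the following is a minimal PyTorch sketch of the two steps the abstract names, namely magnitude pruning of a convolutional ESC classifier and a distillation loss that also matches intermediate-layer features. The toy `SmallCNN` architecture, layer choices, pruning ratio, and loss weights are illustrative assumptions, not the authors' actual models or hyperparameters.

```python
# Sketch (not the authors' code): magnitude pruning + intermediate-feature
# knowledge distillation for a small ESC classifier. All sizes are assumptions.
import torch
import torch.nn as nn
import torch.nn.functional as F
import torch.nn.utils.prune as prune

class SmallCNN(nn.Module):
    """Toy ESC classifier operating on log-mel spectrogram patches."""
    def __init__(self, n_classes=10, width=16):
        super().__init__()
        self.conv1 = nn.Conv2d(1, width, 3, padding=1)
        self.conv2 = nn.Conv2d(width, width * 2, 3, padding=1)
        self.head = nn.Linear(width * 2, n_classes)

    def forward(self, x):
        f1 = F.relu(self.conv1(x))                    # intermediate feature 1
        f2 = F.relu(self.conv2(F.max_pool2d(f1, 2)))  # intermediate feature 2
        logits = self.head(F.adaptive_avg_pool2d(f2, 1).flatten(1))
        return logits, [f1, f2]

teacher = SmallCNN(width=32)   # larger, pre-trained network (assumed)
student = SmallCNN(width=16)   # compact network intended for the MCU

# Step 1: unstructured magnitude pruning, zeroing the smallest 90% of weights.
for m in student.modules():
    if isinstance(m, nn.Conv2d):
        prune.l1_unstructured(m, name="weight", amount=0.9)

def distill_loss(x, y, T=4.0, alpha=0.5, beta=0.1):
    """Cross-entropy + soft-label KD + intermediate-feature matching."""
    with torch.no_grad():
        t_logits, t_feats = teacher(x)
    s_logits, s_feats = student(x)
    ce = F.cross_entropy(s_logits, y)
    kd = F.kl_div(F.log_softmax(s_logits / T, dim=1),
                  F.softmax(t_logits / T, dim=1),
                  reduction="batchmean") * T * T
    # Match intermediate feature maps; when teacher/student channel counts
    # differ, a learned 1x1 adapter would normally be used (sliced here for brevity).
    feat = sum(F.mse_loss(sf, tf[:, :sf.shape[1]])
               for sf, tf in zip(s_feats, t_feats))
    return ce + alpha * kd + beta * feat

x = torch.randn(8, 1, 64, 64)          # dummy spectrogram batch
y = torch.randint(0, 10, (8,))
loss = distill_loss(x, y)
loss.backward()
```

In this sketch the student is trained with the combined loss after pruning, so the remaining weights can recover the accuracy lost to compression, which is the role the abstract assigns to the intermediate-feature distillation step.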
Authors
孟娜
方维维
路红英
MENG Na; FANG Wei-wei; LU Hong-ying (School of Computer and Information Technology, Beijing Jiaotong University, Beijing 100044, China)
Source
《计算机与现代化》
2024, No. 1, pp. 80-86 (7 pages)
Computer and Modernization
Keywords
environmental sound classification
edge computing
microcontroller unit
pruning
knowledge distillation
quantization