一种支持跨幅访存的向量化代码生成方法

Effective Vectorization Technique for Interleaved Data with Constant Strides

下载PDF

导出

摘要随着SIMD扩展部件的迅速发展,自动向量化工具已逐渐成熟。现阶段的工具能对连续访存程序进行较好的处理,然而,大部分非连续访存的多媒体程序并不能被转换为高效的向量化代码。提出并实现了一种支持跨幅访存的向量化代码生成方法,其利用目标系统已有的基本数据处理指令实现多个向量间的任意重组来解决含有非连续访存语句的向量化代码生成问题。经过实验分析和验证,提出的代码生成方法能够将含有跨幅访存的语句转化为面向目标系统的高效向量化代码,以提高程序执行效率。 Due to the development of the SIMD extensions in general processors, automatic vectorizing compilers are widely used in various fields,especially in scientific and engineering computing area. Conventional vectorizing compilers can parallelize applications with continuous access successfully, but most irregular multimedia applications which access interleaved data cannot be vectorized correctly. To address this issue, this paper presented an effective vectorization technique for interleaved data with constant strides. We achieved any form of data regroupings with the help of the data processing instructions provided hy targeted platforms. As a result, programs with interleaved data access are veetorized and vector codes are generated. The experimental results show that the proposed method can translate irregular applications with interleaved data access into high-performance targeted vectorized codes, thereby advancing the execution efficiency adequately.

作者李朋远赵荣彩高伟张庆花

机构地区信息工程大学数学工程与先进计算国家重点实验室

出处《计算机科学》 CSCD 北大核心 2015年第5期194-199,203,共7页 Computer Science

基金 "核高基"国家科技重大专项(2009ZX01036)资助

关键词代码生成跨幅访存向量化数据重组 Code generation, Stride access, Vectorization, Data regrouping

分类号 TP311.5 [自动化与计算机技术—计算机软件与理论]

引文网络
相关文献

参考文献18

1Hassaballah M,Omran S,Mahdy Y B.A review of SIMD multimedia extensions and their usage in scientific and engineering applications[J].The Computer Journal,2008,51(6):630-649. 被引量：1
2Allen R,Kennedy K.Automatic translation of Fortran programs to vector form[J].ACM Transactions on Programming Languages and Systems (TOPLAS),1987,9(4):491-542. 被引量：1
3Zima H,Chapman B.Supercompilers for parallel and vectorcomputers[M].ACM,1990. 被引量：1
4Larsen S,Amarasinghe S.Exploiting superword level parallelism with multimedia instruction sets[M].ACM,2000. 被引量：1
5Griffith A.GCC:the complete reference[M].McGraw-Hill,lnc.,2002. 被引量：1
6Lattner C,Adve V.The LLVM compiler framework and infrastructure tutorial[M]∥Languages and Compilers for High Performance Computing.Springer Berlin Heidelberg,2005:15-16. 被引量：1
7Lin M,Yu Z,Zhang D,et al.Retargeting the open64 compiler to powerpc processor[C]∥International Conference on Embedded Software and Systems Symposia,2008(ICESS Symposia'08).IEEE,2008:152-157. 被引量：1
8Naishlos D.Autovectorization in GCC[C]∥Proceedings of the 2004 GCC Developers Summit.2004:105-118. 被引量：1
9Nuzman D,Zaks A.Autovectorization in GCC-two years later[C]∥Proceedings of the 2006 GCC Developers Summit.2006:145-158. 被引量：1
10Nuzman D,Henderson R.Multi-platform auto-vectorization[C]∥Proceedings of the International Symposium on Code Generation and Optimization.IEEE Computer Society,2006:281-294. 被引量：1

二级参考文献12

1冯小安,祁兵.电力信息系统安全体系的构建[J].电网技术,2008,32(S1):77-80. 被引量：13
2曾诚,王爱民,肖田元,范莉娅.面向全生命周期的项目管理系统分析与设计[J].计算机工程与设计,2005,26(4):853-856. 被引量：8
3战德臣,王忠杰,徐晓飞,孟凡超,李晋.面向企业资源计划全生命周期的建模方法及工具[J].计算机集成制造系统,2006,12(9):1345-1351. 被引量：9
4GB/T22239-2008信息安全技术-信息系统安全等级保护基本要求[S].北京:中国标准出版社,2009. 被引量：1
5中国国家标准化管理委员会.信息系统安全等级保护测评过程指南[S].北京:中国标准出版社,2009. 被引量：1
6中国国家标准化管理委员会.信息系统安全等级保护测评要求[S].北京:中国标准出版社,2009. 被引量：1
7陈树勇,宋书芳,李兰欣,沈杰.智能电网技术综述[J].电网技术,2009,33(8):1-7. 被引量：1125
8林宇锋,钟金,吴复立.智能电网技术体系探讨[J].电网技术,2009,33(12):8-14. 被引量：283
9钟金,郑睿敏,杨卫红,吴复立.建设信息时代的智能电网[J].电网技术,2009,33(13):12-18. 被引量：153
10曹荣章,杨争林,朱为民,胡俊,沈利华,宋燕敏,严小文.新一代电力市场运营系统的研究与设计[J].电网技术,2006,30(S2):124-128. 被引量：5

共引文献4

1张海华.基于可信计算和主动防御的等级保护在电厂的应用[J].信息安全与通信保密,2013,11(12):127-129. 被引量：4
2陈孟元.制冷液数控在线信息采集系统的设计与实现[J].重庆理工大学学报（自然科学）,2015,29(4):102-106. 被引量：2
3宋天予,黄立,卢黎明.信息系统全生命周期安全风险评估体系研究[J].湖州师范学院学报,2017,39(2):57-61. 被引量：2
4王瑞民,司群,雷丝萦.铁路信息系统全生命周期质量与风险评估体系研究[J].铁路计算机应用,2023,32(11):1-5. 被引量：1

1赵亮.Sql语句防注入攻击研究[J].企业导报,2013(18). 被引量：1
2吴铁洲,徐元中,武明虎.XML查询语句转换成SQL语句的实现[J].湖北工业大学学报,2005,20(1):59-60.
3魏帅,赵荣彩,姚远,侯永生.面向SIMD的数组重组和对齐优化[J].计算机科学,2012,39(2):305-310. 被引量：3
4高伟,赵荣彩,韩林,庞建民,丁锐.SIMD自动向量化编译优化概述[J].软件学报,2015,26(6):1265-1284. 被引量：30
5刘鹏,赵荣彩,李朋远.一种面向向量化的动态指针别名分析框架[J].计算机科学,2015,42(3):26-30. 被引量：4
6侯永生,赵荣彩,黄磊,韩林.面向SIMD扩展部件的循环优化研究[J].计算机科学,2014,41(5):27-32. 被引量：1
7尹华成.Visual Studio.NET中实现XML Web Services浅析[J].中文信息（程序春秋）,2003(8):30-34. 被引量：1
8高健,陈杰.一种基于数字信号处理器的媒体处理器结构及设计[J].微电子学与计算机,2007,24(4):1-4. 被引量：4
9高伟,韩林,赵荣彩,徐金龙,陈超然.向量并行度指导的循环SIMD向量化方法[J].软件学报,2017,28(4):925-939. 被引量：5
10周芸韬.基于R语言的大数据处理平台的设计与实现[J].现代电子技术,2017,40(2):53-56. 被引量：20

计算机科学

2015年第5期

浏览历史

内容加载中请稍等...

一种支持跨幅访存的向量化代码生成方法

参考文献18

二级参考文献12

共引文献4

相关作者

相关机构

相关主题

浏览历史