全局基因调控网络构建CPU/GPU并行算法

CPU / GPU Parallel Algorithm for Constructing Genome-wide Gene Regulatory Networks

下载PDF

导出

摘要对基因表达谱分块,使之符合GPU并行计算的线程结构特性,根据GPU线程结构特性设计双层并行模式,并利用纹理缓存实现访存高效;依据CPU二级缓存容量对基本块进一步细分成子块以提高缓存命中率,利用数据预取技术减少访存次数,利用线程绑定技术减少线程在核心之间的迁移;依据多核CPU和GPU的计算能力分配CPU和GPU的基因互信息计算任务以平衡CPU与GPU的计算负载;在设计新的阈值计算算法基础上,设计实现了访存高效的构建全局基因调控网络CPU/GPU并行算法.实验结果表明,与已有算法相比,本文算法加速更明显,并且能够构建更大规模的全局基因调控网络. Gene expression profile is parted into some basic blocks to conform GPU parallel computing threads structural characteris- tics, a two-level GPU parallel processing mode is designed by using the characteristic of thread structure for GPU ,and efficient memo- ry access is achieved by using texture cache memory. Each basic block is further divided to several sub-blocks according to the capaci- ty of L2 cache for CPU in order to enhance the hit rate of access cache, the data prefetching techniques are used to reduce memory ac- cess times, the thread-bound techniques are used to reduce migration between threads in the core. The tasks to compute mutual infor- mation between genes are distributed to CPU and GPUs according to their computation ability in order to balance the computation loads for CPU and GPU. Based on designing a new threshold computation algorithm, this paper proposes the cache-efficient CPU/GPU parallel algorithms for constructing genome-wide gene regulatory networks. Experimental results show that, compared with the existing parallel algorithm, our algorithms achieve higher speedup and can construct larger scale genome-wide gene regulatory networks.

作者陈绪伟钟诚

机构地区广西大学计算机与电子信息学院

出处《小型微型计算机系统》 CSCD 北大核心 2015年第2期234-239,共6页 Journal of Chinese Computer Systems

基金国家自然科学基金项目(61462005)资助广西自然科学基金项目(2014GXSFAA118396)资助广西教育厅-广西大学博士点建设基金项目(P11900119)资助广西研究生教育创新计划项目(YCSZ2013006)资助

关键词全局基因调控网络 CPU与GPU协同计算访存高效并行算法 genome-wide gene regulatory network CPU/GPU cooperative computing efficient access cache parallel algorithm

分类号 TP301 [自动化与计算机技术—计算机系统结构]

引文网络
相关文献

参考文献6

1鹿中龙,钟诚,黄华林.多核计算机上非递归并行计算矩阵乘积[J].小型微型计算机系统,2011,32(5):860-866. 被引量：5
2孙啸,陆祖宏,谢建明编著..生物信息学基础[M].北京:清华大学出版社,2005:336.
3傅祖芸编著..信息论基础理论与应用[M].北京:电子工业出版社,2007:514.
4徐宏强..基于信息论的基因调控网络分析与重构方法探索[D].华东师范大学,2008:
5钟诚,陈国良.PRAM和LARPBS模型上的近似串匹配并行算法[J].软件学报,2004,15(2):159-169. 被引量：19
6ZHANG Jingfu, XIE Jingyi, DENG Zhiwei & LU Zhiheng Key Laboratory for Quantum Information and Measurements, Department of Physics, Tsinghua University, Beijing 100084, China,Center for Quantum Information, Tsinghua University, Beijing 100084, China,Department of Materials Science and Engineering, Beijing Normal University, Beijing 100875, China,Testing and Analytical Center, Beijing Normal University, Beijing 100875, China,Department of Physics, Beijing Normal University, Beijing 100875, China.Dense coding scheme using superpositions of Bell-states and its NMR implementation[J].Science China(Physics,Mechanics & Astronomy),2005,48(1):57-67. 被引量：19

二级参考文献14

1陈国良.并行算法的可扩放性分析[J].小型微型计算机系统,1995,16(2):10-16. 被引量：12
2Park JH,George KM.Efficient parallel hardware algorithms for string matching.Microprocessors and Microsystems,1999,23(3):155-168. 被引量：1
3Lester B.The Art of Parallel Programming.Englewood Cliffs:Prentice Hall,1993. 被引量：1
4Alan AB,Mei A.A residue number system on reconfigurable mesh with applications to prefix sums and approximate string matching.IEEE Trans.on Parallel and Distributed Systems,2000,11(11):1186-1199. 被引量：1
5Pan Y,Li Y,Li J,LI K,Zheng SQ.Efficient parallel algorithms for distance maps of 2-D binary images using an optical bus.IEEE Trans.on Systems,Man,and Cybernetics-Part A:Systems and Humans,2002,32(2):228-236. 被引量：1
6Han Y,Pan Y,Shen H.Sublogarithmic deterministic selection on arrays with a reconfigurable optical bus.IEEE Trans.on Computers,2002,51(6):702-706. 被引量：1
7Navarro G.A guided tour to approximate string matching.ACM Computing Surveys,2001,33(1):31-88. 被引量：1
8Navarro G,Baeza-Yates R.A hybrid indexing method for approximate string matching.Journal of Discrete Algorithms,2000,1(1):21-49. 被引量：1
9Lee H-C,Ercal F.RMESH algorithms for parallel string matching.In:Proc.of the 3rd Int'l.Symp.on Parallel Architectures,Algorithms,and Networks(I-SPAN'97).Los Alamitos:IEEE Computer Society Press,1997.223～226.http://ieeexplore.ieee.org/ xpl/tocresult.jsp?i 被引量：1
10Jiang Y,Wright AH.O(k)parallel algorithms for approximate string matching.Journal of Neural Parallel and Scientific Computation,1993,1:443-452. 被引量：1

共引文献40

1钟诚,宋彬.生物序列比对算法分析与比较[J].广西大学学报（自然科学版）,2004,29(3):214-221. 被引量：2
2陈宏建,陈峻,吕为.基于LARPBS模型的快速并行归并排序算法[J].扬州大学学报（自然科学版）,2005,8(3):1-5.
3宋彬,陈国良,鄢超,沈一飞.多序列比对问题的并行近似算法[J].中国科学技术大学学报,2005,35(5):656-664. 被引量：3
4刘玉慧,陈宏建,陈崚.基于流水光总线模型的快速归并排序算法[J].计算机工程与应用,2006,42(3):28-32.
5罗程,钟诚,李智.网络入侵检测系统中无导师学习分析器的设计[J].计算机工程与科学,2006,28(7):28-29.
6宫婧,孙知信,顾强.基于行为特征描述的P2P流识别方法的研究[J].小型微型计算机系统,2007,28(1):48-53. 被引量：5
7LI ChunYan1,2,LI XiHan1,2,DENG FuGuo1,2,3,ZHOU Ping1,2 & ZHOU HongYu1,2,3 1 Key Laboratory of Beam Technology and Material Modification of Ministry of Education,Beijing Normal University,Beijing 100875,China,2 Institute of Low Energy Nuclear Physics,and Department of Material Science and Engineering,Beijing Normal University,Beijing 100875,China,3 Beijing Radiation Center,Beijing 100875,China.Complete multiple round quantum dense coding with quantum logical network[J].Chinese Science Bulletin,2007,52(9):1162-1165. 被引量：5
8范曾,钟诚,莫倩芸,刘萍.机群系统上基于Hashing的多目标串匹配并行算法[J].微电子学与计算机,2007,24(9):165-168.
9沈一飞,陈国良,张强锋.PRAM和LARPBS模型上有向序列翻转距离并行算法(英文)[J].软件学报,2007,18(11):2683-2690.
10范大娟,钟诚,许莉莉.异构机群系统上近似串匹配并行算法[J].计算机工程,2008,34(3):141-144. 被引量：1

1贵得有道理——我选Intel 64位P4[J].电脑采购,2005,0(46):29-29.
2殷均平,孔素然,张铁墨.一种利用水平集的图像分割快速GPU并行算法[J].计算机应用与软件,2013,30(3):280-282.
3费雄伟,李肯立,阳王东.基于CUDA的AES并行算法优化[J].计算机工程,2014,40(9):6-12. 被引量：3
4CPU性能与二级缓存容量的关系——Athlon XP 3000+带来的思考[J].电脑自做,2003(4):31-38.
5赵旭东,梁书秀,孙昭晨,刘忠波,韩松林,任喜峰.基于GPU并行算法的水动力数学模型建立及其效率分析[J].大连理工大学学报,2014,54(2):204-209. 被引量：11
6杨帅.你需要大“仓库”吗？浅谈CPU二级缓存容量[J].微型计算机,2005(4):122-123. 被引量：1
7孟小华,黄丛珊,朱丽莎.基于CUDA的热传导GPU并行算法研究[J].计算机工程,2014,40(5):41-44. 被引量：3
8邢星星,赵国兴,骆祖莹,方浩.基于GPU的全源最短路径算法[J].计算机科学,2012,39(3):299-303. 被引量：3
9丁伟.Intel服务器处理器全面出击[J].微电脑世界,2003(14):63-63.
10旧貌换新颜新款Duron全面接触[J].电脑自做,2003(10):31-37.

小型微型计算机系统

2015年第2期

浏览历史

内容加载中请稍等...

全局基因调控网络构建CPU/GPU并行算法

参考文献6

二级参考文献14

共引文献40

相关作者

相关机构

相关主题

浏览历史