近似串匹配技术在网络信息搜索、数字图书馆、模式识别、文本挖掘、IP路由查找、网络入侵检测、生物信息学、音乐研究计算等领域具有广泛的应用.基于CREW-PRAM(parallel random access machine with concurrent read and exclusive wri...近似串匹配技术在网络信息搜索、数字图书馆、模式识别、文本挖掘、IP路由查找、网络入侵检测、生物信息学、音乐研究计算等领域具有广泛的应用.基于CREW-PRAM(parallel random access machine with concurrent read and exclusive write)模型,采用波前式并行推进的方法直接计算编辑距离矩阵D,设计了一个允许k-差别的近似串匹配动态规划并行算法,该算法使用(m+1)个处理器,时间复杂度为O(n),算法理论上达到线性加速;采取水平和斜向双并行计算编辑距离矩阵D的方法,设计了一个使用a(m+1)个处理器和O(n/a+m)时间的、可伸缩的、允许k-差别的近似串匹配动态规划并行算法,+<11mna.基于分治策略,通过灵活拆分总线和合并子总线动态重构光总线系统,并充分利用光总线的消息播送技术和并行计算前缀和的方法,实现了汉明距离的并行计算,设计了两个基于LARPBS(linear arrays with reconfigurable pipelined bus system)模型的通信高效、可扩放的允许k-误配的近似串匹配并行算法,其中一个算法使用n个处理器,时间为O(m);另一个为常数时间算法,使用mn个处理器.展开更多
Over 17 and 160 types of chemical modifications have been identified in DNA and RNA,respectively.The interest in understanding the various biological functions of DNA and RNA modifications has lead to the cutting-edge...Over 17 and 160 types of chemical modifications have been identified in DNA and RNA,respectively.The interest in understanding the various biological functions of DNA and RNA modifications has lead to the cutting-edged fields of epigenomics and epitranscriptomics.Developing chemical and biological tools to detect specific modifications in the genome or transcriptome has greatly facilitated their study.Here,we review the recent technological advances in this rapidly evolving field.We focus on high-throughput detection methods and biological findings for these modifications,and discuss questions to be addressed as well.We also summarize third-generation sequencing methods,which enable long-read and single-molecule sequencing of DNA and RNA modification.展开更多
文摘近似串匹配技术在网络信息搜索、数字图书馆、模式识别、文本挖掘、IP路由查找、网络入侵检测、生物信息学、音乐研究计算等领域具有广泛的应用.基于CREW-PRAM(parallel random access machine with concurrent read and exclusive write)模型,采用波前式并行推进的方法直接计算编辑距离矩阵D,设计了一个允许k-差别的近似串匹配动态规划并行算法,该算法使用(m+1)个处理器,时间复杂度为O(n),算法理论上达到线性加速;采取水平和斜向双并行计算编辑距离矩阵D的方法,设计了一个使用a(m+1)个处理器和O(n/a+m)时间的、可伸缩的、允许k-差别的近似串匹配动态规划并行算法,+<11mna.基于分治策略,通过灵活拆分总线和合并子总线动态重构光总线系统,并充分利用光总线的消息播送技术和并行计算前缀和的方法,实现了汉明距离的并行计算,设计了两个基于LARPBS(linear arrays with reconfigurable pipelined bus system)模型的通信高效、可扩放的允许k-误配的近似串匹配并行算法,其中一个算法使用n个处理器,时间为O(m);另一个为常数时间算法,使用mn个处理器.
基金This work was supported by the National Natural Science Foundation of China(Grant No.31861143026 to C.Y.)the Ministry of Science and Technology of China(Grant Nos.2019YFA0110902 and 2019YFA08002501 to C.Y.)the Ludwig Institute for Cancer Research(C-X.S.),Cancer Research UK(C63763/A26394 and C63763/A27122 to C-X.S.)NIHR Oxford Biomedical Research Centre(to C-X.S.)and Emerson Collective(to C-X.S.).L-Y.Z.is supported by China Scholarship Council.The views expressed are those of the authors and not necessarily those of the NHS,the NIHR or the Department of Health.We apologize for not being able to cite all the publications related to this topic due to space constraints of the journal.
文摘Over 17 and 160 types of chemical modifications have been identified in DNA and RNA,respectively.The interest in understanding the various biological functions of DNA and RNA modifications has lead to the cutting-edged fields of epigenomics and epitranscriptomics.Developing chemical and biological tools to detect specific modifications in the genome or transcriptome has greatly facilitated their study.Here,we review the recent technological advances in this rapidly evolving field.We focus on high-throughput detection methods and biological findings for these modifications,and discuss questions to be addressed as well.We also summarize third-generation sequencing methods,which enable long-read and single-molecule sequencing of DNA and RNA modification.