序列蛋白质-GDP绑定位点预测被引量：2

Sequential protein-GDP binding residues prediction

下载PDF

导出

摘要正确地识别蛋白质-二磷酸鸟苷(Guanosine Diphosphate,GDP)绑定位点对于蛋白质功能分析和药物设计有非常重要的意义。蛋白质-GDP绑定位点预测是一个典型的不平衡学习问题。直接应用传统的机器学习方法是不合适的,而且会使预测结果偏向大多数类。为了解决这个问题,在基于稀疏表示的位置特异性得分矩阵特征基础上,提出了加权下采样方法来使得样本平衡,采用支持向量机算法来预测。实验结果表明提出的方法能获得更高的预测性能。 Accurately identifying the protein-GDP binding sites is of significant importance for both protein function analysisand drug design. Protein-GDP binding residues prediction is a typical imbalanced learning problem. Directly applyingthe traditional machine learning approach for this task is not suitable as the learning results will be severely biasedtowards the majority class. To circumvent this problem, on the basis of position specific scoring matrix feature based onsparse representation, weighted under-sampling is developed to make samples balanced. Finally support vector machine isused for prediction. Experimental results show that the proposed method achieves higher prediction performances.

作者石大宏何雪

机构地区南京理工大学计算机科学与工程学院

出处《计算机工程与应用》 CSCD 北大核心 2016年第13期55-59,75,共6页 Computer Engineering and Applications

基金国家自然科学基金(No.61373062)

关键词蛋白质-GDP绑定预测位置特异性得分矩阵稀疏表示加权下采样支持向量机 protein-GDP binding prediction position specific scoring matrix sparse representation weighted under-sampling support vector machine

分类号 TP39 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献25

1Gao M,Skolnick J.The distribution of ligand-bindingpockets around protein-protein interfaces suggests a generalmechanism for pocket formation[J].Proc of NationalAcademy of Science of USA,2012,109(10):3784-3789. 被引量：1
2Kokubo H,Tanaka T,Okamoto Y.Ab initio predictionof protein-ligand binding structures by replica-exchangeumbrella sampling simulations[J].Journal of ComputationalChemistry,2011,32(13):2810-2821. 被引量：1
3Schmidtke P,Barril X.Understanding and predicting druggability.A high-throughput method for detection of drugbinding sites[J].Journal of Medicinal Chemistry,2010,53(15):5858-5867. 被引量：1
4Berman H M,Westbrook J,Feng Z,et al.The protein databank[J].Nucleic Acids Res,2000,28(1):235-242. 被引量：1
5Chen K,Mizianty M J,Kurgan L.ATP site:Sequence-basedprediction of ATP-binding residues[J].Proteome Sci,2011,9(S1):295-297. 被引量：1
6Leis S,Schneider S,Zacharias M.In silico prediction ofbinding sites on proteins[J].Curr Med Chem,2010,17(15):1550-1562. 被引量：1
7Campbell N A,Williamson B,Heyden R J.Biology:Exploringlife[M].Boston,Massachusetts:Pearson Prentice Hall,2006. 被引量：1
8Abbaspour A,Baramakeh L.Application of principle componentanalysis-artificial neural network for simultaneousdetermination of zirconium and hafnium in real samples[J].Spectrochim Acta A,2006,64(2):477-482. 被引量：1
9Chen K,Mizianty M J,Kurgan L.Prediction and analysisof nucleotide-binding residues using sequence andsequence-derived structural descriptors[J].Bioinformatics,2012,28(3):331-341. 被引量：1
10Zhou Z H,Liu X Y.On multi-class cost-sensitive learning[J].Comput Intell-Us,2010,26(3):232-257. 被引量：1

同被引文献3

1杜秀全,程家兴,宋杰.基于最大熵模型的蛋白质作用位点识别方法[J].计算机工程,2010,36(18):203-204. 被引量：2
2韩竞男,鲁昊骋,梁静.DNA甲基化与癌症[J].中国生物化学与分子生物学报,2012,28(2):108-114. 被引量：27
3韦云真,刘晓娟,王芳,苏建忠,张岩,刘洪波.癌症DNA甲基化调控位点的识别[J].生物信息学,2015,13(3):170-178. 被引量：3

引证文献2

1孙佳伟,张明,王长宝,徐维艳,程科,段先华.一种新的融合统计特征的DNA甲基化位点识别方法[J].江苏科技大学学报（自然科学版）,2019,33(2):62-68. 被引量：1
2徐淑坦,王俊豪,陈明.基于序列的蛋白质-GDP结合位点预测[J].中国医学物理学杂志,2022,39(11):1425-1430.

二级引证文献1

1于建平,孙亚楠,孙锐,张涛,洪文学.共现语境特征对情态动词语义排歧的限制作用[J].江苏科技大学学报（自然科学版）,2019,33(6):60-67. 被引量：1

1石大宏.基于聚类的下采样及其在蛋白质-核苷酸绑定位点预测中的应用[J].计算机与数字工程,2015,43(6):972-975.
2杨骥.基于序列与结构特征结合的蛋白质与DNA绑定位点预测[J].计算机与现代化,2016(1):20-25. 被引量：1
3朱非易.基于支持向量机集成的蛋白质与维生素绑定位点预测[J].现代电子技术,2015,38(9):90-95.
4魏志森,杨静宇,於东军.基于加权PSSM直方图和随机森林集成的蛋白质交互作用位点预测[J].南京理工大学学报,2015,39(4):379-385. 被引量：7
5王池社,程家兴,汪世义.基于贝叶斯网的蛋白质相互作用位点预测[J].微型电脑应用,2008,24(12):15-17.
6刘冰静,郭红.以位置特异性得分矩阵和基因本体为特征的蛋白质亚细胞定位预测[J].福州大学学报（自然科学版）,2017,45(1):16-24. 被引量：1
7王菲露,宋杨.基于多窗口不同特征的蛋白质相互作用位点预测[J].安徽大学学报（自然科学版）,2010,34(5):64-68.
8王菲露,宋杰,王池社,杜秀全.基于支持向量机的蛋白质相互作用位点预测[J].计算机技术与发展,2008,18(10):127-129.
9郜法启,於东军,沈红斌.基于分类器集成的跨膜蛋白两亲螺旋区域位置预测[J].南京理工大学学报,2016,40(4):431-437. 被引量：4
10李小苇,刘太岗,陶珮莹,王春华.基于ACC变换和RFE算法的蛋白质亚核定位预测[J].计算机工程与应用,2016,52(15):83-87.

计算机工程与应用

2016年第13期

浏览历史

内容加载中请稍等...

序列蛋白质-GDP绑定位点预测被引量：2

参考文献25

同被引文献3

引证文献2

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

序列蛋白质-GDP绑定位点预测 被引量：2

参考文献25

同被引文献3

引证文献2

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

序列蛋白质-GDP绑定位点预测被引量：2