SGCL-LncLoc:An Interpretable Deep Learning Model for Improving IncRNA Subcellular Localization Prediction with Supervised Graph Contrastive Learning

导出

摘要 Understanding the subcellular localization of long non-coding RNAs(IncRNAs)is crucial for unraveling their functional mechanisms.While previous computational methods have made progress in predicting IncRNA subcellular localization,most of them ignore the sequence order information by relying on k-mer frequency features to encode IncRNA sequences.In the study,we develope SGCL-LncLoc,a novel interpretable deep learning model based on supervised graph contrastive learning.SGCL-LncLoc transforms IncRNA sequences into de Bruijn graphs and uses the Word2Vec technique to learn the node representation of the graph.Then,SGCL-LncLoc applies graph convolutional networks to learn the comprehensive graph representation.Additionally,we propose a computational method to map the attention weights of the graph nodes to the weights of nucleotides in the IncRNA sequence,allowing SGCL-LncLoc to serve as an interpretable deep learning model.Furthermore,SGCL-LncLoc employs a supervised contrastive learning strategy,which leverages the relationships between different samples and label information,guiding the model to enhance representation learning for IncRNAs.Extensive experimental results demonstrate that SGCL-LncLoc outperforms both deep learning baseline models and existing predictors,showing its capability for accurate IncRNA subcellular localization prediction.Furthermore,we conduct a motif analysis,revealing that SGCL-LncLoc successfully captures known motifs associated with IncRNA subcellular localization.The SGCL-LncLoc web server is available at http://csuligroup.com:8000/SGCL-LncLoc.The source code can be obtained from https://github.com/CSUBioGroup/SGCL-LncLoc.

作者 Min Li Baoying Zhao Yiming Li Pingjian Ding Rui Yin Shichao Kan Min Zeng

机构地区 School of Computer Science and Engineering Center for Artificial Intelligence in Drug Discovery Department of Health Outcomes and Biomedical Informatics

出处《Big Data Mining and Analytics》 EI CSCD 2024年第3期765-780,共16页 大数据挖掘与分析（英文）

基金 supported by the National Natural Science Foundation of China(No.62102457) the Hunan Provincial Natural Science Foundation of China(No.2023JJ40763) the Hunan Provincial Science and Technology Program(No.2021RC4008) the Fundamental Research Funds for the Central Universities of Central South University(No.CX20230271).

关键词 supervised contrastive learning long non-coding RNA(IncRNA) subcellular localization prediction deep learning Graph Convolutional Network(GCN)

分类号 O15 [理学—数学]

引文网络
相关文献

1Let＇s Talk[J].Women of China,2018(9):6-6.
2Yun Tan,Jun Pu,Hongpeng Li,Dongliang Chao.Water molecular activity management towards stable Zn anodes[J].Science China Chemistry,2024,67(12):4085-4097.
3庞杰,闫晓东,赵小兵.Ko⁃LLaMA:基于LLaMA的朝鲜语大语言模型[J].外语学刊,2025(1):1-8.
4Manh Vu Minh,Cho Do Xuan.A Novel Approach for Android Malware Detection Based on Intelligent Computing[J].Computers, Materials & Continua,2024,81(12):4371-4396.
5Jianyong Wang,Mingliang Gao,Qilei Li,Hyunbum Kim,Gwanggil Jeon.A Survey on Supervised,Unsupervised,and Semi-Supervised Approaches in Crowd Counting[J].Computers, Materials & Continua,2024,81(12):3561-3582.
6Zhaoxu Meng,Cheng Chen,Xuan Zhang,Wei Zhao,Xuefeng Cui.Exploring Fragment Adding Strategies to Enhance Molecule Pretraining in AI-Driven Drug Discovery[J].Big Data Mining and Analytics,2024,7(3):565-576.
7Peng-Sheng Liu,Li-Nan Zheng,Jia-Le Chen,Guang-Fa Zhang,Yang Xu,Jin-Yun Fang.Enhancing Recommendation with Denoising Auxiliary Task[J].Journal of Computer Science & Technology,2024,39(5):1123-1137.
8Yingying Tan,Guowei Huang,Haiyan Fan,Tao Wu,Zhilin Guan,Kede Liu.CNGC20 plays dual roles in regulating plant growth and immunity in Brassica napus[J].The Crop Journal,2024,12(6):1533-1546.
9Guanpeng Huang,Ti Wu,Yinjie Zheng,Qiyun Gu,Qiaobin Chen,Shoukai Lin,Jincheng Wu.Genome-Wide Identification of the GST Gene Family in Loquat (Eriobotrya japonica Lindl.) and Their Expression under Cold Stress with ALA Pretreatment[J].Phyton-International Journal of Experimental Botany,2024,93(11):2715-2735.
10Tao Xing.Counting Its Blessings Southeast China's Fuzhou transforms history into tourism treasure trove[J].Beijing Review,2024,67(51):38-39.

Big Data Mining and Analytics

2024年第3期

浏览历史

内容加载中请稍等...

SGCL-LncLoc:An Interpretable Deep Learning Model for Improving IncRNA Subcellular Localization Prediction with Supervised Graph Contrastive Learning

相关作者

相关机构

相关主题

浏览历史