Polyp-PVT:Polyp Segmentation with Pyramid Vision Transformers 被引量：4

导出

摘要 Most polyp segmentation methods use convolutional neural networks(CNNs)as their backbone,leading to two key issues when exchanging information between the encoder and decoder:(1)taking into account the differences in contribution between different-level features,and(2)designing an effective mechanism for fusing these features.Unlike existing CNN-based methods,we adopt a transformer encoder,which learns more powerful and robust representations.In addition,considering the image acquisition influence and elusive properties of polyps,we introduce three standard modules,including a cascaded fusion module(CFM),a camouflage identification module(CIM),and a similarity aggregation module(SAM).Among these,the CFM is used to collect the semantic and location information of polyps from high-level features;the CIM is applied to capture polyp information disguised in low-level features,and the SAM extends the pixel features of the polyp area with high-level semantic position information to the entire polyp area,thereby effectively fusing cross-level features.The proposed model,named Polyp-PVT,effectively suppresses noises in the features and significantly improves their expressive capabilities.Extensive experiments on five widely adopted datasets show that the proposed model is more robust to various challenging situations(e.g.,appearance changes,small objects,and rotation)than existing representative methods.The proposed model is available at https://github.com/DengPingFan/Polyp-PVT.

作者 Bo Dong Wenhai Wang Deng-Ping Fan Jinpeng Li Huazhu Fu Ling Shao

机构地区 College of Computer Science Shanghai Artificial Intelligence Laboratory Computer Vision Lab Institute of High Performance Computing UCAS-Terminus AI Lab

出处《CAAI Artificial Intelligence Research》 2023年第1期1-15,共15页 CAAI人工智能研究（英文）

关键词 polyp segmentation pyramid vision transformer COLONOSCOPY computer vision

分类号 TN624 [电子电信—电路与系统]

引文网络
相关文献

参考文献2

1Wenhai Wang,Enze Xie,Xiang Li,Deng-Ping Fan,Kaitao Song,Ding Liang,Tong Lu,Ping Luo,Ling Shao.PVT v2:Improved baselines with Pyramid Vision Transformer[J].Computational Visual Media,2022,8(3):415-424. 被引量：70
2Ge-Peng Ji,Guobao Xiao,Yu-Cheng Chou,Deng-Ping Fan,Kai Zhao,Geng Chen,Luc Van Gool.Video Polyp Segmentation: A Deep Learning Perspective[J].Machine Intelligence Research,2022,19(6):531-549. 被引量：11

二级参考文献3

1范登平,季葛鹏,秦雪彬,程明明.认知规律启发的物体分割评价标准及损失函数[J].中国科学：信息科学,2021,51(9):1475-1489. 被引量：12
2Isaac Baffour Senkyire,Zhe Liu.Supervised and Semi-supervised Methods for Abdominal Organ Segmentation: A Review[J].International Journal of Automation and computing,2021,18(6):887-914. 被引量：3
3Wenhai Wang,Enze Xie,Xiang Li,Deng-Ping Fan,Kaitao Song,Ding Liang,Tong Lu,Ping Luo,Ling Shao.PVT v2:Improved baselines with Pyramid Vision Transformer[J].Computational Visual Media,2022,8(3):415-424. 被引量：70

共引文献78

1李敏,乔志远,杨易鑫.基于光学遥感影像的舰船检测研究综述[J].网络安全与数据治理,2023,42(S01):106-114.
2张显杰,张之明.基于卷积神经网络和Transformer的手写体英文文本识别[J].计算机应用,2022,42(8):2394-2400. 被引量：3
3薛相全,庞明宝.基于Transformer-ESIM的高速公路交通状态识别模型[J].物流科技,2022,45(17):71-75.
4单维锋,李志扬,陈俊,刘海军,张秀霞,邢丽莉,胡秀娟,夏庆新,夏金铸.应用卷积神经网络和自注意力机制识别地磁场干扰事件[J].地震地磁观测与研究,2022,43(5):49-63.
5史彩娟,任弼娟,王子雯,闫巾玮,石泽.基于深度学习的伪装目标检测综述[J].计算机科学与探索,2022,16(12):2734-2751. 被引量：8
6Ge-Peng Ji,Guobao Xiao,Yu-Cheng Chou,Deng-Ping Fan,Kai Zhao,Geng Chen,Luc Van Gool.Video Polyp Segmentation: A Deep Learning Perspective[J].Machine Intelligence Research,2022,19(6):531-549. 被引量：11
7刘洋,李相国,连良秀.基于AIOT的安全生产监管平台关键技术研究[J].网络安全技术与应用,2022(12):7-9. 被引量：2
8李翔,张涛,张哲,魏宏杨,钱育蓉.Transformer在计算机视觉领域的研究综述[J].计算机工程与应用,2023,59(1):1-14. 被引量：17
9冯珺,彭梁英,赵帅,潘司晨,郭雪强.基于孪生神经网络的小样本目标检测综述[J].河北科技大学学报,2022,43(6):643-650. 被引量：2
10王甜甜,史卫亚,张世强,张绍文.采用双支路和Transformer的视杯视盘分割方法[J].科学技术与工程,2023,23(6):2499-2508. 被引量：1

同被引文献8

1杨昆,孙宇锋,汪世伟,路宇飞,薛林雁.YOLOF-CBAM:一种新的结直肠息肉实时分类与检测方法[J].电子测量技术,2023,46(16):138-147. 被引量：2
2张欢,刘静,冯毅博,仇大伟.U-Net及其在肝脏和肝脏肿瘤分割中的应用综述[J].计算机工程与应用,2022,58(2):1-14. 被引量：13
3Wenhai Wang,Enze Xie,Xiang Li,Deng-Ping Fan,Kaitao Song,Ding Liang,Tong Lu,Ping Luo,Ling Shao.PVT v2:Improved baselines with Pyramid Vision Transformer[J].Computational Visual Media,2022,8(3):415-424. 被引量：70
4张正杰,程云章,黄陈.影像组学在结直肠癌诊疗中的应用及研究进展[J].生物医学工程研究,2023,42(1):96-99. 被引量：6
5刘铁,段勇.融合CNN和Transformer的机器人室内场景识别[J].电子测量与仪器学报,2023,37(5):223-229. 被引量：2
6刘肇隆,范馨月.基于全尺度跳跃连接的TransUNet医学图像分割网络[J].国外电子测量技术,2023,42(11):42-48. 被引量：2
7金宇锋,陶重犇.基于Transformer的融合信息增强3D目标检测算法[J].仪器仪表学报,2023,44(12):297-306. 被引量：7
8汪鹏程,张波涛,顾进广.融合多尺度门控卷积和窗口注意力的结肠息肉分割[J].计算机系统应用,2024,33(6):70-80. 被引量：1

引证文献4

1聂应旺,王雷,梅晨阳,陈浩.用于精确图像分割的特征细化金字塔视觉转换器[J].温州医科大学学报,2024,54(8):631-640.
2刘国奇,陈宗玉,刘栋,常宝方,王佳佳.融合边界注意力的特征挖掘息肉小目标网络[J].智能系统学报,2024,19(5):1092-1101.
3张攀峰,杨贺,神显豪,程小辉,杜慧.融合局部和全局特征的息肉分割模型[J].电子测量技术,2024,47(16):100-109.
4顾聪,段其强,任思雨.基于上下文感知网络的息肉分割算法[J].计算机应用,2024,44(11):3617-3622.

1刘小强.水下传感网络中能效感知的相似数据融合算法[J].火力与指挥控制,2021,46(6):100-104. 被引量：4
2Shi Qiu,Hongbing Lu,Jun Shu,Ting Liang,Tao Zhou.Colorectal Cancer Segmentation Algorithm Based on Deep Features from Enhanced CT Images[J].Computers, Materials & Continua,2024,80(8):2495-2510.
3Olga Nabochenko,Mykola Sysyn,Norman Krumnow,Szabolcs Fischer.Mechanism of cross-level settlements and void accumulation of wide and conventional sleepers in railway ballast[J].Railway Engineering Science,2024,32(3):361-383.
4Junliang Xing,Zhe Wu,Zhaoke Yu,Renye Yan,Zhipeng Ji,Pin Tao,Yuanchun Shi.Game Interactive Learning:A New Paradigm towards Intelligent Decision-Making[J].CAAI Artificial Intelligence Research,2023,2(1):65-74.
5Maksym Manko,Anton Popov,Juan Manuel Gorriz,Javier Ramirez.Improved organs at risk segmentation based on modified U‐Net with self‐attention and consistency regularisation[J].CAAI Transactions on Intelligence Technology,2024,9(4):850-865.
6Xiao-Yan Li,Juan-Juan Xie,Jin-Hong Wang,Yu-Feng Bao,Yi Dong,Bin Gao,Ting Shen,Pei-Yu Huang,Hao-Chao Ying,Han Xu,Anna Wang Roe,Hsin-Yi Lai,Zhi-Ying Wu.Perivascular spaces relate to the course and cognition of Huntington’s disease[J].Translational Neurodegeneration,2023,12(1):546-549.

CAAI Artificial Intelligence Research

2023年第1期

浏览历史

内容加载中请稍等...