CasNet:A Cascade Coarse-to-Fine Network for Semantic Segmentation 被引量：2

CasNet:A Cascade Coarse-to-Fine Network for Semantic Segmentation

导出

摘要 Semantic segmentation is a fundamental topic in computer vision. Since it is required to make dense predictions for an entire image, a network can hardly achieve good performance on various kinds of scenes. In this paper, we propose a cascade coarse-to-fine network called CasNet, which focuses on regions that are difficult to make pixel-level labels. The CasNet comprises three branches. The first branch is designed to produce coarse predictions for easy-to-label pixel regions. The second one learns to distinguish the relatively difficult-to-label pixels from the entire image. Finally, the last branch generates final predictions by combining both the coarse and the fine prediction results through a weighting coefficient that is estimated by the second branch. Three branches focus on their own objectives and collaboratively learn to predict from coarse-to-fine predictions. To evaluate the performance of the proposed network, we conduct experiments on two public datasets: SIFT Flow and Stanford Background. We show that these three branches can be trained in an end-to-end manner, and the experimental results demonstrate that the proposed CasNet outperforms existing state-of-the-art models, and it achieves prediction accuracy of 91.6% and 89.7% on SIFT Flow and Standford Background, respectively. Semantic segmentation is a fundamental topic in computer vision. Since it is required to make dense predictions for an entire image, a network can hardly achieve good performance on various kinds of scenes. In this paper, we propose a cascade coarse-to-fine network called CasNet, which focuses on regions that are difficult to make pixel-level labels. The CasNet comprises three branches. The first branch is designed to produce coarse predictions for easy-to-label pixel regions. The second one learns to distinguish the relatively difficult-to-label pixels from the entire image. Finally, the last branch generates final predictions by combining both the coarse and the fine prediction results through a weighting coefficient that is estimated by the second branch. Three branches focus on their own objectives and collaboratively learn to predict from coarse-to-fine predictions. To evaluate the performance of the proposed network, we conduct experiments on two public datasets: SIFT Flow and Stanford Background. We show that these three branches can be trained in an end-to-end manner, and the experimental results demonstrate that the proposed CasNet outperforms existing state-of-the-art models, and it achieves prediction accuracy of 91.6% and 89.7% on SIFT Flow and Standford Background, respectively.

作者 Zhenyang Wang Zhidong Deng Shiyao Wang

机构地区 Department of Computer Science

出处《Tsinghua Science and Technology》 SCIE EI CAS CSCD 2019年第2期207-215,共9页 清华大学学报（自然科学版（英文版）

基金 supported in part by the National Key R&D Program of China(No.2017YFB1302200) Joint Fund of NORINCO Group of China for Advanced Research(No.6141B010318)

关键词 SEMANTIC SEGMENTATION convolutional neural NETWORK HARD NEGATIVE mining semantic segmentation convolutional neural network hard negative mining

分类号 N [自然科学总论]

引文网络
相关文献

同被引文献3

1Ruyue Xin,Jiang Zhang,Yitong Shao.Complex Network Classification with Convolutional Neural Network[J].Tsinghua Science and Technology,2020,25(4):447-457. 被引量：16
2罗会兰,张云.基于深度网络的图像语义分割综述[J].电子学报,2019,47(10):2211-2220. 被引量：32
3Qiang Hua,Liyou Chen,Pan Li,Shipeng Zhao,Yan Li.A Pixel–Channel Hybrid Attention Model for Image Processing[J].Tsinghua Science and Technology,2022,27(5):804-816. 被引量：1

引证文献2

1王博,陈颉颢,蒋红海.基于深度学习的烟草垄行分割模型[J].农业装备与车辆工程,2021,59(7):68-73.
2Wenjie Geng,Zhiqiang Cao,Peiyu Guan,Fengshui Jing,Min Tan,Junzhi Yu.Grasp Detection with Hierarchical Multi-Scale Feature Fusion and Inverted Shuffle Residual[J].Tsinghua Science and Technology,2024,29(1):244-256.

1林达华.为什么要深入数学的世界[J].高等数学研究,2018,21(6):64-64.
2EMILY CONRAD.CITY OF DREAMS[J].The World of Chinese,2018(6):68-70.
3曹诗颂,胡德勇,胡卓玮,赵文吉,陈姗姗,于琛.不透水地表盖度视角下中美城市群空间结构对比——以“京津冀”与“波士华”为例（英文）[J].Journal of Geographical Sciences,2018,28(3):306-322. 被引量：4
4李涛,邹静蓉,汤显平,张治强,雷润杰,喻雅琴.贝雷法在半刚性基层中的应用[J].公路,2018,63(12):19-23. 被引量：6
5Yue Wang,Jinlai Liu,Xiaojie Wang.Video Description with Integrated Visual and Textual Information[J].China Communications,2019,16(1):119-128. 被引量：1
6程雅慧,蔡烜,冯瑞.面向车辆检测的扩张全卷积神经网络[J].计算机系统应用,2019,28(1):107-112. 被引量：2
7充电功率高达10W的谷歌无线充来了[J].数字家庭,2018,0(10):92-92.
8Biao Zhao,Tianyu Yu,Wenfeng Ding,Xianying Li,Honghua Su.BP neural network based flexural strength prediction of open-porous Cu-SnTi composites[J].Progress in Natural Science:Materials International,2018,28(3):315-324. 被引量：5
9Chuanxin Tang,Ronggang Wang,Zhu Li,Wenmin Wang,Wen Gao.Frame interpolation with pixel-level motion vector field and mesh based hole filling[J].CAAI Transactions on Intelligence Technology,2016,1(1):72-78.

Tsinghua Science and Technology

2019年第2期

浏览历史

内容加载中请稍等...

CasNet:A Cascade Coarse-to-Fine Network for Semantic Segmentation 被引量：2

同被引文献3

引证文献2

相关作者

相关机构

相关主题

浏览历史