A Video Coding Algorithm for Machine Based on Saliency Detection (利用显著性检测的机器视觉视频编码算法)


Abstract: In recent years, video intended for machine vision has been studied and applied more and more widely, which poses great challenges for both the storage and the transmission of such video. Video coding standards such as Versatile Video Coding (VVC) achieve efficient full-resolution compression and reconstruction, but for machine vision tasks this kind of compression is redundant. This paper therefore proposes a video coding method for machine tasks that incorporates saliency detection into the VVC encoding process. The instance segmentation network Mask R-CNN (Mask Region-based Convolutional Neural Network) is used to obtain a binary mask containing the objects. The mask determines whether a region is a region of interest, and that decision guides the quantization parameter offset of each Coding Tree Unit (CTU) during VVC encoding. Experiments show that, compared with the VVC baseline, the proposed method saves some bit rate at similar detection accuracy.
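The mask-to-CTU step described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the function name `ctu_qp_offsets`, the 128-pixel CTU size, the coverage threshold, and both offset values are assumptions chosen for the example, and the binary mask is taken as given (in the paper it comes from Mask R-CNN).

```python
import numpy as np


def ctu_qp_offsets(mask, ctu_size=128, roi_threshold=0.0,
                   roi_offset=0, background_offset=6):
    """Map a binary object mask to one QP offset per CTU.

    All defaults are illustrative assumptions, not values from the paper.
    A CTU counts as a region of interest when the fraction of its pixels
    covered by the mask exceeds roi_threshold.
    """
    mask = np.asarray(mask, dtype=bool)
    height, width = mask.shape
    rows = -(-height // ctu_size)  # ceiling division: partial CTUs at the edge count
    cols = -(-width // ctu_size)
    offsets = np.empty((rows, cols), dtype=np.int32)
    for r in range(rows):
        for c in range(cols):
            # Slicing clips automatically at the frame border.
            block = mask[r * ctu_size:(r + 1) * ctu_size,
                         c * ctu_size:(c + 1) * ctu_size]
            coverage = block.mean()  # fraction of object pixels in this CTU
            offsets[r, c] = roi_offset if coverage > roi_threshold else background_offset
    return offsets


# Hypothetical 256x384 frame with one object inside the top-middle CTU.
demo_mask = np.zeros((256, 384), dtype=bool)
demo_mask[10:50, 140:200] = True
print(ctu_qp_offsets(demo_mask))
```

In this sketch, CTUs that contain object pixels keep the base quantization parameter and the remaining CTUs are quantized more coarsely. How the resulting offset map is passed to a VVC encoder depends on the encoder and is not covered here.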

Authors: LI Hongyao (李鸿耀); HE Xiaohai (何小海); CHEN Honggang (陈洪刚); WEI Haitao (魏海涛); XIONG Shuhua (熊淑华) (College of Electronic Information, Sichuan University, Chengdu, Sichuan 610065, China)

Affiliation: College of Electronic Information, Sichuan University

Source: Communications Technology (《通信技术》), 2024, No. 5, pp. 436-443 (8 pages)

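Funding: National Natural Science Foundation of China (62271336, 62211530110); TCL Science and Technology Innovation Fund; Sichuan Science and Technology Program (24GJHZ0381).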

Keywords: video coding for machine; VVC/H.266; saliency detection; Mask R-CNN

Classification: TP391 [Automation and Computer Technology: Computer Application Technology]

