As video compression is one of the core technologies required to enable seamless medical data streaming in mobile healthcare applications,there is a need to develop powerful media codecs that can achieve minimum bitra...As video compression is one of the core technologies required to enable seamless medical data streaming in mobile healthcare applications,there is a need to develop powerful media codecs that can achieve minimum bitrates while maintaining high perceptual quality.Versatile Video Coding(VVC)is the latest video coding standard that can provide powerful coding performance with a similar visual quality compared to the previously developed method that is High Efficiency Video Coding(HEVC).In order to achieve this improved coding performance,VVC adopted various advanced coding tools,such as flexible Multi-type Tree(MTT)block structure which uses Binary Tree(BT)split and Ternary Tree(TT)split.However,VVC encoder requires heavy computational complexity due to the excessive Ratedistortion Optimization(RDO)processes used to determine the optimalMTT block mode.In this paper,we propose a fast MTT decision method with two Lightweight Neural Networks(LNNs)using Multi-layer Perceptron(MLP),which are applied to determine the early termination of the TT split within the encoding process.Experimental results show that the proposed method significantly reduced the encoding complexity up to 26%with unnoticeable coding loss compared to the VVC TestModel(VTM).展开更多
Medical image compression is one of the essential technologies to facilitate real-time medical data transmission in remote healthcare applications.In general,image compression can introduce undesired coding artifacts,...Medical image compression is one of the essential technologies to facilitate real-time medical data transmission in remote healthcare applications.In general,image compression can introduce undesired coding artifacts,such as blocking artifacts and ringing effects.In this paper,we proposed a Multi-Scale Feature Attention Network(MSFAN)with two essential parts,which are multi-scale feature extraction layers and feature attention layers to efficiently remove coding artifacts of compressed medical images.Multiscale feature extraction layers have four Feature Extraction(FE)blocks.Each FE block consists of five convolution layers and one CA block for weighted skip connection.In order to optimize the proposed network architectures,a variety of verification tests were conducted using validation dataset.We used Computer Vision Center-Clinic Database(CVC-ClinicDB)consisting of 612 colonoscopy medical images to evaluate the enhancement of image restoration.The proposedMSFAN can achieve improved PSNR gains as high as 0.25 and 0.24 dB on average compared to DnCNNand DCSC,respectively.展开更多
基金This work was supported by Institute for Information&communications Technology Planning&Evaluation(IITP)grant funded by the Korea government(MSIT)(No.2017-0-00072,Development of Audio/Video Coding and Light Field Media Fundamental Technologies for Ultra Realistic Tera-media)。
文摘As video compression is one of the core technologies required to enable seamless medical data streaming in mobile healthcare applications,there is a need to develop powerful media codecs that can achieve minimum bitrates while maintaining high perceptual quality.Versatile Video Coding(VVC)is the latest video coding standard that can provide powerful coding performance with a similar visual quality compared to the previously developed method that is High Efficiency Video Coding(HEVC).In order to achieve this improved coding performance,VVC adopted various advanced coding tools,such as flexible Multi-type Tree(MTT)block structure which uses Binary Tree(BT)split and Ternary Tree(TT)split.However,VVC encoder requires heavy computational complexity due to the excessive Ratedistortion Optimization(RDO)processes used to determine the optimalMTT block mode.In this paper,we propose a fast MTT decision method with two Lightweight Neural Networks(LNNs)using Multi-layer Perceptron(MLP),which are applied to determine the early termination of the TT split within the encoding process.Experimental results show that the proposed method significantly reduced the encoding complexity up to 26%with unnoticeable coding loss compared to the VVC TestModel(VTM).
基金This work was supported by Kyungnam University Foundation Grant,2020.
文摘Medical image compression is one of the essential technologies to facilitate real-time medical data transmission in remote healthcare applications.In general,image compression can introduce undesired coding artifacts,such as blocking artifacts and ringing effects.In this paper,we proposed a Multi-Scale Feature Attention Network(MSFAN)with two essential parts,which are multi-scale feature extraction layers and feature attention layers to efficiently remove coding artifacts of compressed medical images.Multiscale feature extraction layers have four Feature Extraction(FE)blocks.Each FE block consists of five convolution layers and one CA block for weighted skip connection.In order to optimize the proposed network architectures,a variety of verification tests were conducted using validation dataset.We used Computer Vision Center-Clinic Database(CVC-ClinicDB)consisting of 612 colonoscopy medical images to evaluate the enhancement of image restoration.The proposedMSFAN can achieve improved PSNR gains as high as 0.25 and 0.24 dB on average compared to DnCNNand DCSC,respectively.