17 articles found
Intelligent Express Delivery System Based on Video Processing
1
Authors: Chuang Xu, Yanqing Wang, Ruyu Sheng, Wenjun Lu. 《国际计算机前沿大会会议论文集》, 2021, No. 1, pp. 370-379 (10 pages)
Although the scale of the express industry is large, it is difficult to achieve fully intelligent receiving and sending of express parcels. In this paper, an intelligent express delivery system is proposed based on the image and video processing technology of OpenCV, the Faster R-CNN object detection algorithm, and other technologies. Through a depth camera and an electronic scale, it can identify the object category, volume, and weight of the items placed on the scale by the sender and store the video of the objects being packed into the cabinet. The overall framework of the system was constructed; key technologies were applied to realize the system; and the function of the system was tested. The experimental results show that it achieves intelligent automation of delivery through the integrated express delivery system of intelligent identification and information traceability, which promotes the development of the express delivery industry.
Keywords: video processing; object recognition; intelligent express delivery system
Deeply‐Recursive Attention Network for video steganography
2
Authors: Jiabao Cui, Liangli Zheng, Yunlong Yu, Yining Lin, Huajian Ni, Xin Xu, Zhongfei Zhang. 《CAAI Transactions on Intelligence Technology》 SCIE EI, 2023, No. 4, pp. 1507-1523 (17 pages)
Video steganography plays an important role in secret communication: it conceals a secret video in a cover video by perturbing the pixel values of the cover frames. Imperceptibility is the first and foremost requirement of any steganographic approach. Inspired by the fact that human eyes perceive pixel perturbation differently in different video areas, a novel, effective, and efficient Deeply‐Recursive Attention Network (DRANet) for video steganography is proposed, which finds suitable areas for information hiding by modelling spatio‐temporal attention. The DRANet mainly contains two components, a Non‐Local Self‐Attention (NLSA) block and a Non‐Local Co‐Attention (NLCA) block. Specifically, the NLSA block selects the cover frame areas that are suitable for hiding by computing the correlations among inter‐ and intra‐cover frames. The NLCA block aims to produce enhanced representations of the secret frames in order to improve the robustness of the model and alleviate the influence of different areas in the secret video. Furthermore, the DRANet reduces the model parameters by recursively performing similar operations on the different frames within an input video. Experimental results show that the proposed DRANet achieves better performance with fewer parameters than state‐of‐the‐art competitors.
Keywords: data privacy; video processing
Single-Phase Velocity Determination Based in Video and Sub-Images Processing:An Optical Flow Method Implemented with Support of a Programmed MatLab Structured Script
3
Authors: Andreas Nascimento, Edson Da Costa Bortoni, José Luiz Goncalves, Pedro Antunes Duarte, Mauro Hugo Mathias. 《Journal of Software Engineering and Applications》, 2015, No. 6, pp. 290-294 (5 pages)
The determination of stream velocity matters in many sectors of industry and has become increasingly important because of the need for precise measurements: to determine the right production rates, determine the volumetric production of undesired fluid, establish automated controls based on these measurements that avoid over-flooding or over-production, guarantee accurate predictive maintenance, etc. One difficulty has been determining the velocity of specific fluids embedded in others, for example, the stream velocity of gas bubbles flowing through a liquid phase. Although different applicable methods have been researched and already implemented within the industry, a non-intrusive automated way of providing those stream velocities has its importance and may have a huge impact on project budgets. Knowing the importance of this determination, the developed script breaks real-time video media down into frame images and analyzes, by pixel correlations, possible superposition matches for subsequent estimation of the gas bubble stream velocity. In essence, the script relies on functions and procedures already available in MatLab for image processing and treatment, which allow the methodology to be implemented. Its accuracy in the test run was around 97%; the raw source code with comments had almost 3000 characters; and the hardware used for running the code was a workstation with an Intel Core Duo 2.13 GHz processor and 2 GB of RAM. Despite the good results, it could be stated that only the end-point correlations were actually contributing to the final solution. Hence, by making use of self-learning functions or a neural network, one could surely enhance the capability of the application to run in real time without being exhausted by iterative loops.
Keywords: Optical Flow; Single-Phase Velocity; Video and Image Processing; Sensing; MatLab Script
Multiple Forgery Detection in Video Using Convolution Neural Network
4
Authors: Vinay Kumar, Vineet Kansal, Manish Gaur. 《Computers, Materials & Continua》 SCIE EI, 2022, No. 10, pp. 1347-1364 (18 pages)
With the growth of digital media manipulation, driven by the availability of handy tampering software, the authenticity of records is at high risk, especially in video. There is a dire need to detect such problems and take the necessary actions. In this work, we propose an approach to detect interframe video forgery using the deep features obtained from a parallel deep neural network model together with thorough analytical computations. The proposed approach uses only the deep features extracted from the CNN model and then applies a conventional mathematical approach to these features to find the forgery in the video. This work calculates the correlation coefficient from the deep features of adjacent frames rather than directly from the frames. We divide the procedure of forgery detection into two phases: video forgery detection and video forgery classification. In video forgery detection, the approach detects whether the input video is original or tampered. If the video is not original, it is checked in the next phase, video forgery classification. In video forgery classification, the method reviews the forged video for insertion forgery and deletion forgery and checks again for originality. The proposed work is generalized and is tested on two different datasets. The experimental results show that our approach can detect forgery with an accuracy of 91% on the VIFFD dataset and 90% on the TDTV dataset, and classify the type of forgery (insertion or deletion) with an accuracy of 82% on the VIFFD dataset and 86% on the TDTV dataset. This work can help in the analysis of original and tampered video in various domains.
Keywords: digital forensic; forgery detection; video authentication; video interframe forgery; video processing; deep learning
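The abstract above describes computing a correlation coefficient between the deep features of adjacent frames. A minimal sketch of that idea follows, assuming Pearson correlation and a fixed threshold; the feature vectors, the threshold value, and the function names `pearson` and `suspicious_transitions` are illustrative assumptions and are not taken from the paper.

```python
from math import sqrt


def pearson(a, b):
    """Pearson correlation coefficient of two equal-length feature vectors."""
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return 0.0  # constant vector: correlation is undefined, treat as 0
    return cov / sqrt(var_a * var_b)


def suspicious_transitions(features, threshold=0.5):
    """Indices i where the correlation between frame i and frame i+1 drops below threshold."""
    return [i for i in range(len(features) - 1)
            if pearson(features[i], features[i + 1]) < threshold]


# Hypothetical per-frame feature vectors standing in for CNN features.
frames = [
    [1.0, 2.0, 3.0, 4.0],
    [1.1, 2.1, 2.9, 4.2],
    [4.0, 3.0, 2.0, 1.0],  # abrupt change, as an inserted segment might cause
    [4.1, 2.9, 2.1, 0.9],
]
flagged = suspicious_transitions(frames)
```

Here `flagged` holds the index of the transition between the second and third frames, where the correlation turns negative. A real detector would presumably need a data-driven threshold rather than the fixed value assumed here.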
Video Analysis Based on Volumetric Event Detection
5
Authors: Jing Wang, Zhi-Jie Xu. 《International Journal of Automation and Computing》 EI, 2010, No. 3, pp. 365-371 (7 pages)
During the past decade, feature extraction and knowledge acquisition based on video analysis have been extensively researched and tested on many applications such as closed-circuit television (CCTV) data analysis, large-scale public event control, and other daily security monitoring and surveillance operations with various degrees of success. However, since the actual video process is a multi-phased one and encompasses extensive theories and techniques ranging from fundamental image processing, computational geometry and graphics, and machine vision, to advanced artificial intelligence, pattern analysis, and even cognitive science, there are still many important problems to resolve before it can be widely applied. Among them, video event identification and detection are two prominent ones. In contrast to the frame-to-frame processing mode of most of today's approaches and systems, this project reorganizes video data as a 3D volume structure that provides the hybrid spatial and temporal information in a unified space. This paper reports an innovative technique to transform original video frames to 3D volume structures denoted by spatial and temporal features. It then highlights the volume array structure in a so-called "pre-suspicion" mechanism for a later process. The focus of this report is the development of an effective and efficient voxel-based segmentation technique suitable to the volumetric nature of video events and ready for deployment in 3D clustering operations. The paper concludes with a performance evaluation of the devised technique and a discussion of future work on accelerating the pre-processing of the original video data.
Keywords: spatio-temporal volume (STV); video processing; volume feature extraction; segmentation; motion analysis
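The reorganization of frames into a 3D volume described above can be illustrated with a small sketch. It assumes grayscale frames stored as nested lists and indexes the volume as `volume[t][y][x]`; the helper names and the simple frame-difference rule for marking voxels are assumptions made for illustration, not the paper's segmentation technique.

```python
def build_volume(frames):
    """Stack equally sized 2D frames into a volume indexed as volume[t][y][x]."""
    height, width = len(frames[0]), len(frames[0][0])
    for frame in frames:
        if len(frame) != height or any(len(row) != width for row in frame):
            raise ValueError("all frames must share the same size")
    return [[row[:] for row in frame] for frame in frames]


def temporal_slice(volume, y):
    """Return the x-t plane at image row y: one list of pixels per time step."""
    return [frame[y] for frame in volume]


def changed_voxels(volume, threshold):
    """List (t, y, x) voxels whose intensity differs from the previous frame by more than threshold."""
    voxels = []
    for t in range(1, len(volume)):
        for y, row in enumerate(volume[t]):
            for x, value in enumerate(row):
                if abs(value - volume[t - 1][y][x]) > threshold:
                    voxels.append((t, y, x))
    return voxels


# Three tiny frames with one bright pixel moving to the right along the top row.
frames = [
    [[9, 0, 0], [0, 0, 0]],
    [[0, 9, 0], [0, 0, 0]],
    [[0, 0, 9], [0, 0, 0]],
]
volume = build_volume(frames)
streak = temporal_slice(volume, 0)
moving = changed_voxels(volume, threshold=5)
```

In the x-t slice the moving pixel appears as a diagonal streak, which is the kind of structure a volume-based method can cluster in a single space instead of comparing frames pairwise.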
Temporal Shape Error Concealment for Video Objects
6
Authors: 于烨, 谢旭东, 陆建华, 郑君里, 陈长文. 《Journal of Beijing Institute of Technology》 EI CAS, 2008, No. 3, pp. 322-329 (8 pages)
A novel temporal shape error concealment technique is proposed, which can be used in the context of object-based video coding schemes. In order to reduce the effect of the shape variations of a video object, the curvature scale space (CSS) technique is adopted to extract features, and these features are then used for boundary matching between the current frame and the previous frame. Because the temporal, spatial, and statistical video contour information is all considered, the proposed method can find the optimal matching, which is used to replace the damaged contours. The simulation results show that the proposed algorithm achieves better subjective and objective quality and higher efficiency than previously developed methods.
Keywords: error concealment; object-based image and video processing; curvature scale space (CSS); shape data
Visual-attention gabor filter based online multi-armored target tracking (Cited by: 1)
7
Authors: Fan-jie Meng, Xin-qing Wang, Fa-ming Shao, Dong Wang, Yao-wei Yu, Yi Xiao. 《Defence Technology(防务技术)》 SCIE EI CAS CSCD, 2021, No. 4, pp. 1249-1261 (13 pages)
Multi-armored target tracking (MATT) plays a crucial role in coordinated tracking and strike. Occlusion and insertion among targets and target scale variation are the key problems in MATT. Most state-of-the-art multi-object tracking (MOT) works adopt the tracking-by-detection strategy, which relies on compute-intensive sliding window or anchoring schemes in the detection module and neglects target scale variation in the tracking module. In this work, we propose a more efficient and effective spatial-temporal attention scheme to track multi-armored targets in the ground battlefield. By simulating the structure of the retina, a novel visual-attention Gabor filter branch is proposed to enhance detection. By introducing temporal information, online learned target-specific Convolutional Neural Networks (CNNs) are adopted to address occlusion. More importantly, we built a MOT dataset for armored targets, called the Armored Target Tracking dataset (ATTD), on which several comparative experiments with state-of-the-art methods are conducted. Experimental results show that the proposed method achieves outstanding tracking performance and meets the actual application requirements.
Keywords: multi-object tracking; deep learning; Gabor filter; biological vision; military application; video processing
Sparse Crowd Flow Analysis of Tawaaf of Kaaba During the COVID-19 Pandemic
8
Authors: Durr-e-Nayab, Ali Mustafa Qamar, Rehan Ullah Khan, Waleed Albattah, Khalil Khan, Shabana Habib, Muhammad Islam. 《Computers, Materials & Continua》 SCIE EI, 2022, No. 6, pp. 5581-5601 (21 pages)
The advent of the COVID-19 pandemic has adversely affected the entire world and has created high demand for techniques that remotely manage crowd-related tasks. Video surveillance and crowd management using video analysis techniques have significantly impacted today's research, and numerous applications have been developed in this domain. This research proposes an anomaly detection technique applied to Umrah videos in Kaaba during the COVID-19 pandemic through sparse crowd analysis. Managing the Kaaba rituals is crucial since the crowd gathers from around the world and requires proper analysis during these days of the pandemic. The Umrah videos are analyzed, and a system is devised that can track and monitor the crowd flow in Kaaba. The crowd in these videos is sparse due to the pandemic, and we have developed a technique to track the maximum crowd flow and detect any object (person) moving in a direction unlike that of the major flow. We detect abnormal movement by creating histograms for the vertical and horizontal flows and applying thresholds to identify the non-majority flow. Our algorithm aims to analyze the crowd through video surveillance and detect any abnormal activity in time to maintain a smooth crowd flow in Kaaba during the pandemic.
Keywords: computer vision; COVID; sparse crowd; crowd analysis; flow analysis; sparse crowd management; tawaaf; video analysis; video processing
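The histogram-and-threshold step mentioned in the abstract above can be sketched as follows. This is only an illustration under stated assumptions: each tracked person is reduced to one displacement vector `(dx, dy)`, directions are binned by sign on each axis, and the names `sign_bin`, `non_majority_tracks`, `min_share`, and `dead_zone` are hypothetical, not taken from the paper.

```python
from collections import Counter


def sign_bin(value, dead_zone=0.5):
    """Quantize a displacement into -1, 0, or +1, ignoring small motion."""
    if value > dead_zone:
        return 1
    if value < -dead_zone:
        return -1
    return 0


def non_majority_tracks(flows, min_share=0.6, dead_zone=0.5):
    """Return sorted track ids moving against the dominant direction on either axis.

    flows maps track id -> (dx, dy). An axis is only considered when its
    dominant direction holds at least min_share of the moving tracks.
    """
    flagged = set()
    for axis in (0, 1):  # 0: horizontal flow, 1: vertical flow
        bins = {tid: sign_bin(vec[axis], dead_zone) for tid, vec in flows.items()}
        hist = Counter(b for b in bins.values() if b != 0)
        if not hist:
            continue  # nobody moves along this axis
        direction, count = hist.most_common(1)[0]
        if count / sum(hist.values()) < min_share:
            continue  # no clear majority flow on this axis
        flagged.update(tid for tid, b in bins.items() if b == -direction)
    return sorted(flagged)


# Hypothetical per-person displacements between two frames.
flows = {
    1: (2.0, 0.1),
    2: (1.8, -0.2),
    3: (2.2, 0.0),
    4: (-1.9, 0.1),  # walks against the main horizontal flow
    5: (2.1, 0.3),
}
abnormal = non_majority_tracks(flows)
```

With these sample values only track 4 is flagged: four of the five tracks move right, and the vertical displacements all fall inside the dead zone, so that axis is skipped.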
Dynamic Reconfigurable Structure with Rate Distortion Optimization
9
Authors: Lin Jiang, Xueting Zhang, Rui Shan, Xiaoyan Xie, Xinchuang Liu, Feilong He. 《Journal of Harbin Institute of Technology(New Series)》 EI CAS, 2020, No. 6, pp. 35-47 (13 pages)
The Rate Distortion Optimization (RDO) algorithm in High Efficiency Video Coding (HEVC) has many iterations and a large number of calculations. In order to decrease the calculation time and meet the requirements of fast switching of RDO algorithms of different scales, an RDO dynamic reconfigurable structure is proposed. First, the Quantization Parameter (QP) and bit rate values were loaded through an H-tree Configurable Network (HCN), and the execution status of the array was detected in real time. When a switching request of the RDO algorithm was detected, the corresponding configuration information was delivered. This self-reconfiguration implementation method improved the flexibility and utilization of hardware. Experimental results show that when the control bit width was increased by only 31.25%, the designed configuration network could increase the number of controllable processing units by 32 times, and the execution cycle was 50% lower than that of the same type of design. Compared with the previous RDO algorithm, the RDO algorithm implemented on the reconfigurable array based on the configuration network had an average operating frequency increase of 12.5% and an area reduction of 56.4%.
Keywords: dynamic reconfiguration; rate distortion optimization; Huffman-coding-like; context switch; video processing
Architectural Model of a Biological Retina Using Cellular Automata
10
Authors: Francois Devillard, Bernard Heit. 《Journal of Computer and Communications》, 2014, No. 14, pp. 78-97 (20 pages)
Developments in neurophysiology focusing on foveal vision have characterized more and more precisely the spatiotemporal processing that is well adapted to the regularization of the visual information within the retina. The works described in this article focus on a simplified architectural model based on features and mechanisms of adaptation in the retina. Similarly to the biological retina, which transforms luminance information into a series of encoded representations of image characteristics transmitted to the brain, our structural model allows us to reveal more information in the scene. Our modeling of the different functional pathways permits the mapping of important complementary information types at abstract levels of image analysis, and thereby allows a better exploitation of visual clues. Our model is based on a distributed cellular automata network and simulates the retinal processing of stimuli that are stationary or in motion. Thanks to its capacity for dynamic adaptation, our model can adapt itself to different scenes (e.g., bright and dim, stationary and moving, etc.) and can parallelize those processing steps that can be supported by parallel calculators.
Keywords: computer vision; cellular automata; retina modeling; video processing
Static Scene Illumination Estimation from Videos with Applications (Cited by: 6)
11
Authors: Bin Liu, Nun Xu, Ralph R. Martin. 《Journal of Computer Science & Technology》 SCIE EI CSCD, 2017, No. 3, pp. 430-442 (13 pages)
We present a system that automatically recovers scene geometry and illumination from a video, providing a basis for various applications. Previous image-based illumination estimation methods require either user interaction or external information in the form of a database. We adopt structure-from-motion and multi-view stereo for initial scene reconstruction, and then estimate an environment map represented by spherical harmonics (as these perform better than other bases). We also demonstrate several video editing applications that exploit the recovered geometry and illumination, including object insertion (e.g., for augmented reality), shadow detection, and video relighting.
Keywords: video processing; augmented reality; illumination recovery
Salt and pepper noise removal in surveillance video based on low-rank matrix recovery (Cited by: 1)
12
Authors: Yongxia Zhang, Yi Liu, Xuemei Li, Caiming Zhang. 《Computational Visual Media》, 2015, No. 1, pp. 59-68 (10 pages)
This paper proposes a new algorithm based on low-rank matrix recovery to remove salt & pepper noise from surveillance video. Unlike single-image denoising techniques, noise removal from video sequences aims to utilize both temporal and spatial information. By grouping neighboring frames based on similarities of the whole images in the temporal domain, we formulate the problem of removing salt & pepper noise from a video tracking sequence as a low-rank matrix recovery problem. The resulting nuclear norm and L1-norm related minimization problems can be efficiently solved by many recently developed methods. To determine the low-rank matrix, we use an averaging method based on other similar images. Our method can not only remove noise but also preserve edges and details. The performance of our proposed approach compares favorably to that of existing algorithms and gives better PSNR and SSIM results.
Keywords: multimedia computing; noise cancellation; signal denoising; sparse matrices; video signal processing; video surveillance
Side information extraction algorithm based on hierarchical variable-block-size motion estimation (in English) (Cited by: 4)
13
Authors: 刘荣科, 岳志, 陈长汶. 《Chinese Journal of Aeronautics》 SCIE EI CAS CSCD, 2009, No. 2, pp. 167-173 (7 pages)
The side information quality has an immense effect on the compression efficiency of the distributed video coding (DVC) system. This article, based on the hierarchical motion estimation (HME), proposes a new side information generation algorithm which is integrated into DVC system. First, forward motion estimation (FME) and bidirectional motion estimation (BME) on the basis of variable block size HME algorithm are used to acquire relatively accurate motion vectors. Second, a motion vector filter (MVF) is i...
Keywords: communication technology; video signal processing; hierarchical motion estimation; side information; motion vector filter; frame interpolation
De-interlacing technique based on total variation with spatial-temporal smoothness constraint
14
Authors: YIN XueMin (1, 2, 3), YUAN JianHua (1, 2), LU XiaoPeng (1, 2), ZOU MouYan (1, 2). Affiliations: 1. Institute of Electronics, Chinese Academy of Sciences, Beijing 100080, China; 2. Graduate School, Chinese Academy of Sciences, Beijing 100039, China; 3. Jiuquan Satellite Launch Center, Lanzhou 732750, China. 《Science in China(Series F)》, 2007, No. 4, pp. 561-575 (15 pages)
This paper introduces a new method of converting interlaced video to progressively scanned video and images. The new method is derived from a Bayesian framework with a spatial-temporal smoothness constraint, and the MAP estimate is obtained by minimizing the energy functional. The half-quadratic regularization method is used to solve the corresponding partial differential equations (PDEs). This approach gives improved results over conventional de-interlacing methods. Two criteria are proposed in the paper, and they can be used to evaluate the performance of de-interlacing algorithms.
Keywords: video processing; de-interlacing; total variation; spatio-temporal smoothness constraint; PDEs; half-quadratic regularization
An image-tracking algorithm based on object center distance-weighting and image feature recognition
15
Authors: JIANG Shuhong, WANG Qin, ZHANG Jianqiu, HU Bo. 《Frontiers of Electrical and Electronic Engineering in China》 CSCD, 2007, No. 1, pp. 1-7 (7 pages)
A real-time image-tracking algorithm is proposed, which gives small weights to pixels farther from the object center and uses the quantized image gray scales as a template. It identifies the target's location by the mean-shift iteration method and arrives at the target's scale by using image feature recognition. It improves the kernel-based algorithm in tracking scale-changing targets. A decimation method is proposed to track large-sized targets, and real-time experimental results verify the effectiveness of the proposed algorithm.
Keywords: object tracking; video processing; mean-shift algorithm; feature recognition
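The template described above, quantized gray levels with smaller weights for pixels farther from the object center, can be sketched as a kernel-weighted histogram. The Epanechnikov-style weight, the four quantization levels, and the function names below are assumptions chosen for illustration; the abstract does not state which kernel or quantization the authors use.

```python
def center_weight(dx, dy, half_w, half_h):
    """Epanechnikov-style weight: 1 at the center, falling to 0 at normalized radius 1."""
    r2 = (dx / half_w) ** 2 + (dy / half_h) ** 2
    return max(0.0, 1.0 - r2)


def weighted_gray_histogram(patch, levels=4, max_value=256):
    """Normalized histogram of quantized gray levels, weighted by distance to the patch center."""
    height, width = len(patch), len(patch[0])
    cy, cx = (height - 1) / 2.0, (width - 1) / 2.0
    half_h, half_w = height / 2.0, width / 2.0
    hist = [0.0] * levels
    for y, row in enumerate(patch):
        for x, value in enumerate(row):
            bin_index = value * levels // max_value  # quantize the gray level
            hist[bin_index] += center_weight(x - cx, y - cy, half_w, half_h)
    total = sum(hist)
    return [h / total for h in hist] if total > 0 else hist


# A bright object pixel at the center, surrounded by dark background.
patch = [
    [10, 10, 10],
    [10, 200, 10],
    [10, 10, 10],
]
template = weighted_gray_histogram(patch)
```

Because background pixels near the border receive small weights, the center pixel's bin ends up with a larger share than the 1/9 it would get in an unweighted histogram, which makes the template less sensitive to background clutter at the window edge.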
Study of frame-rate up conversion based on H.264 (Cited by: 1)
16
Authors: GAN Zong-liang, ZHU Xiu-chang. 《The Journal of China Universities of Posts and Telecommunications》 EI CSCD, 2007, No. 1, pp. 106-110 (5 pages)
In this study, a low-complexity frame-rate up conversion method using compressed-domain information for the H.264 decoder is proposed. In the proposed scheme, the motion vectors (MVs) are estimated using a constant acceleration motion model, MVs regarded as unreliable are corrected, and the interpolation method is applied on the basis of the macroblock (MB) coding types. Applied to the H.264 decoder, the proposed method provides high-quality interpolated frames and an obvious decrease in block artifacts.
Keywords: video post processing; H.264; motion compensated frame interpolation (MCFI); frame-rate up conversion (FRUC)
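The abstract above does not spell out how the constant acceleration model yields motion vectors. A minimal sketch follows, assuming the position of a block follows p(t) = p0 + v·t + a·t²/2 and that two consecutive per-frame displacements are available from the bitstream; the function names and the midpoint formula derived from that model are illustrative and may differ from the paper's formulation.

```python
def predict_next_mv(d_prev, d_cur):
    """Extrapolate the next per-frame displacement under constant acceleration.

    With displacements d_prev (frame n-2 to n-1) and d_cur (frame n-1 to n),
    constant acceleration makes the displacement grow linearly: 2*d_cur - d_prev.
    """
    return tuple(2.0 * c - p for p, c in zip(d_prev, d_cur))


def midpoint_offset(d_prev, d_cur):
    """Displacement from frame n-1 to the instant halfway between frames n-1 and n.

    Solving the quadratic position model for v and a gives
    0.125*d_prev + 0.375*d_cur, which reduces to d_cur/2 for constant velocity.
    """
    return tuple(0.125 * p + 0.375 * c for p, c in zip(d_prev, d_cur))


# Hypothetical displacements of one block, in pixels per frame interval.
d_prev = (2.0, 0.0)
d_cur = (4.0, 1.0)
next_mv = predict_next_mv(d_prev, d_cur)
half_mv = midpoint_offset(d_prev, d_cur)
```

For an accelerating block, the midpoint offset is smaller than the linear guess of half the current displacement, because the block covers less distance in the first half of the interval than in the second.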
An improved partial SPIHT with classified weighted rate-distortion optimization for interferential multispectral image compression
17
Authors: 王柯俨, 吴成柯, 孔繁锵, 张磊. 《Chinese Optics Letters》 SCIE EI CAS CSCD, 2008, No. 5, pp. 331-333 (3 pages)
Based on the property analysis of interferential multispectral images, a novel compression algorithm of partial set partitioning in hierarchical trees (SPIHT) with classified weighted rate-distortion optimization is presented. After wavelet decomposition, partial SPIHT is applied to each zero tree independently by adaptively selecting one of three coding modes according to the probability of the significant coefficients in each bitplane. Meanwhile, the interferential multispectral image is partitioned into two kinds of regions in terms of luminous intensity, and the rate-distortion slopes of zero trees are then lifted with classified weights according to their distortion contribution to the constructed spectrum. Finally, a global rate-distortion optimization truncation is performed. Compared with the conventional methods, the proposed algorithm not only improves the performance in the spatial domain but also reduces the distortion in the spectral domain.
Keywords: Boolean functions; data compression; electric distortion; image coding; image compression; motion estimation; optimization; programming theory; risk assessment; signal distortion; video signal processing; wavelet decomposition; wavelet transforms