Abstract: Multi-frame coding is supported by the emerging H.264 standard and is important for improving both coding efficiency and error robustness. This paper investigates error-resilient schemes for H.264 based on multi-frame coding. Error-robust H.264 video transmission schemes are introduced for applications with and without a feedback channel. The experimental results demonstrate the effectiveness of the proposed schemes.
Funding: This work was supported by the National Natural Science Foundation of China (U19B2031).
Abstract: This paper focuses on sea-surface weak target detection in strong sea clutter, based on a memory fully convolutional network (M-FCN). First, a constant false alarm rate (CFAR) detector with a low threshold, and therefore a high probability of false alarm, is applied after non-coherent integration to detect sea-surface weak targets. Lowering the detection threshold raises the detection rate but also generates a large number of false alarms, so suppressing these false alarms is the key to improving weak target detection performance. Next, the low-threshold detection results are processed to construct a target matrix that matches the input size of the fully convolutional network and the form of the convolution operator. Finally, the M-FCN architecture is designed to learn how the accumulation characteristics of the target and of the sea clutter differ across frames. To improve detection performance, the network memorizes historical multi-frame information, and an end-to-end structure is established to detect sea-surface weak targets automatically. Experimental results on measured data demonstrate that the M-FCN method outperforms the traditional track-before-detect (TBD) method and reduces false alarm tracks by 35.1%, which greatly improves track quality.
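The first step described above (non-coherent integration followed by low-threshold CFAR detection) can be sketched roughly as follows. The abstract does not say which CFAR variant is used, so the one-dimensional cell-averaging form shown here, the function names, and the parameters `guard`, `train`, and `scale` are all assumptions made for illustration, not the authors' implementation.

```python
def noncoherent_integrate(frames):
    """Sum power samples across frames, cell by cell (non-coherent integration)."""
    return [sum(cells) for cells in zip(*frames)]


def ca_cfar(power, guard=1, train=4, scale=3.0):
    """Cell-averaging CFAR over a 1D power profile (assumed variant).

    For each cell under test, the noise level is the mean of up to `train`
    training cells on each side, skipping `guard` guard cells. The cell is
    declared a detection when its power exceeds `scale` times that level.
    """
    detections = []
    for i in range(len(power)):
        left = power[max(0, i - guard - train):max(0, i - guard)]
        right = power[i + guard + 1:i + guard + 1 + train]
        window = left + right
        if not window:
            continue  # no training cells available for this cell
        noise = sum(window) / len(window)
        if power[i] > scale * noise:
            detections.append(i)
    return detections
```

In this sketch, lowering `scale` plays the role of lowering the detection threshold: more weak cells pass, and so do more clutter cells, which is the false-alarm problem the later network stage is meant to suppress.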
Abstract: Point cloud compression is critical for deploying 3D representations of the physical world in applications such as 3D immersive telepresence, autonomous driving, and cultural heritage preservation. However, point cloud data are distributed irregularly and discontinuously in the spatial and temporal domains, where redundant unoccupied voxels and weak correlations in 3D space make efficient compression a challenging problem. In this paper, we propose a spatio-temporal context-guided algorithm for lossless point cloud geometry compression. The proposed scheme starts by dividing the point cloud into sliced layers of unit thickness along the longest axis. It then introduces a prediction method that applies when both intra-frame and inter-frame point clouds are available, determining correspondences between adjacent layers and estimating the shortest path using the travelling salesman algorithm. Finally, the small prediction residuals are efficiently compressed with optimal context-guided and adaptive fast-mode arithmetic coding techniques. Experiments show that the proposed method effectively achieves low-bit-rate lossless compression of point cloud geometry and is suitable for 3D point cloud compression across various types of scenes.
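The slicing step described above can be illustrated with a minimal sketch. It assumes points are given as `(x, y, z)` tuples in voxel coordinates; the function name `slice_along_longest_axis` and the returned layout are hypothetical, and the layer prediction and arithmetic coding stages are not shown.

```python
import math
from collections import defaultdict


def slice_along_longest_axis(points):
    """Group points into unit-thickness layers along the longest axis.

    Returns (axis, layers), where `layers` maps a layer index to the 2D
    coordinates (the two remaining axes) of the points in that layer.
    """
    if not points:
        raise ValueError("empty point cloud")
    # Extent of the bounding box along x, y and z.
    extents = [
        max(p[a] for p in points) - min(p[a] for p in points)
        for a in range(3)
    ]
    axis = extents.index(max(extents))
    layers = defaultdict(list)
    for p in points:
        index = math.floor(p[axis])  # unit-thickness layer index
        layers[index].append(tuple(c for a, c in enumerate(p) if a != axis))
    return axis, dict(sorted(layers.items()))
```

Each layer is then a 2D point set, which is what makes correspondence search between adjacent layers tractable.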
Funding: the National Key R&D Program of China (2016YFE0204200); the National Natural Science Foundation of China (Grant Nos. 61503017, U1435220); the Fundamental Research Funds for the Central Universities (YWF-14-RSC-102); the Aeronautical Science Foundation of China (2016ZC51022); the ANR AutoFerm project; and the Platform CAPSEC funded by Region Champagne-Ardenne and FEDER.
Abstract: Security surveillance of public scenes is closely tied to the everyday safety of individuals. Driven by this concern, abnormal event detection is becoming one of the most important tasks in computer vision and video processing. In this paper, we propose a new algorithm to address the visual abnormal event detection problem. Our algorithm decouples the problem into a feature descriptor extraction process followed by an AutoEncoder-based network called the cascade deep AutoEncoder (CDA). Movement information is represented by a novel descriptor that captures multi-frame optical flow information. The feature descriptors of normal samples are then fed into the CDA network for training. Finally, abnormal samples are distinguished by the reconstruction error of the CDA in the testing procedure. We validate the proposed method on several video surveillance datasets.
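The final decision step, flagging a sample as abnormal when its reconstruction error is large, can be sketched as below. The abstract does not specify the error measure or how the threshold is chosen, so the mean squared error, the "mean plus k standard deviations" rule, and all names here are assumptions; `reconstruct` stands in for any trained autoencoder, not the CDA itself.

```python
import math


def reconstruction_error(x, x_hat):
    """Mean squared error between a descriptor and its reconstruction."""
    return sum((a - b) ** 2 for a, b in zip(x, x_hat)) / len(x)


def fit_threshold(normal_errors, k=3.0):
    """Assumed rule: mean + k * std of the errors on normal training samples."""
    mean = sum(normal_errors) / len(normal_errors)
    var = sum((e - mean) ** 2 for e in normal_errors) / len(normal_errors)
    return mean + k * math.sqrt(var)


def is_abnormal(x, reconstruct, threshold):
    """Flag x when the model trained on normal data reconstructs it poorly."""
    return reconstruction_error(x, reconstruct(x)) > threshold
```

The idea is that a model trained only on normal descriptors reconstructs them well, so a high error suggests the sample lies outside the normal pattern.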
Abstract: In this article, we present the realisation of a multi-frame and multi-dimensional WebGIS that allows users to analyse a specific portion of the Earth in several views simultaneously, taking historical information into account as well. Two graphical panels have been realised: one for the usual 2D view and one for a more realistic 3D view. Both panels display historical maps of the city, the current orthophoto, and the digital topographical map. The 3D frame is based on NASA World Wind, an open source virtual globe on which 3D buildings are shown by extruding their 2D shapes using their mean height. Thanks to a specifically designed graphical user interface, it is also possible to dynamically thematise the buildings on the globe according to different criteria (e.g. the construction time span), so that only the geometries fulfilling the request are turned on. Within the proposed application, the two panels are synchronised in order to keep the two viewers constantly aligned. The application is also open to the time dimension: by assigning two dates to each geometry (e.g. 'year of construction' and 'year of demolition'), it is possible to dynamically view how buildings have changed over time, both in shape and in height. Future developments of this work will concern the possibility of implementing a city model with a higher level of detail.
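The time dimension described above amounts to filtering geometries by a pair of dates. A minimal sketch follows; the `Building` record, its field names, and the `visible_in` function are hypothetical and only illustrate the idea, not the application's actual data model.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Building:
    name: str
    mean_height: float                      # used to extrude the 2D footprint
    year_built: int
    year_demolished: Optional[int] = None   # None means still standing


def visible_in(buildings, year):
    """Return the buildings that exist in the given year."""
    return [
        b for b in buildings
        if b.year_built <= year
        and (b.year_demolished is None or year < b.year_demolished)
    ]
```

Calling `visible_in` for successive years and redrawing only the returned geometries gives the effect of watching the city change over time.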
Funding: Supported by the Innovation Project of the Science and Technology Commission of the Central Military Commission, China (No. 19-HXXX-01-ZD-006-XXX-XX).
Abstract: In this paper, a novel multi-frame track-before-detect algorithm based on root label clustering is proposed to reduce the high computational complexity caused by expansion of the observation area and increases in clutter/noise density. A track extrapolation criterion is used to construct the state transition set, and root labels are assigned according to the state transition set to obtain the distribution of multiple targets in the measurement space. The multi-frame measurement plots are then divided into several clusters, and finally the multi-frame track-before-detect algorithm is run within each cluster. The proposed algorithm thereby reduces computational complexity. Simulation results show that it can accurately detect multiple targets in close proximity and reduce the number of false tracks.
Funding: Supported by the National Key Research and Development Program of China (Nos. 2021YFC2009200 and 2023YFC3606100) and the Special Project of Technological Innovation and Application Development of Chongqing, China (No. cstc2019jscx-msxmX0167).
Abstract: Multi-frame human pose estimation is a challenging task because of factors such as motion blur, out-of-focus video, and occlusion. Exploiting temporal consistency between consecutive frames is an efficient way to address this issue. Currently, most methods explore temporal consistency by refining the final heatmaps. The heatmaps contain the semantic information of key points and can improve detection quality to a certain extent. However, heatmaps are generated from features, and feature-level refinements are rarely considered. In this paper, we propose a human pose estimation framework with refinements at both the feature and semantic levels. We align auxiliary features with the features of the current frame to reduce the loss caused by differing feature distributions. An attention mechanism is then used to fuse the auxiliary features with the current features. At the semantic level, we use the difference information between adjacent heatmaps as auxiliary features to refine the current heatmaps. The method is validated on the large-scale benchmark datasets PoseTrack2017 and PoseTrack2018, and the results demonstrate its effectiveness.