10 articles found
Depth-Guided Vision Transformer With Normalizing Flows for Monocular 3D Object Detection
1
Authors: Cong Pan, Junran Peng, Zhaoxiang Zhang. IEEE/CAA Journal of Automatica Sinica (SCIE, EI, CSCD), 2024, Issue 3, pp. 673-689 (17 pages)
Monocular 3D object detection is challenging due to the lack of accurate depth information. Some methods estimate pixel-wise depth maps from off-the-shelf depth estimators and then use them as an additional input to augment the RGB images. Depth-based methods attempt to convert estimated depth maps to pseudo-LiDAR and then use LiDAR-based object detectors, or focus on the perspective of image and depth fusion learning. However, they demonstrate limited performance and efficiency as a result of depth inaccuracy and a complex fusion mode with convolutions. Different from these approaches, our proposed depth-guided vision transformer with normalizing flows (NF-DVT) network uses normalizing flows to build priors in depth maps to achieve more accurate depth information. Then we develop a novel Swin-Transformer-based backbone with a fusion module to process RGB image patches and depth map patches with two separate branches and fuse them using cross-attention to exchange information with each other. Furthermore, with the help of pixel-wise relative depth values in depth maps, we develop new relative position embeddings in the cross-attention mechanism to capture more accurate sequence ordering of input tokens. Our method is the first Swin-Transformer-based backbone architecture for monocular 3D object detection. The experimental results on the KITTI and the challenging Waymo Open datasets show the effectiveness of our proposed method and superior performance over previous counterparts.
Keywords: Monocular 3D object detection; normalizing flows; Swin Transformer
Empirical correction of kinetic model for polymer thermal reaction process based on first order reaction kinetics (Cited by: 2)
2
Authors: Zhaoxiang Zhang, Fei Guo, Wei Song, Xiaohong Jia, Yuming Wang. Chinese Journal of Chemical Engineering (SCIE, EI, CAS, CSCD), 2021, Issue 10, pp. 132-144 (13 pages)
Based on the theory of first-order reaction kinetics, a thermal reaction kinetic model in integral form has been derived. To make the model more applicable, the effects of time and the conversion degree on the reaction rate parameters were considered. Two types of undetermined functions were used to compensate for the intrinsic variation of the reaction rate, and two types of correction methods are provided. The model was explained and verified using published experimental data of different polymer thermal reaction systems, and its effectiveness and wide adaptability were confirmed. For the given kinetic model, only one parameter needs to be determined. The proposed empirical model is expected to be used in the numerical simulation of polymer thermal reaction processes.
Keywords: Thermal reaction; Polymer processing; Reaction kinetics; Mathematical modeling; Empirical correction
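The abstract above does not reproduce the model's equations. As background only, the textbook first-order rate law and its integral form can be written as follows, where the conversion degree \alpha and the rate constant k are notation assumed here and not taken from the paper. The paper's two correction functions are not given in the abstract and are not shown.

```latex
\begin{align}
  \frac{d\alpha}{dt} &= k\,(1-\alpha), \qquad \alpha(0) = 0
    && \text{(first-order rate law)} \\
  \alpha(t) &= 1 - e^{-kt}
    && \text{(integral form, constant } k\text{)} \\
  \alpha(t) &= 1 - \exp\!\left(-\int_0^t k(\tau)\,d\tau\right)
    && \text{(integral form, time-dependent } k\text{)}
\end{align}
```

The third line shows why a time-dependent rate parameter can be absorbed into the integral form: only the accumulated integral of k enters the conversion.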
Research Progress on Metastatic Carcinoma of the Spleen
3
Authors: Zhaoxiang Zhang. Chinese Journal of Clinical Oncology (CSCD), 2006, Issue 2, pp. 142-147 (6 pages)
Metastatic carcinoma of the spleen (MCS) is a rare condition which is frequently misdiagnosed. Research progress on the prevalence, clinicopathological features and diagnosis of MCS from the Chinese and English medical literature was reviewed to increase understanding of all aspects related to MCS. It is hoped that a better comprehension of MCS will increase the diagnostic level and the rate of MCS detection.
Keywords: neoplasm; spleen; tumor metastasis; pathology; clinical diagnosis
GRAMO: geometric resampling augmentation for monocular 3D object detection
4
Authors: He GUAN, Chunfeng SONG, Zhaoxiang ZHANG. Frontiers of Computer Science (SCIE, EI, CSCD), 2024, Issue 5, pp. 161-169 (9 pages)
Data augmentation is widely recognized as an effective means of bolstering model robustness. However, when applied to monocular 3D object detection, non-geometric image augmentation neglects the critical link between the image and physical space, resulting in the semantic collapse of the extended scene. To address this issue, we propose two geometric-level data augmentation operators named Geometric-Copy-Paste (Geo-CP) and Geometric-Crop-Shrink (Geo-CS). Both operators introduce geometric consistency based on the principle of perspective projection, complementing the options available for data augmentation in monocular 3D. Specifically, Geo-CP replicates local patches by reordering object depths to mitigate perspective occlusion conflicts, and Geo-CS re-crops local patches for simultaneous scaling of distance and scale to unify appearance and annotation. These operations ameliorate the problem of class imbalance in the monocular paradigm by increasing the quantity and distribution of geometrically consistent samples. Experiments demonstrate that our geometric-level augmentation operators effectively improve robustness and performance in the KITTI and Waymo monocular 3D detection benchmarks.
Keywords: 3D detection; monocular; augmentation; geometry
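The "simultaneous scaling of distance and scale" mentioned in the abstract above rests on a basic property of perspective projection: under a pinhole camera, apparent size is inversely proportional to depth. The sketch below illustrates only that relation. It is not the authors' Geo-CS operator, and the function names, focal length, and object size are hypothetical values chosen for illustration.

```python
def projected_height(focal_px: float, height_m: float, depth_m: float) -> float:
    """Pixel height of an object under a pinhole camera: h = f * H / Z."""
    if depth_m <= 0:
        raise ValueError("depth must be positive")
    return focal_px * height_m / depth_m


def depth_after_rescale(depth_m: float, scale: float) -> float:
    """Depth label that stays consistent after resizing a patch by `scale`.

    Shrinking a patch (scale < 1) makes the object look farther away,
    so the annotated depth must grow by 1 / scale.
    """
    if scale <= 0:
        raise ValueError("scale must be positive")
    return depth_m / scale


# Hypothetical numbers: a 1.5 m tall object, 21 m away, focal length 700 px.
h_original = projected_height(700.0, 1.5, 21.0)
new_depth = depth_after_rescale(21.0, 0.5)      # patch shrunk to half size
h_shrunk = projected_height(700.0, 1.5, new_depth)
```

With these numbers, halving the patch size doubles the consistent depth label, and re-projecting the object at the new depth gives half the original pixel height, so appearance and annotation agree.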
N-fold Bernoulli probability based adaptive fast-tracking algorithm and its application to autonomous aerial refuelling (Cited by: 5)
5
Authors: Jarhinbek RASOL, Yuelei XU, Qing ZHOU, Tian HUI, Zhaoxiang ZHANG. Chinese Journal of Aeronautics (SCIE, EI, CAS, CSCD), 2023, Issue 1, pp. 356-368 (13 pages)
Recently, deep learning has been widely utilized for object tracking tasks. However, deep learning encounters limits in tasks such as Autonomous Aerial Refueling (AAR), where the target object can vary substantially in size, requiring high-precision real-time performance in embedded systems. This paper presents a novel embedded adaptive single-object tracking framework based on an improved YOLOv4 detection approach and an n-fold Bernoulli probability theorem. First, an Asymmetric Convolutional Network (ACNet) and dense blocks are combined with the YOLOv4 architecture to detect small objects with high precision when similar objects are in the background. The prior object information, such as its location in the previous frame and its speed, is utilized to adaptively track objects of various sizes. Moreover, based on the n-fold Bernoulli probability theorem, we develop a filter that uses statistical laws to reduce the false positive rate of object tracking. To evaluate the efficiency of our algorithm, a new AAR dataset is collected, and extensive AAR detection and tracking experiments are performed. The results demonstrate that our improved detection algorithm is better than the original YOLOv4 algorithm on small and similar object detection tasks; the object tracking algorithm is better than state-of-the-art object tracking algorithms on refueling drogue tracking tasks.
Keywords: Autonomous aerial refueling; N-fold Bernoulli probability theorem; Object detection; Object tracking; YOLOv4
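The abstract above does not describe how the Bernoulli-based filter is built. As a rough illustration of the underlying statistics only, the sketch below uses the binomial formula for n independent Bernoulli trials to choose how many detections in a window of n frames are needed before a track is trusted. The independence assumption, the per-frame false positive rate, and both function names are assumptions made here, not details from the paper.

```python
from math import comb
from typing import Optional


def binomial_tail(n: int, k: int, p: float) -> float:
    """P(at least k successes in n independent Bernoulli(p) trials)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


def min_hits_for_confidence(n: int, p_false: float, max_false_rate: float) -> Optional[int]:
    """Smallest k such that k or more false detections in n frames
    occur with probability at most `max_false_rate`.

    Returns None if even requiring all n frames is not strict enough.
    """
    for k in range(n + 1):
        if binomial_tail(n, k, p_false) <= max_false_rate:
            return k
    return None


# Hypothetical setting: 5-frame window, 10% per-frame false positive rate,
# and at most a 1% chance of accepting a purely spurious track.
required_hits = min_hits_for_confidence(5, 0.1, 0.01)
```

Under these assumed numbers, a candidate would need to be detected in at least `required_hits` of the last 5 frames before being accepted, which trades a short confirmation delay for a lower false positive rate.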
Biologically inspired visual computing: the state of the art (Cited by: 2)
6
Authors: Wangli HAO, Ian Max ANDOLINA, Wei WANG, Zhaoxiang ZHANG. Frontiers of Computer Science (SCIE, EI, CSCD), 2021, Issue 1, pp. 1-15 (15 pages)
Visual information is highly advantageous for the evolutionary success of almost all animals. This information is likewise critical for many computing tasks, and visual computing has achieved tremendous successes in numerous applications over the last 60 years or so. In that time, the development of visual computing has moved forwards with inspiration from biological mechanisms many times. In particular, deep neural networks were inspired by the hierarchical processing mechanisms that exist in the visual cortex of primate brains (including ours), and have achieved huge breakthroughs in many domain-specific visual tasks. In order to better understand biologically inspired visual computing, we will present a survey of the current work, and hope to offer some new avenues for rethinking visual computing and designing novel neural network architectures.
Keywords: brain-inspired; vision; neural models; intelligence; novel neural networks
Automatic object classification using motion blob based local feature fusion for traffic scene surveillance (Cited by: 2)
7
Authors: Zhaoxiang ZHANG, Yunhong WANG. Frontiers of Computer Science (SCIE, EI, CSCD), 2012, Issue 5, pp. 537-546 (10 pages)
Automatic object classification in traffic scene videos is an important issue for intelligent visual surveillance with great potential for all kinds of security applications. However, this problem is very challenging for the following reasons. Firstly, regions of interest in videos are of low resolution and limited size due to the capacity of conventional surveillance cameras. Secondly, the intra-class variations are very large due to changes of view angles, lighting conditions, and environments. Thirdly, real-time performance of algorithms is always required for real applications. In this paper, we evaluate the performance of local feature descriptors for automatic object classification in traffic scenes. Image intensity or gradient information is directly used to construct effective feature vectors from regions of interest extracted via motion detection. This strategy has great advantages of efficiency compared to various complicated texture features. We not only analyze and evaluate the performance of different feature descriptors, but also fuse different scales and features to achieve better performance. Numerous experiments are conducted and experimental results demonstrate the efficiency and effectiveness of this strategy with robustness to noise, variance of view angles, lighting conditions, and environments.
Keywords: visual surveillance; object classification; motion detection; feature fusion
Local structured representation for generic object detection (Cited by: 1)
8
Authors: Junge ZHANG, Kaiqi HUANG, Tieniu TAN, Zhaoxiang ZHANG. Frontiers of Computer Science (SCIE, EI, CSCD), 2017, Issue 4, pp. 632-648 (17 pages)
Structure information plays an important role in both object recognition and detection. This paper studies what visual structure is and addresses the problem of structure modeling and representation from two aspects: visual feature and topology model. Firstly, at the feature level, we propose a Local Structured Descriptor to capture the object's local structure effectively, and develop the descriptors from shape and texture information, respectively. Secondly, at the topology level, we present a local structured model with a boosted feature selection and fusion scheme. All experiments are conducted on the challenging PASCAL Visual Object Classes (VOC) datasets from VOC2007 to VOC2010. Experimental results show that our method achieves very competitive performance.
Keywords: Local Structured Descriptor; Local Structured Model; Object Representation; Object Structure; Object Detection; PASCAL VOC
Effect of contact forms on the wear of hard silicon surfaces by soft polymers
9
Authors: Zhaoxiang ZHANG, Xiaohong JIA, Fei GUO, Zhongde SHAN, Yuming WANG. Friction (SCIE, EI, CAS, CSCD), 2021, Issue 5, pp. 918-928 (11 pages)
The mechanism of hard surfaces worn by soft polymers is not clearly understood. In this paper, a new hypothesis is proposed: the stress acting on the hard surface under certain working conditions is the main reason for wear of the hard surface by a soft polymer. The hypothesis was investigated by changing the contact form between tribo-pairs. For this, friction tests between six polymer spheres and smooth, rough, and inclined monocrystalline silicon surfaces were carried out. The results show that for the same tribo-pair, the silicon surface will not be worn in some contact forms, but in other contact forms it will be worn. We believe the wear of a hard surface by a soft polymer is the result of the combined stress state acting on the hard surface.
Keywords: polymers; silicon surface; wear; combined stress
Toward few-shot domain adaptation with perturbation-invariant representation and transferable prototypes
10
Authors: Junsong FAN, Yuxi WANG, He GUAN, Chunfeng SONG, Zhaoxiang ZHANG. Frontiers of Computer Science (SCIE, EI, CSCD), 2022, Issue 3, pp. 83-93 (11 pages)
Domain adaptation (DA) for semantic segmentation aims to reduce the annotation burden for the dense pixel-level prediction task. It focuses on tackling the domain gap problem and manages to transfer knowledge learned from abundant source data to new target scenes. Although recent works have achieved rapid progress in this field, they still underperform fully supervised models by a large margin due to the absence of any available hints in the target domain. Considering that few-shot labels are cheap to obtain in practical applications, we attempt to leverage them to mitigate the performance gap between DA and fully supervised methods. The key to this problem is to leverage the few-shot labels to learn robust domain-invariant predictions effectively. To this end, we first design a data perturbation strategy to enhance the robustness of the representations. Furthermore, a transferable prototype module is proposed to bridge the domain gap based on the source data and few-shot targets. By means of these proposed methods, our approach can perform on par with the fully supervised models to some extent. We conduct extensive experiments to demonstrate the effectiveness of the proposed methods and report the state-of-the-art performance on two popular DA tasks, i.e., from GTA5 to Cityscapes and SYNTHIA to Cityscapes.
Keywords: domain adaptation; semantic segmentation