期刊文献+
共找到90,372篇文章
< 1 2 250 >
每页显示 20 50 100
Effect of low-degree astigmatism on the objective visual quality
1
作者 Ling-Ying Ye Shu-Feng Li +2 位作者 Jing-Jing Zuo Jin Li Hui-Xiang Ma 《International Journal of Ophthalmology(English edition)》 SCIE CAS 2024年第6期1086-1093,共8页
AIM:To evaluate the effect of low-degree astigmatism on objective visual quality through the Optical Quality Analysis System(OQAS).METHODS:This study enrolled 46 participants(aged 23 to 30y,90 eyes)with normal or corr... AIM:To evaluate the effect of low-degree astigmatism on objective visual quality through the Optical Quality Analysis System(OQAS).METHODS:This study enrolled 46 participants(aged 23 to 30y,90 eyes)with normal or corrected-to-normal vision.The cylindrical lenses(0,0.5,0.75,1.0,and 1.25 D)were placed at the axial direction(180°,45°,90°,and 135°)in front of the eyes with the best correction to form 16 types of regular low-degree astigmatism.OQAS was used to detect the objective visual quality,recorded as the objective scattering index(OSI),OQAS values at contrasts of 100%,20%,and 9%predictive visual acuity(OV100%,OV20%,and OV9%),modulation transfer function cut-off(MTFcut-off)and Strehl ratio(SR).The mixed effect linear model was used to compare objective visual quality differences between groups and examine associations between astigmatic magnitude and objective visual quality parameters.RESULTS:Apparent negative relationships between the magnitude of low astigmatism and objective visual quality were observed.The increase of OSI per degree of astigmatism at 180°,45°,90°,and 135°axis were 0.38(95%CI:0.35,0.42),0.50(95%CI:0.46,0.53),0.49(95%CI:0.45,0.54)and 0.37(95%CI:0.34,0.41),respectively.The decrease of MTFcut-off per degree of astigmatism at 180°,45°,90°,and 135°axis were-10.30(95%CI:-11.43,-9.16),-12.73(95%CI:-13.62,-11.86),-12.75(95%CI:-13.79,-11.70),and-9.97(95%CI:-10.92,-9.03),respectively.At the same astigmatism degree,OSI at 45°and 90°axis were higher than that at 0°and 135°axis,while MTFcut-off were lower.CONCLUSION:Low astigmatism of only 0.50 D can significantly reduce the objective visual quality. 展开更多
关键词 low-degree astigmatism objective visual quality Optical Quality Analysis System
下载PDF
Distributed Stochastic Optimization with Compression for Non-Strongly Convex Objectives
2
作者 Xuanjie Li Yuedong Xu 《Computer Modeling in Engineering & Sciences》 SCIE EI 2024年第4期459-481,共23页
We are investigating the distributed optimization problem,where a network of nodes works together to minimize a global objective that is a finite sum of their stored local functions.Since nodes exchange optimization p... We are investigating the distributed optimization problem,where a network of nodes works together to minimize a global objective that is a finite sum of their stored local functions.Since nodes exchange optimization parameters through the wireless network,large-scale training models can create communication bottlenecks,resulting in slower training times.To address this issue,CHOCO-SGD was proposed,which allows compressing information with arbitrary precision without reducing the convergence rate for strongly convex objective functions.Nevertheless,most convex functions are not strongly convex(such as logistic regression or Lasso),which raises the question of whether this algorithm can be applied to non-strongly convex functions.In this paper,we provide the first theoretical analysis of the convergence rate of CHOCO-SGD on non-strongly convex objectives.We derive a sufficient condition,which limits the fidelity of compression,to guarantee convergence.Moreover,our analysis demonstrates that within the fidelity threshold,this algorithm can significantly reduce transmission burden while maintaining the same convergence rate order as its no-compression equivalent.Numerical experiments further validate the theoretical findings by demonstrating that CHOCO-SGD improves communication efficiency and keeps the same convergence rate order simultaneously.And experiments also show that the algorithm fails to converge with low compression fidelity and in time-varying topologies.Overall,our study offers valuable insights into the potential applicability of CHOCO-SGD for non-strongly convex objectives.Additionally,we provide practical guidelines for researchers seeking to utilize this algorithm in real-world scenarios. 展开更多
关键词 Distributed stochastic optimization arbitrary compression fidelity non-strongly convex objective function
下载PDF
Objective Model Selection in Physics: Exploring the Finite Information Quantity Approach
3
作者 Boris Menin 《Journal of Applied Mathematics and Physics》 2024年第5期1848-1889,共42页
Traditional methods for selecting models in experimental data analysis are susceptible to researcher bias, hindering exploration of alternative explanations and potentially leading to overfitting. The Finite Informati... Traditional methods for selecting models in experimental data analysis are susceptible to researcher bias, hindering exploration of alternative explanations and potentially leading to overfitting. The Finite Information Quantity (FIQ) approach offers a novel solution by acknowledging the inherent limitations in information processing capacity of physical systems. This framework facilitates the development of objective criteria for model selection (comparative uncertainty) and paves the way for a more comprehensive understanding of phenomena through exploring diverse explanations. This work presents a detailed comparison of the FIQ approach with ten established model selection methods, highlighting the advantages and limitations of each. We demonstrate the potential of FIQ to enhance the objectivity and robustness of scientific inquiry through three practical examples: selecting appropriate models for measuring fundamental constants, sound velocity, and underwater electrical discharges. Further research is warranted to explore the full applicability of FIQ across various scientific disciplines. 展开更多
关键词 Comparative Uncertainty Finite Information Quantity Formulating a Model Measurement Accuracy Limit objective Model Selection
下载PDF
Eclipse计划系统MU Objective工具在鼻咽癌容积调强放射治疗计划中的应用研究
4
作者 陈宗友 曾华驱 +5 位作者 汤树奎 林启富 梁柱 赖春任 赵善辉 温尊北 《中国医学装备》 2023年第10期53-60,共8页
目的:探讨Varian Eclipse计划系统MU Objective工具在鼻咽癌容积调强放射治疗(VMAT)计划中的应用,旨在保证治疗计划质量的同时降低治疗计划的机器跳数(MU)。方法:选取已接受容积调强放射治疗技术的21例鼻咽癌患者,在未使用MU Objective... 目的:探讨Varian Eclipse计划系统MU Objective工具在鼻咽癌容积调强放射治疗(VMAT)计划中的应用,旨在保证治疗计划质量的同时降低治疗计划的机器跳数(MU)。方法:选取已接受容积调强放射治疗技术的21例鼻咽癌患者,在未使用MU Objective工具的情况下给每例患者设计治疗参考计划(MU_(ref));将使用MU Objective工具对MU_(ref)进行重新优化,分别设置MU_(ref)的MU的30%、50%、70%、90%和120%,分别命名为MU_(30%)、MU_(50%)、MU_(70%)、MU_(90%)和MU_(120%)。将治疗计划MU_(30%)、MU_(50%)、MU_(70%)、MU_(90%)和MU_(120%)分别与MU_(ref)进行配对t检验统计分析,比较两种治疗计划的靶区、危及器官(OAR)剂量分布和计划的MU。结果:使用MU Objective工具对参考计划优化MU后,与参考计划相比部分靶区和OAR剂量体积参数差异虽有统计学意义,但无临床意义。治疗计划参数为MU_(30%)、MU_(50%)、MU_(70%)和MU_(90%)时,与MU_(ref)比较计划MU平均分别减少21.5%、19.5%、16.6%和8%,差异有统计学意义(t=9.652,t=8.843,t=8.963,t=11.323;P<0.05)。在MU_(120%)时,治疗计划MU平均增加1.7%,但未显著提高靶区剂量覆盖。结论:对于鼻咽癌VMAT计划,使用Eclipse计划系统的MU Objective工具在减少计划MU的同时可获得与参考计划相似的剂量分布。MU Objective工具可作为鼻咽癌或其他部位VMAT计划的常规使用工具,以保证计划质量同时减少放射治疗计划的MU。 展开更多
关键词 容积调强放射治疗(VMAT) Eclispe计划系统 MU objective工具 机器跳数(MU)
下载PDF
Objective Identification and Climatic Characteristics of Heavy-Precipitation Northeastern China Cold Vortexes
5
作者 Xu CHEN Xiaoyong ZHUGE +2 位作者 Xidi ZHANG Yuan WANG Daokai XUE 《Advances in Atmospheric Sciences》 SCIE CAS CSCD 2023年第2期305-316,I0009,I0010,共14页
The northeastern China cold vortex(NCCV)plays an important role in regional rainstorms over East Asia.Using the National Centers for Environmental Prediction Final reanalysis dataset and the Global Precipitation Measu... The northeastern China cold vortex(NCCV)plays an important role in regional rainstorms over East Asia.Using the National Centers for Environmental Prediction Final reanalysis dataset and the Global Precipitation Measurement product,an objective algorithm for identifying heavy-precipitation NCCV(HPCV)events was designed,and the climatological features of 164 HPCV events from 2001 to 2019 were investigated.The number of HPCV events showed an upward linear trend,with the highest frequency of occurrence in summer.The most active region of HPCV samples was the Northeast China Plain between 40°–55°N.Most HPCV events lasted 3–5 days and had radii ranging from 250 to 1000 km.The duration of HPCV events with larger sizes was longer.About half of the HPCV events moved into(moved out of)the definition region(35°–60°N,115°–145°E),and half initiated(dissipated)within the region.The initial position was close to the western boundary of the definition region,and the final position was mainly near the eastern boundary.The locations associated with the precipitation were mostly concentrated within 2000 km southeast of the HPCV systems,and they were farther from the center in the cold season than in the warm season. 展开更多
关键词 northeastern China cold vortex heavy precipitation objective identification climatological features
下载PDF
Multi‐objective particle swarm optimisation of complex product change plan considering service performance
6
作者 Ruizhao Zheng Yong Zhang +4 位作者 Xiaoyan Sun Faguang Wang Lei Yang Chen Peng Yulong Wang 《CAAI Transactions on Intelligence Technology》 SCIE EI 2023年第3期1058-1076,共19页
Design change is an inevitable part of the product development process.This study proposes an improved binary multi‐objective PSO algorithm guided by problem char-acteristics(P‐BMOPSO)to solve the optimisation probl... Design change is an inevitable part of the product development process.This study proposes an improved binary multi‐objective PSO algorithm guided by problem char-acteristics(P‐BMOPSO)to solve the optimisation problem of complex product change plan considering service performance.Firstly,a complex product multi‐layer network with service performance is established for the first time to reveal the impact of change effect propagation on the product service performance.Secondly,the concept of service performance impact(SPI)is defined by decoupling the impact of strongly associated nodes on the service performance in the process of change affect propagation.Then,a triple‐objective selection model of change nodes is established,which includes the three indicators:SPI degree,change cost,and change time.Furthermore,an integer multi‐objective particle swarm optimisation algorithm guided by problem characteristics is developed to solve the model above.Experimental results on the design change problem of a certain type of Skyworth TV verify the effectiveness of the established optimisation model and the proposed P‐BMOPSO algorithm. 展开更多
关键词 multi‐objective particle swarm optimization product change service performance
下载PDF
Multi-Objective Optimization with Artificial Neural Network Based Robust Paddy Yield Prediction Model
7
作者 S.Muthukumaran P.Geetha E.Ramaraj 《Intelligent Automation & Soft Computing》 SCIE 2023年第1期215-230,共16页
Agriculture plays a vital role in the food production process that occupies nearly one-third of the total surface of the earth.Rice is propagated from the seeds of paddy and it is a stable food almost used byfifty per... Agriculture plays a vital role in the food production process that occupies nearly one-third of the total surface of the earth.Rice is propagated from the seeds of paddy and it is a stable food almost used byfifty percent of the total world population.The extensive growth of the human population alarms us to ensure food security and the country should take proper food steps to improve the yield of food grains.This paper concentrates on improving the yield of paddy by predicting the factors that influence the growth of paddy with the help of Evolutionary Computation Techniques.Most of the researchers used to relay on historical records of meteorological parameters to predict the yield of paddy.There is a lack in analyzing the day to day impact of meteorological parameters such as direction of wind,relative humidity,Instant Wind Speed in paddy cultivation.The real time meteorological data collected and analysis the impact of weather parameters from the day of paddy sowing to till the last day of paddy harvesting with regular time series.A Robust Optimized Artificial Neural Network(ROANN)Algorithm with Genetic Algorithm(GA)and Multi Objective Particle Swarm Optimization Algorithm(MOPSO)proposed to predict the factors that to be concentrated by farmers to improve the paddy yield in cultivation.A real time paddy data collected from farmers of Tamilnadu and the meteorological parameters were matched with the cropping pattern of the farmers to construct the database.The input parameters were optimized either by using GA or MOPSO optimization algorithms to reconstruct the database.Reconstructed database optimized by using Artificial Neural Network Back Propagation Algorithm.The reason for improving the growth of paddy was identified using the output of the Neural Network.Performance metrics such as Accuracy,Error Rate etc were used to measure the performance of the proposed algorithm.Comparative analysis made between ANN with GA and ANN with MOPSO to identify the recommendations for improving the paddy yield. 展开更多
关键词 ANN back propagation algorithm genetic algorithm multi objective particle swarm optimization algorithm
下载PDF
Salient Object Detection Based on a Novel Combination Framework Using the Perceptual Matching and Subjective-Objective Mapping Technologies
8
作者 Jian Han Jialu Li +3 位作者 Meng Liu Zhe Ren Zhimin Cao Xingbin Liu 《Journal of Beijing Institute of Technology》 EI CAS 2023年第1期95-106,共12页
The integrity and fineness characterization of non-connected regions and contours is a major challenge for existing salient object detection.The key to address is how to make full use of the subjective and objective s... The integrity and fineness characterization of non-connected regions and contours is a major challenge for existing salient object detection.The key to address is how to make full use of the subjective and objective structural information obtained in different steps.Therefore,by simulating the human visual mechanism,this paper proposes a novel multi-decoder matching correction network and subjective structural loss.Specifically,the loss pays different attentions to the foreground,boundary,and background of ground truth map in a top-down structure.And the perceived saliency is mapped to the corresponding objective structure of the prediction map,which is extracted in a bottom-up manner.Thus,multi-level salient features can be effectively detected with the loss as constraint.And then,through the mapping of improved binary cross entropy loss,the differences between salient regions and objects are checked to pay attention to the error prone region to achieve excellent error sensitivity.Finally,through tracking the identifying feature horizontally and vertically,the subjective and objective interaction is maximized.Extensive experiments on five benchmark datasets demonstrate that compared with 12 state-of-the-art methods,the algorithm has higher recall and precision,less error and strong robustness and generalization ability,and can predict complete and refined saliency maps. 展开更多
关键词 salient object detection subjective-objective mapping perceptional separation and matching error sensitivity non-connected region detection
下载PDF
Scheme Based on Multi-Level Patch Attention and Lesion Localization for Diabetic Retinopathy Grading
9
作者 Zhuoqun Xia Hangyu Hu +4 位作者 Wenjing Li Qisheng Jiang Lan Pu Yicong Shu Arun Kumar Sangaiah 《Computer Modeling in Engineering & Sciences》 SCIE EI 2024年第7期409-430,共22页
Early screening of diabetes retinopathy(DR)plays an important role in preventing irreversible blindness.Existing research has failed to fully explore effective DR lesion information in fundus maps.Besides,traditional ... Early screening of diabetes retinopathy(DR)plays an important role in preventing irreversible blindness.Existing research has failed to fully explore effective DR lesion information in fundus maps.Besides,traditional attention schemes have not considered the impact of lesion type differences on grading,resulting in unreasonable extraction of important lesion features.Therefore,this paper proposes a DR diagnosis scheme that integrates a multi-level patch attention generator(MPAG)and a lesion localization module(LLM).Firstly,MPAGis used to predict patches of different sizes and generate a weighted attention map based on the prediction score and the types of lesions contained in the patches,fully considering the impact of lesion type differences on grading,solving the problem that the attention maps of lesions cannot be further refined and then adapted to the final DR diagnosis task.Secondly,the LLM generates a global attention map based on localization.Finally,the weighted attention map and global attention map are weighted with the fundus map to fully explore effective DR lesion information and increase the attention of the classification network to lesion details.This paper demonstrates the effectiveness of the proposed method through extensive experiments on the public DDR dataset,obtaining an accuracy of 0.8064. 展开更多
关键词 DDR dataset diabetic retinopathy lesion localization multi-level patch attention mechanism
下载PDF
Deep neural network based on multi-level wavelet and attention for structured illumination microscopy
10
作者 Yanwei Zhang Song Lang +2 位作者 Xuan Cao Hanqing Zheng Yan Gong 《Journal of Innovative Optical Health Sciences》 SCIE EI CSCD 2024年第2期12-23,共12页
Structured illumination microscopy(SIM)is a popular and powerful super-resolution(SR)technique in biomedical research.However,the conventional reconstruction algorithm for SIM heavily relies on the accurate prior know... Structured illumination microscopy(SIM)is a popular and powerful super-resolution(SR)technique in biomedical research.However,the conventional reconstruction algorithm for SIM heavily relies on the accurate prior knowledge of illumination patterns and signal-to-noise ratio(SNR)of raw images.To obtain high-quality SR images,several raw images need to be captured under high fluorescence level,which further restricts SIM’s temporal resolution and its applications.Deep learning(DL)is a data-driven technology that has been used to expand the limits of optical microscopy.In this study,we propose a deep neural network based on multi-level wavelet and attention mechanism(MWAM)for SIM.Our results show that the MWAM network can extract high-frequency information contained in SIM raw images and accurately integrate it into the output image,resulting in superior SR images compared to those generated using wide-field images as input data.We also demonstrate that the number of SIM raw images can be reduced to three,with one image in each illumination orientation,to achieve the optimal tradeoff between temporal and spatial resolution.Furthermore,our MWAM network exhibits superior reconstruction ability on low-SNR images compared to conventional SIM algorithms.We have also analyzed the adaptability of this network on other biological samples and successfully applied the pretrained model to other SIM systems. 展开更多
关键词 Super-resolution reconstruction multi-level wavelet packet transform residual channel attention selective kernel attention
下载PDF
A Secure and Cost-Effective Training Framework Atop Serverless Computing for Object Detection in Blasting
11
作者 Tianming Zhang Zebin Chen +4 位作者 Haonan Guo Bojun Ren Quanmin Xie Mengke Tian Yong Wang 《Computer Modeling in Engineering & Sciences》 SCIE EI 2024年第5期2139-2154,共16页
The data analysis of blasting sites has always been the research goal of relevant researchers.The rise of mobile blasting robots has aroused many researchers’interest in machine learning methods for target detection ... The data analysis of blasting sites has always been the research goal of relevant researchers.The rise of mobile blasting robots has aroused many researchers’interest in machine learning methods for target detection in the field of blasting.Serverless Computing can provide a variety of computing services for people without hardware foundations and rich software development experience,which has aroused people’s interest in how to use it in the field ofmachine learning.In this paper,we design a distributedmachine learning training application based on the AWS Lambda platform.Based on data parallelism,the data aggregation and training synchronization in Function as a Service(FaaS)are effectively realized.It also encrypts the data set,effectively reducing the risk of data leakage.We rent a cloud server and a Lambda,and then we conduct experiments to evaluate our applications.Our results indicate the effectiveness,rapidity,and economy of distributed training on FaaS. 展开更多
关键词 Serverless computing object detection BLASTING
下载PDF
Learning Discriminatory Information for Object Detection on Urine Sediment Image
12
作者 Sixian Chan Binghui Wu +2 位作者 Guodao Zhang Yuan Yao Hongqiang Wang 《Computer Modeling in Engineering & Sciences》 SCIE EI 2024年第1期411-428,共18页
In clinical practice,the microscopic examination of urine sediment is considered an important in vitro examination with many broad applications.Measuring the amount of each type of urine sediment allows for screening,... In clinical practice,the microscopic examination of urine sediment is considered an important in vitro examination with many broad applications.Measuring the amount of each type of urine sediment allows for screening,diagnosis and evaluation of kidney and urinary tract disease,providing insight into the specific type and severity.However,manual urine sediment examination is labor-intensive,time-consuming,and subjective.Traditional machine learning based object detection methods require hand-crafted features for localization and classification,which have poor generalization capabilities and are difficult to quickly and accurately detect the number of urine sediments.Deep learning based object detection methods have the potential to address the challenges mentioned above,but these methods require access to large urine sediment image datasets.Unfortunately,only a limited number of publicly available urine sediment datasets are currently available.To alleviate the lack of urine sediment datasets in medical image analysis,we propose a new dataset named UriSed2K,which contains 2465 high-quality images annotated with expert guidance.Two main challenges are associated with our dataset:a large number of small objects and the occlusion between these small objects.Our manuscript focuses on applying deep learning object detection methods to the urine sediment dataset and addressing the challenges presented by this dataset.Specifically,our goal is to improve the accuracy and efficiency of the detection algorithm and,in doing so,provide medical professionals with an automatic detector that saves time and effort.We propose an improved lightweight one-stage object detection algorithm called Discriminatory-YOLO.The proposed algorithm comprises a local context attention module and a global background suppression module,which aid the detector in distinguishing urine sediment features in the image.The local context attention module captures context information beyond the object region,while the global background suppression module emphasizes objects in uninformative backgrounds.We comprehensively evaluate our method on the UriSed2K dataset,which includes seven categories of urine sediments,such as erythrocytes(red blood cells),leukocytes(white blood cells),epithelial cells,crystals,mycetes,broken erythrocytes,and broken leukocytes,achieving the best average precision(AP)of 95.3%while taking only 10 ms per image.The source code and dataset are available at https://github.com/binghuiwu98/discriminatoryyolov5. 展开更多
关键词 object detection attention mechanism medical image urine sediment
下载PDF
Enhancing Dense Small Object Detection in UAV Images Based on Hybrid Transformer
13
作者 Changfeng Feng Chunping Wang +2 位作者 Dongdong Zhang Renke Kou Qiang Fu 《Computers, Materials & Continua》 SCIE EI 2024年第3期3993-4013,共21页
Transformer-based models have facilitated significant advances in object detection.However,their extensive computational consumption and suboptimal detection of dense small objects curtail their applicability in unman... Transformer-based models have facilitated significant advances in object detection.However,their extensive computational consumption and suboptimal detection of dense small objects curtail their applicability in unmanned aerial vehicle(UAV)imagery.Addressing these limitations,we propose a hybrid transformer-based detector,H-DETR,and enhance it for dense small objects,leading to an accurate and efficient model.Firstly,we introduce a hybrid transformer encoder,which integrates a convolutional neural network-based cross-scale fusion module with the original encoder to handle multi-scale feature sequences more efficiently.Furthermore,we propose two novel strategies to enhance detection performance without incurring additional inference computation.Query filter is designed to cope with the dense clustering inherent in drone-captured images by counteracting similar queries with a training-aware non-maximum suppression.Adversarial denoising learning is a novel enhancement method inspired by adversarial learning,which improves the detection of numerous small targets by counteracting the effects of artificial spatial and semantic noise.Extensive experiments on the VisDrone and UAVDT datasets substantiate the effectiveness of our approach,achieving a significant improvement in accuracy with a reduction in computational complexity.Our method achieves 31.9%and 21.1%AP on the VisDrone and UAVDT datasets,respectively,and has a faster inference speed,making it a competitive model in UAV image object detection. 展开更多
关键词 UAV images TRANSFORMER dense small object detection
下载PDF
Local saliency consistency-based label inference for weakly supervised salient object detection using scribble annotations
14
作者 Shuo Zhao Peng Cui +1 位作者 Jing Shen Haibo Liu 《CAAI Transactions on Intelligence Technology》 SCIE EI 2024年第1期239-249,共11页
Recently,weak supervision has received growing attention in the field of salient object detection due to the convenience of labelling.However,there is a large performance gap between weakly supervised and fully superv... Recently,weak supervision has received growing attention in the field of salient object detection due to the convenience of labelling.However,there is a large performance gap between weakly supervised and fully supervised salient object detectors because the scribble annotation can only provide very limited foreground/background information.Therefore,an intuitive idea is to infer annotations that cover more complete object and background regions for training.To this end,a label inference strategy is proposed based on the assumption that pixels with similar colours and close positions should have consistent labels.Specifically,k-means clustering algorithm was first performed on both colours and coordinates of original annotations,and then assigned the same labels to points having similar colours with colour cluster centres and near coordinate cluster centres.Next,the same annotations for pixels with similar colours within each kernel neighbourhood was set further.Extensive experiments on six benchmarks demonstrate that our method can significantly improve the performance and achieve the state-of-the-art results. 展开更多
关键词 label inference salient object detection weak supervision
下载PDF
Depth-Guided Vision Transformer With Normalizing Flows for Monocular 3D Object Detection
15
作者 Cong Pan Junran Peng Zhaoxiang Zhang 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2024年第3期673-689,共17页
Monocular 3D object detection is challenging due to the lack of accurate depth information.Some methods estimate the pixel-wise depth maps from off-the-shelf depth estimators and then use them as an additional input t... Monocular 3D object detection is challenging due to the lack of accurate depth information.Some methods estimate the pixel-wise depth maps from off-the-shelf depth estimators and then use them as an additional input to augment the RGB images.Depth-based methods attempt to convert estimated depth maps to pseudo-LiDAR and then use LiDAR-based object detectors or focus on the perspective of image and depth fusion learning.However,they demonstrate limited performance and efficiency as a result of depth inaccuracy and complex fusion mode with convolutions.Different from these approaches,our proposed depth-guided vision transformer with a normalizing flows(NF-DVT)network uses normalizing flows to build priors in depth maps to achieve more accurate depth information.Then we develop a novel Swin-Transformer-based backbone with a fusion module to process RGB image patches and depth map patches with two separate branches and fuse them using cross-attention to exchange information with each other.Furthermore,with the help of pixel-wise relative depth values in depth maps,we develop new relative position embeddings in the cross-attention mechanism to capture more accurate sequence ordering of input tokens.Our method is the first Swin-Transformer-based backbone architecture for monocular 3D object detection.The experimental results on the KITTI and the challenging Waymo Open datasets show the effectiveness of our proposed method and superior performance over previous counterparts. 展开更多
关键词 Monocular 3D object detection normalizing flows Swin Transformer
下载PDF
CAW-YOLO:Cross-Layer Fusion and Weighted Receptive Field-Based YOLO for Small Object Detection in Remote Sensing
16
作者 Weiya Shi Shaowen Zhang Shiqiang Zhang 《Computer Modeling in Engineering & Sciences》 SCIE EI 2024年第6期3209-3231,共23页
In recent years,there has been extensive research on object detection methods applied to optical remote sensing images utilizing convolutional neural networks.Despite these efforts,the detection of small objects in re... In recent years,there has been extensive research on object detection methods applied to optical remote sensing images utilizing convolutional neural networks.Despite these efforts,the detection of small objects in remote sensing remains a formidable challenge.The deep network structure will bring about the loss of object features,resulting in the loss of object features and the near elimination of some subtle features associated with small objects in deep layers.Additionally,the features of small objects are susceptible to interference from background features contained within the image,leading to a decline in detection accuracy.Moreover,the sensitivity of small objects to the bounding box perturbation further increases the detection difficulty.In this paper,we introduce a novel approach,Cross-Layer Fusion and Weighted Receptive Field-based YOLO(CAW-YOLO),specifically designed for small object detection in remote sensing.To address feature loss in deep layers,we have devised a cross-layer attention fusion module.Background noise is effectively filtered through the incorporation of Bi-Level Routing Attention(BRA).To enhance the model’s capacity to perceive multi-scale objects,particularly small-scale objects,we introduce a weightedmulti-receptive field atrous spatial pyramid poolingmodule.Furthermore,wemitigate the sensitivity arising from bounding box perturbation by incorporating the joint Normalized Wasserstein Distance(NWD)and Efficient Intersection over Union(EIoU)losses.The efficacy of the proposedmodel in detecting small objects in remote sensing has been validated through experiments conducted on three publicly available datasets.The experimental results unequivocally demonstrate the model’s pronounced advantages in small object detection for remote sensing,surpassing the performance of current mainstream models. 展开更多
关键词 Small object detection attention mechanism cross-layer fusion discrete cosine transform
下载PDF
Multi-Stream Temporally Enhanced Network for Video Salient Object Detection
17
作者 Dan Xu Jiale Ru Jinlong Shi 《Computers, Materials & Continua》 SCIE EI 2024年第1期85-104,共20页
Video salient object detection(VSOD)aims at locating the most attractive objects in a video by exploring the spatial and temporal features.VSOD poses a challenging task in computer vision,as it involves processing com... Video salient object detection(VSOD)aims at locating the most attractive objects in a video by exploring the spatial and temporal features.VSOD poses a challenging task in computer vision,as it involves processing complex spatial data that is also influenced by temporal dynamics.Despite the progress made in existing VSOD models,they still struggle in scenes of great background diversity within and between frames.Additionally,they encounter difficulties related to accumulated noise and high time consumption during the extraction of temporal features over a long-term duration.We propose a multi-stream temporal enhanced network(MSTENet)to address these problems.It investigates saliency cues collaboration in the spatial domain with a multi-stream structure to deal with the great background diversity challenge.A straightforward,yet efficient approach for temporal feature extraction is developed to avoid the accumulative noises and reduce time consumption.The distinction between MSTENet and other VSOD methods stems from its incorporation of both foreground supervision and background supervision,facilitating enhanced extraction of collaborative saliency cues.Another notable differentiation is the innovative integration of spatial and temporal features,wherein the temporal module is integrated into the multi-stream structure,enabling comprehensive spatial-temporal interactions within an end-to-end framework.Extensive experimental results demonstrate that the proposed method achieves state-of-the-art performance on five benchmark datasets while maintaining a real-time speed of 27 fps(Titan XP).Our code and models are available at https://github.com/RuJiaLe/MSTENet. 展开更多
关键词 Video salient object detection deep learning temporally enhanced foreground-background collaboration
下载PDF
SwinVid:Enhancing Video Object Detection Using Swin Transformer
18
作者 Abdelrahman Maharek Amr Abozeid +1 位作者 Rasha Orban Kamal ElDahshan 《Computer Systems Science & Engineering》 2024年第2期305-320,共16页
What causes object detection in video to be less accurate than it is in still images?Because some video frames have degraded in appearance from fast movement,out-of-focus camera shots,and changes in posture.These reas... What causes object detection in video to be less accurate than it is in still images?Because some video frames have degraded in appearance from fast movement,out-of-focus camera shots,and changes in posture.These reasons have made video object detection(VID)a growing area of research in recent years.Video object detection can be used for various healthcare applications,such as detecting and tracking tumors in medical imaging,monitoring the movement of patients in hospitals and long-term care facilities,and analyzing videos of surgeries to improve technique and training.Additionally,it can be used in telemedicine to help diagnose and monitor patients remotely.Existing VID techniques are based on recurrent neural networks or optical flow for feature aggregation to produce reliable features which can be used for detection.Some of those methods aggregate features on the full-sequence level or from nearby frames.To create feature maps,existing VID techniques frequently use Convolutional Neural Networks(CNNs)as the backbone network.On the other hand,Vision Transformers have outperformed CNNs in various vision tasks,including object detection in still images and image classification.We propose in this research to use Swin-Transformer,a state-of-the-art Vision Transformer,as an alternative to CNN-based backbone networks for object detection in videos.The proposed architecture enhances the accuracy of existing VID methods.The ImageNet VID and EPIC KITCHENS datasets are used to evaluate the suggested methodology.We have demonstrated that our proposed method is efficient by achieving 84.3%mean average precision(mAP)on ImageNet VID using less memory in comparison to other leading VID techniques.The source code is available on the website https://github.com/amaharek/SwinVid. 展开更多
关键词 Video object detection vision transformers convolutional neural networks deep learning
下载PDF
A Novel Tensor Decomposition-Based Efficient Detector for Low-Altitude Aerial Objects With Knowledge Distillation Scheme
19
作者 Nianyin Zeng Xinyu Li +2 位作者 Peishu Wu Han Li Xin Luo 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2024年第2期487-501,共15页
Unmanned aerial vehicles(UAVs) have gained significant attention in practical applications, especially the low-altitude aerial(LAA) object detection imposes stringent requirements on recognition accuracy and computati... Unmanned aerial vehicles(UAVs) have gained significant attention in practical applications, especially the low-altitude aerial(LAA) object detection imposes stringent requirements on recognition accuracy and computational resources. In this paper, the LAA images-oriented tensor decomposition and knowledge distillation-based network(TDKD-Net) is proposed,where the TT-format TD(tensor decomposition) and equalweighted response-based KD(knowledge distillation) methods are designed to minimize redundant parameters while ensuring comparable performance. Moreover, some robust network structures are developed, including the small object detection head and the dual-domain attention mechanism, which enable the model to leverage the learned knowledge from small-scale targets and selectively focus on salient features. Considering the imbalance of bounding box regression samples and the inaccuracy of regression geometric factors, the focal and efficient IoU(intersection of union) loss with optimal transport assignment(F-EIoU-OTA)mechanism is proposed to improve the detection accuracy. The proposed TDKD-Net is comprehensively evaluated through extensive experiments, and the results have demonstrated the effectiveness and superiority of the developed methods in comparison to other advanced detection algorithms, which also present high generalization and strong robustness. As a resource-efficient precise network, the complex detection of small and occluded LAA objects is also well addressed by TDKD-Net, which provides useful insights on handling imbalanced issues and realizing domain adaptation. 展开更多
关键词 Attention mechanism knowledge distillation(KD) object detection tensor decomposition(TD) unmanned aerial vehicles(UAVs)
下载PDF
Management of Penetrating Cranioencephalic Trauma Caused by Sharp Metal Objects—Therapeutic and Evolutionary Aspects: 12 Cases at the Renaissance University Hospital in N’Djamena
20
作者 Goumantar Félicien Toudjingar Li-Iyane Olivier Ouambi +3 位作者 Yannick Canton Kessely Donal Djasdé Mahouli Fata Vounki Momar Codé Ba 《Open Journal of Modern Neurosurgery》 2024年第2期170-178,共9页
Introduction: Cranioencephalic trauma caused by bladed weapons is rare, and that caused by sharp objects is exceptional. The aim of our study was to describe the clinical, therapeutic and evolutionary aspects. Materia... Introduction: Cranioencephalic trauma caused by bladed weapons is rare, and that caused by sharp objects is exceptional. The aim of our study was to describe the clinical, therapeutic and evolutionary aspects. Materials and method: This was a descriptive and analytical study over a 48-month period at CHU la Renaissance from January 1, 2018 to December 31, 2021, concerning patients admitted for penetrating cranioencephalic trauma by pointed object. Results: Twelve cases, all male, of penetrating cranioencephalic sharp-force trauma were identified. The mean age was 34 ± 7 years, with extremes of 11 and 60 years. Farmers and herders accounted for 31% and 25% of cases respectively. The average admission time was 47 hours. Brawls were the circumstances of occurrence in 81.2% of cases. Knives (33%), arrows (25%) and iron bars (16.6%) were the objects used. Altered consciousness was present in 43.8% of cases, and focal deficit in 50%. Scannographic lesions were fracture and/or embarrhment (12 cases), intra-parenchymal haematomas (6 cases) and presence of object in place (4 cases). Surgery was performed in 11 patients. Postoperative outcome was favorable in 9 patients. After 12 months, 2 patients were declared unfit. Conclusion: Penetrating head injuries caused by sharp objects are common in Chad. Urgent surgery can prevent disabling after-effects. 展开更多
关键词 Penetrating Trauma SKULL Encephalon Sharp object Surgery Patient Outcome
下载PDF
上一页 1 2 250 下一页 到第
使用帮助 返回顶部