期刊文献+
共找到2,083篇文章
< 1 2 105 >
每页显示 20 50 100
一种街景图像中建筑物高度估算方法
1
作者 戈士博 刘纪平 +1 位作者 王勇 车向红 《遥感信息》 CSCD 北大核心 2024年第3期1-6,共6页
建筑物高度信息是城市三维建模的基础数据,但已有的建筑物高度估算研究多采用LiDAR和SAR等遥感影像。随着计算机和互联网的快速发展,街景数据因采集容易和成本低等特点成为了一种新兴的建筑物高度估算数据源。文章提出一种街景图像中建... 建筑物高度信息是城市三维建模的基础数据,但已有的建筑物高度估算研究多采用LiDAR和SAR等遥感影像。随着计算机和互联网的快速发展,街景数据因采集容易和成本低等特点成为了一种新兴的建筑物高度估算数据源。文章提出一种街景图像中建筑物高度估算方法,首先利用segment anything model实现图像中建筑物像素高度提取;然后利用图像元数据和电子地图数据获取建筑物与相机之间的距离、图像焦距,根据街景图像与建筑物实体的几何关系改进针孔相机模型,构建建筑物高度估算方法;最后选取北京、柏林的Mapillary街景图像开展实验验证。结果表明,与改进前相比,改进后针孔相机模型明显提升了高度估算准确度,RMSE降低了11.31 m,R^(2)提高了0.4,具备实用价值。 展开更多
关键词 街景图像 建筑物高度估算 针孔相机模型 segment anything model Mapillary
下载PDF
基于SAM&ImageJ图像处理的堆石混凝土坝层面露石率研究
2
作者 安宇 徐小蓉 +2 位作者 尹志刚 金峰 张喜喜 《水资源与水工程学报》 CSCD 北大核心 2024年第1期154-161,共8页
堆石混凝土坝层面的外露块石为上下层提供了重要的啮合作用,其投影面积比例是科学评价层间抗剪性能的重要指标。采用国际最新Meta AI模型segment anything model(SAM)对层面外露堆石进行自动图像分割,并基于ImageJ软件对SAM识别后的图... 堆石混凝土坝层面的外露块石为上下层提供了重要的啮合作用,其投影面积比例是科学评价层间抗剪性能的重要指标。采用国际最新Meta AI模型segment anything model(SAM)对层面外露堆石进行自动图像分割,并基于ImageJ软件对SAM识别后的图片进行再加工与图像计算,利用平滑、差分算法、中值滤波等方法精准标定外露堆石,二值化后计算得到层面露石率。结果表明:SAM图像预分割可识别约90%的外露堆石,经过ImageJ二次图像处理后可有效提高小粒径堆石的识别精度,对比手动标注结果误差在±3%以内。以贵州省两座水库的工程应用为例,对浇筑仓面进行分区预处理,结果发现靠近上游、中部、下游不同区域的露石率差别较大,计算得到的层面露石率以10%~30%居多,其中堆石入仓运输通道区域的露石率较低。研究内容与结论可为堆石混凝土结构层间界面抗剪力学性能和大坝蓄水安全稳定的研究提供参考与借鉴。 展开更多
关键词 堆石混凝土坝 segment anything model(SAM) 图像处理技术 露石率 层间抗剪性能
下载PDF
Two-Staged Method for Ice Channel Identification Based on Image Segmentation and Corner Point Regression 被引量:1
3
作者 DONG Wen-bo ZHOU Li +2 位作者 DING Shi-feng WANG Ai-ming CAI Jin-yan 《China Ocean Engineering》 SCIE EI CSCD 2024年第2期313-325,共13页
Identification of the ice channel is the basic technology for developing intelligent ships in ice-covered waters,which is important to ensure the safety and economy of navigation.In the Arctic,merchant ships with low ... Identification of the ice channel is the basic technology for developing intelligent ships in ice-covered waters,which is important to ensure the safety and economy of navigation.In the Arctic,merchant ships with low ice class often navigate in channels opened up by icebreakers.Navigation in the ice channel often depends on good maneuverability skills and abundant experience from the captain to a large extent.The ship may get stuck if steered into ice fields off the channel.Under this circumstance,it is very important to study how to identify the boundary lines of ice channels with a reliable method.In this paper,a two-staged ice channel identification method is developed based on image segmentation and corner point regression.The first stage employs the image segmentation method to extract channel regions.In the second stage,an intelligent corner regression network is proposed to extract the channel boundary lines from the channel region.A non-intelligent angle-based filtering and clustering method is proposed and compared with corner point regression network.The training and evaluation of the segmentation method and corner regression network are carried out on the synthetic and real ice channel dataset.The evaluation results show that the accuracy of the method using the corner point regression network in the second stage is achieved as high as 73.33%on the synthetic ice channel dataset and 70.66%on the real ice channel dataset,and the processing speed can reach up to 14.58frames per second. 展开更多
关键词 ice channel ship navigation IDENTIFICATION image segmentation corner point regression
下载PDF
Empowering Diagnosis: Cutting-Edge Segmentation and Classification in Lung Cancer Analysis
4
作者 Iftikhar Naseer Tehreem Masood +4 位作者 Sheeraz Akram Zulfiqar Ali Awais Ahmad Shafiq Ur Rehman Arfan Jaffar 《Computers, Materials & Continua》 SCIE EI 2024年第6期4963-4977,共15页
Lung cancer is a leading cause of global mortality rates.Early detection of pulmonary tumors can significantly enhance the survival rate of patients.Recently,various Computer-Aided Diagnostic(CAD)methods have been dev... Lung cancer is a leading cause of global mortality rates.Early detection of pulmonary tumors can significantly enhance the survival rate of patients.Recently,various Computer-Aided Diagnostic(CAD)methods have been developed to enhance the detection of pulmonary nodules with high accuracy.Nevertheless,the existing method-ologies cannot obtain a high level of specificity and sensitivity.The present study introduces a novel model for Lung Cancer Segmentation and Classification(LCSC),which incorporates two improved architectures,namely the improved U-Net architecture and the improved AlexNet architecture.The LCSC model comprises two distinct stages.The first stage involves the utilization of an improved U-Net architecture to segment candidate nodules extracted from the lung lobes.Subsequently,an improved AlexNet architecture is employed to classify lung cancer.During the first stage,the proposed model demonstrates a dice accuracy of 0.855,a precision of 0.933,and a recall of 0.789 for the segmentation of candidate nodules.The suggested improved AlexNet architecture attains 97.06%accuracy,a true positive rate of 96.36%,a true negative rate of 97.77%,a positive predictive value of 97.74%,and a negative predictive value of 96.41%for classifying pulmonary cancer as either benign or malignant.The proposed LCSC model is tested and evaluated employing the publically available dataset furnished by the Lung Image Database Consortium and Image Database Resource Initiative(LIDC-IDRI).This proposed technique exhibits remarkable performance compared to the existing methods by using various evaluation parameters. 展开更多
关键词 Lung cancer SEGMENTATION AlexNet U-Net classification
下载PDF
Visual Semantic Segmentation Based on Few/Zero-Shot Learning:An Overview
5
作者 Wenqi Ren Yang Tang +2 位作者 Qiyu Sun Chaoqiang Zhao Qing-Long Han 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2024年第5期1106-1126,共21页
Visual semantic segmentation aims at separating a visual sample into diverse blocks with specific semantic attributes and identifying the category for each block,and it plays a crucial role in environmental perception... Visual semantic segmentation aims at separating a visual sample into diverse blocks with specific semantic attributes and identifying the category for each block,and it plays a crucial role in environmental perception.Conventional learning-based visual semantic segmentation approaches count heavily on largescale training data with dense annotations and consistently fail to estimate accurate semantic labels for unseen categories.This obstruction spurs a craze for studying visual semantic segmentation with the assistance of few/zero-shot learning.The emergence and rapid progress of few/zero-shot visual semantic segmentation make it possible to learn unseen categories from a few labeled or even zero-labeled samples,which advances the extension to practical applications.Therefore,this paper focuses on the recently published few/zero-shot visual semantic segmentation methods varying from 2D to 3D space and explores the commonalities and discrepancies of technical settlements under different segmentation circumstances.Specifically,the preliminaries on few/zeroshot visual semantic segmentation,including the problem definitions,typical datasets,and technical remedies,are briefly reviewed and discussed.Moreover,three typical instantiations are involved to uncover the interactions of few/zero-shot learning with visual semantic segmentation,including image semantic segmentation,video object segmentation,and 3D segmentation.Finally,the future challenges of few/zero-shot visual semantic segmentation are discussed. 展开更多
关键词 VISUAL SEGMENTATION SEPARATING
下载PDF
Nodule Detection Using Local Binary Pattern Features to Enhance Diagnostic Decisions
6
作者 Umar Rashid Arfan Jaffar +2 位作者 Muhammad Rashid Mohammed S.Alshuhri Sheeraz Akram 《Computers, Materials & Continua》 SCIE EI 2024年第3期3377-3390,共14页
Pulmonary nodules are small, round, or oval-shaped growths on the lungs. They can be benign (noncancerous) or malignant (cancerous). The size of a nodule can range from a few millimeters to a few centimeters in diamet... Pulmonary nodules are small, round, or oval-shaped growths on the lungs. They can be benign (noncancerous) or malignant (cancerous). The size of a nodule can range from a few millimeters to a few centimeters in diameter. Nodules may be found during a chest X-ray or other imaging test for an unrelated health problem. In the proposed methodology pulmonary nodules can be classified into three stages. Firstly, a 2D histogram thresholding technique is used to identify volume segmentation. An ant colony optimization algorithm is used to determine the optimal threshold value. Secondly, geometrical features such as lines, arcs, extended arcs, and ellipses are used to detect oval shapes. Thirdly, Histogram Oriented Surface Normal Vector (HOSNV) feature descriptors can be used to identify nodules of different sizes and shapes by using a scaled and rotation-invariant texture description. Smart nodule classification was performed with the XGBoost classifier. The results are tested and validated using the Lung Image Consortium Database (LICD). The proposed method has a sensitivity of 98.49% for nodules sized 3–30 mm. 展开更多
关键词 Pulmonary nodules SEGMENTATION HISTOGRAM THRESHOLDING
下载PDF
An Improved Lung Cancer Segmentation Based on Nature-Inspired Optimization Approaches
7
作者 Shazia Shamas Surya Narayan Panda +4 位作者 Ishu Sharma Kalpna Guleria Aman Singh Ahmad Ali AlZubi Mallak Ahmad AlZubi 《Computer Modeling in Engineering & Sciences》 SCIE EI 2024年第2期1051-1075,共25页
The distinction and precise identification of tumor nodules are crucial for timely lung cancer diagnosis andplanning intervention. This research work addresses the major issues pertaining to the field of medical image... The distinction and precise identification of tumor nodules are crucial for timely lung cancer diagnosis andplanning intervention. This research work addresses the major issues pertaining to the field of medical imageprocessing while focusing on lung cancer Computed Tomography (CT) images. In this context, the paper proposesan improved lung cancer segmentation technique based on the strengths of nature-inspired approaches. Thebetter resolution of CT is exploited to distinguish healthy subjects from those who have lung cancer. In thisprocess, the visual challenges of the K-means are addressed with the integration of four nature-inspired swarmintelligent techniques. The techniques experimented in this paper are K-means with Artificial Bee Colony (ABC),K-means with Cuckoo Search Algorithm (CSA), K-means with Particle Swarm Optimization (PSO), and Kmeanswith Firefly Algorithm (FFA). The testing and evaluation are performed on Early Lung Cancer ActionProgram (ELCAP) database. The simulation analysis is performed using lung cancer images set against metrics:precision, sensitivity, specificity, f-measure, accuracy,Matthews Correlation Coefficient (MCC), Jaccard, and Dice.The detailed evaluation shows that the K-means with Cuckoo Search Algorithm (CSA) significantly improved thequality of lung cancer segmentation in comparison to the other optimization approaches utilized for lung cancerimages. The results exhibit that the proposed approach (K-means with CSA) achieves precision, sensitivity, and Fmeasureof 0.942, 0.964, and 0.953, respectively, and an average accuracy of 93%. The experimental results prove thatK-meanswithABC,K-meanswith PSO,K-meanswith FFA, andK-meanswithCSAhave achieved an improvementof 10.8%, 13.38%, 13.93%, and 15.7%, respectively, for accuracy measure in comparison to K-means segmentationfor lung cancer images. Further, it is highlighted that the proposed K-means with CSA have achieved a significantimprovement in accuracy, hence can be utilized by researchers for improved segmentation processes of medicalimage datasets for identifying the targeted region of interest. 展开更多
关键词 LESION lung cancer segmentation medical imaging META-HEURISTIC Artificial Bee Colony(ABC) Cuckoo Search Algorithm(CSA) Particle Swarm Optimization(PSO) Firefly Algorithm(FFA) SEGMENTATION
下载PDF
Micro segment analysis of supercritical methane thermal-hydraulic performance and pseudo-boiling in a PCHE straight channel
8
作者 Qian Li Zi-Jie Lin +3 位作者 Liu Yang Yue Wang Yue Li Wei-Hua Cai 《Petroleum Science》 SCIE EI CAS CSCD 2024年第2期1275-1289,共15页
The printed circuit heat exchanger(PCHE) is receiving wide attention as a new kind of compact heat exchanger and is considered as a promising vaporizer in the LNG process. In this paper, a PCHE straight channel in the... The printed circuit heat exchanger(PCHE) is receiving wide attention as a new kind of compact heat exchanger and is considered as a promising vaporizer in the LNG process. In this paper, a PCHE straight channel in the length of 500 mm is established, with a semicircular cross section in a diameter of 1.2 mm.Numerical simulation is employed to investigate the flow and heat transfer performance of supercritical methane in the channel. The pseudo-boiling theory is adopted and the liquid-like, two-phase-like, and vapor-like regimes are divided for supercritical methane to analyze the heat transfer and flow features.The results are presented in micro segment to show the local convective heat transfer coefficient and pressure drop. It shows that the convective heat transfer coefficient in segments along the channel has a significant peak feature near the pseudo-critical point and a heat transfer deterioration when the average fluid temperature in the segment is higher than the pseudo-critical point. The reason is explained with the generation of vapor-like film near the channel wall that the peak feature related to a nucleateboiling-like state and heat transfer deterioration related to a film-boiling-like state. The effects of parameters, including mass flow rate, pressure, and wall heat flux on flow and heat transfer were analyzed.In calculating of the averaged heat transfer coefficient of the whole channel, the traditional method shows significant deviation and the micro segment weighted average method is adopted. The pressure drop can mainly be affected by the mass flux and pressure and little affected by the wall heat flux. The peak of the convective heat transfer coefficient can only form at high mass flux, low wall heat flux, and near critical pressure, in which condition the nucleate-boiling-like state is easier to appear. Moreover,heat transfer deterioration will always appear, since the supercritical flow will finally develop into a filmboiling-like state. So heat transfer deterioration should be taken seriously in the design and safe operation of vaporizer PCHE. The study of this work clarified the local heat transfer and flow feature of supercritical methane in microchannel and contributed to the deep understanding of supercritical methane flow of the vaporization process in PCHE. 展开更多
关键词 Printed circuit heat exchanger Vaporization Supercritical methane Pseudo-boiling Micro segment analysis
下载PDF
Low-Brightness Object Recognition Based on Deep Learning
9
作者 Shu-Yin Chiang Ting-Yu Lin 《Computers, Materials & Continua》 SCIE EI 2024年第5期1757-1773,共17页
This research focuses on addressing the challenges associated with image detection in low-light environments,particularly by applying artificial intelligence techniques to machine vision and object recognition systems... This research focuses on addressing the challenges associated with image detection in low-light environments,particularly by applying artificial intelligence techniques to machine vision and object recognition systems.The primary goal is to tackle issues related to recognizing objects with low brightness levels.In this study,the Intel RealSense Lidar Camera L515 is used to simultaneously capture color information and 16-bit depth information images.The detection scenarios are categorized into normal brightness and low brightness situations.When the system determines a normal brightness environment,normal brightness images are recognized using deep learning methods.In low-brightness situations,three methods are proposed for recognition.The first method is the SegmentationwithDepth image(SD)methodwhich involves segmenting the depth image,creating amask from the segmented depth image,mapping the obtained mask onto the true color(RGB)image to obtain a backgroundreduced RGB image,and recognizing the segmented image.The second method is theHDVmethod(hue,depth,value)which combines RGB images converted to HSV images(hue,saturation,value)with depth images D to form HDV images for recognition.The third method is the HSD(hue,saturation,depth)method which similarly combines RGB images converted to HSV images with depth images D to form HSD images for recognition.In experimental results,in normal brightness environments,the average recognition rate obtained using image recognition methods is 91%.For low-brightness environments,using the SD method with original images for training and segmented images for recognition achieves an average recognition rate of over 82%.TheHDVmethod achieves an average recognition rate of over 70%,while the HSD method achieves an average recognition rate of over 84%.The HSD method allows for a quick and convenient low-light object recognition system.This research outcome can be applied to nighttime surveillance systems or nighttime road safety systems. 展开更多
关键词 Low-brightness depth image image segmentation image recognition HDV HSD
下载PDF
CrossFormer Embedding DeepLabv3+ for Remote Sensing Images Semantic Segmentation
10
作者 Qixiang Tong Zhipeng Zhu +2 位作者 Min Zhang Kerui Cao Haihua Xing 《Computers, Materials & Continua》 SCIE EI 2024年第4期1353-1375,共23页
High-resolution remote sensing image segmentation is a challenging task. In urban remote sensing, the presenceof occlusions and shadows often results in blurred or invisible object boundaries, thereby increasing the d... High-resolution remote sensing image segmentation is a challenging task. In urban remote sensing, the presenceof occlusions and shadows often results in blurred or invisible object boundaries, thereby increasing the difficultyof segmentation. In this paper, an improved network with a cross-region self-attention mechanism for multi-scalefeatures based onDeepLabv3+is designed to address the difficulties of small object segmentation and blurred targetedge segmentation. First,we use CrossFormer as the backbone feature extraction network to achieve the interactionbetween large- and small-scale features, and establish self-attention associations between features at both large andsmall scales to capture global contextual feature information. Next, an improved atrous spatial pyramid poolingmodule is introduced to establish multi-scale feature maps with large- and small-scale feature associations, andattention vectors are added in the channel direction to enable adaptive adjustment of multi-scale channel features.The proposed networkmodel is validated using the PotsdamandVaihingen datasets. The experimental results showthat, compared with existing techniques, the network model designed in this paper can extract and fuse multiscaleinformation, more clearly extract edge information and small-scale information, and segment boundariesmore smoothly. Experimental results on public datasets demonstrate the superiority of ourmethod compared withseveral state-of-the-art networks. 展开更多
关键词 Semantic segmentation remote sensing multiscale self-attention
下载PDF
Part-Whole Relational Few-Shot 3D Point Cloud Semantic Segmentation
11
作者 Shoukun Xu Lujun Zhang +2 位作者 Guangqi Jiang Yining Hua Yi Liu 《Computers, Materials & Continua》 SCIE EI 2024年第3期3021-3039,共19页
This paper focuses on the task of few-shot 3D point cloud semantic segmentation.Despite some progress,this task still encounters many issues due to the insufficient samples given,e.g.,incomplete object segmentation an... This paper focuses on the task of few-shot 3D point cloud semantic segmentation.Despite some progress,this task still encounters many issues due to the insufficient samples given,e.g.,incomplete object segmentation and inaccurate semantic discrimination.To tackle these issues,we first leverage part-whole relationships into the task of 3D point cloud semantic segmentation to capture semantic integrity,which is empowered by the dynamic capsule routing with the module of 3D Capsule Networks(CapsNets)in the embedding network.Concretely,the dynamic routing amalgamates geometric information of the 3D point cloud data to construct higher-level feature representations,which capture the relationships between object parts and their wholes.Secondly,we designed a multi-prototype enhancement module to enhance the prototype discriminability.Specifically,the single-prototype enhancement mechanism is expanded to the multi-prototype enhancement version for capturing rich semantics.Besides,the shot-correlation within the category is calculated via the interaction of different samples to enhance the intra-category similarity.Ablation studies prove that the involved part-whole relations and proposed multi-prototype enhancement module help to achieve complete object segmentation and improve semantic discrimination.Moreover,under the integration of these two modules,quantitative and qualitative experiments on two public benchmarks,including S3DIS and ScanNet,indicate the superior performance of the proposed framework on the task of 3D point cloud semantic segmentation,compared to some state-of-the-art methods. 展开更多
关键词 Few-shot point cloud semantic segmentation CapsNets
下载PDF
A Novel Approach to Breast Tumor Detection: Enhanced Speckle Reduction and Hybrid Classification in Ultrasound Imaging
12
作者 K.Umapathi S.Shobana +5 位作者 Anand Nayyar Judith Justin R.Vanithamani Miguel Villagómez Galindo Mushtaq Ahmad Ansari Hitesh Panchal 《Computers, Materials & Continua》 SCIE EI 2024年第5期1875-1901,共27页
Breast cancer detection heavily relies on medical imaging, particularly ultrasound, for early diagnosis and effectivetreatment. This research addresses the challenges associated with computer-aided diagnosis (CAD) of ... Breast cancer detection heavily relies on medical imaging, particularly ultrasound, for early diagnosis and effectivetreatment. This research addresses the challenges associated with computer-aided diagnosis (CAD) of breastcancer fromultrasound images. The primary challenge is accurately distinguishing between malignant and benigntumors, complicated by factors such as speckle noise, variable image quality, and the need for precise segmentationand classification. The main objective of the research paper is to develop an advanced methodology for breastultrasound image classification, focusing on speckle noise reduction, precise segmentation, feature extraction, andmachine learning-based classification. A unique approach is introduced that combines Enhanced Speckle ReducedAnisotropic Diffusion (SRAD) filters for speckle noise reduction, U-NET-based segmentation, Genetic Algorithm(GA)-based feature selection, and Random Forest and Bagging Tree classifiers, resulting in a novel and efficientmodel. To test and validate the hybrid model, rigorous experimentations were performed and results state thatthe proposed hybrid model achieved accuracy rate of 99.9%, outperforming other existing techniques, and alsosignificantly reducing computational time. This enhanced accuracy, along with improved sensitivity and specificity,makes the proposed hybrid model a valuable addition to CAD systems in breast cancer diagnosis, ultimatelyenhancing diagnostic accuracy in clinical applications. 展开更多
关键词 Ultrasound images breast cancer tumor classification SEGMENTATION deep learning lesion detection
下载PDF
Cooperative Rate Splitting Transmit Design for Full-Duplex-Enabled Multiple Multicast Communication Systems
13
作者 Siyi Duan Mingsheng Wei +2 位作者 Shidang Li Weiqiang Tan Bencheng Yu 《Computer Modeling in Engineering & Sciences》 SCIE EI 2024年第1期619-638,共20页
This paper examines the performance of Full-Duplex Cooperative Rate Splitting(FD-CRS)with Simultaneous Wireless Information and Power Transfer(SWIPT)support in Multiple Input Single Output(MISO)networks.In a Rate Spli... This paper examines the performance of Full-Duplex Cooperative Rate Splitting(FD-CRS)with Simultaneous Wireless Information and Power Transfer(SWIPT)support in Multiple Input Single Output(MISO)networks.In a Rate Splitting Multiple Access(RSMA)multicast system with two local users and one remote user,the common data stream contains the needs of all users,and all users can decode the common data stream.Therefore,each user can receive some information that other users need,and local users with better channel conditions can use this information to further enhance the reception reliability and data rate of users with poor channel quality.Even using Cell-Center-Users(CCUs)as a cooperative relay to assist the transmission of common data can improve the average system speed.To maximize the minimum achievable rate,we optimize the beamforming vector of Base Station(BS),the common streamsplitting vector,the cooperative distributed beamvector and the strong user transmission power under the power budget constraints of BS and relay devices and the service quality requirements constraints of users.Since the whole problem is not convex,we cannot solve it directly.Therefore,we propose a low complexity algorithm based on Successive Convex Approximation(SCA)technology to find the optimal solution to the problemunder consideration.The simulation results show that FD C-RSMA has better gain andmore powerful than FD C-NOMA,HD C-RSMA,RSMA and NOMA. 展开更多
关键词 Full-duplex cooperative rate segmentation SWIPT RSMA power control
下载PDF
Real-Time Detection and Instance Segmentation of Strawberry in Unstructured Environment
14
作者 Chengjun Wang Fan Ding +4 位作者 Yiwen Wang Renyuan Wu Xingyu Yao Chengjie Jiang Liuyi Ling 《Computers, Materials & Continua》 SCIE EI 2024年第1期1481-1501,共21页
The real-time detection and instance segmentation of strawberries constitute fundamental components in the development of strawberry harvesting robots.Real-time identification of strawberries in an unstructured envi-r... The real-time detection and instance segmentation of strawberries constitute fundamental components in the development of strawberry harvesting robots.Real-time identification of strawberries in an unstructured envi-ronment is a challenging task.Current instance segmentation algorithms for strawberries suffer from issues such as poor real-time performance and low accuracy.To this end,the present study proposes an Efficient YOLACT(E-YOLACT)algorithm for strawberry detection and segmentation based on the YOLACT framework.The key enhancements of the E-YOLACT encompass the development of a lightweight attention mechanism,pyramid squeeze shuffle attention(PSSA),for efficient feature extraction.Additionally,an attention-guided context-feature pyramid network(AC-FPN)is employed instead of FPN to optimize the architecture’s performance.Furthermore,a feature-enhanced model(FEM)is introduced to enhance the prediction head’s capabilities,while efficient fast non-maximum suppression(EF-NMS)is devised to improve non-maximum suppression.The experimental results demonstrate that the E-YOLACT achieves a Box-mAP and Mask-mAP of 77.9 and 76.6,respectively,on the custom dataset.Moreover,it exhibits an impressive category accuracy of 93.5%.Notably,the E-YOLACT also demonstrates a remarkable real-time detection capability with a speed of 34.8 FPS.The method proposed in this article presents an efficient approach for the vision system of a strawberry-picking robot. 展开更多
关键词 YOLACT real-time detection instance segmentation attention mechanism STRAWBERRY
下载PDF
A froth velocity measurement method based on improved U-Net++semantic segmentation in flotation process
15
作者 Yiwei Chen Degang Xu Kun Wan 《International Journal of Minerals,Metallurgy and Materials》 SCIE EI CAS CSCD 2024年第8期1816-1827,共12页
During flotation,the features of the froth image are highly correlated with the concentrate grade and the corresponding working conditions.The static features such as color and size of the bubbles and the dynamic feat... During flotation,the features of the froth image are highly correlated with the concentrate grade and the corresponding working conditions.The static features such as color and size of the bubbles and the dynamic features such as velocity have obvious differences between different working conditions.The extraction of these features is typically relied on the outcomes of image segmentation at the froth edge,making the segmentation of froth image the basis for studying its visual information.Meanwhile,the absence of scientifically reliable training data with label and the necessity to manually construct dataset and label make the study difficult in the mineral flotation.To solve this problem,this paper constructs a tungsten concentrate froth image dataset,and proposes a data augmentation network based on Conditional Generative Adversarial Nets(cGAN)and a U-Net++-based edge segmentation network.The performance of this algorithm is also evaluated and contrasted with other algorithms in this paper.On the results of semantic segmentation,a phase-correlationbased velocity extraction method is finally suggested. 展开更多
关键词 froth flotation froth segmentation froth image data augmentation velocity extraction image features
下载PDF
Hydraulic properties and drought response of a tropical bamboo (Cephalostachyum pergracile)
16
作者 Wanwalee Kongjarat Lu Han +10 位作者 Amy Ny Aina Aritsara Shu-Bin Zhang Gao-Juan Zhao Yong-Jiang Zhang Phisamai Maenpuen Ying-Mei Li Yi-Ke Zou Ming-Yi Li Xue-Nan Li Lian-Bin Tao Ya-Jun Chen 《Plant Diversity》 SCIE CAS CSCD 2024年第3期406-415,共10页
Bamboo plants are an essential component of tropical ecosystems,yet their vulnerability to climate extremes,such as drought,is poorly understood due to limited knowledge of their hydraulic properties.Cephalostachyum p... Bamboo plants are an essential component of tropical ecosystems,yet their vulnerability to climate extremes,such as drought,is poorly understood due to limited knowledge of their hydraulic properties.Cephalostachyum pergracile,a commonly used tropical bamboo species,exhibited a substantially higher mortality rate than other co-occurring bamboos during a severe drought event in 2019,but the underlying mechanisms remain unclear.This study investigated the leaf and stem hydraulic traits related to drought responses,including leaf-stem embolism resistance(P50leaf;P50stem) estimated using optical and X-ray microtomography methods,leaf pressure-volume and water-releasing curves.Additionally,we investigated the seasonal water potentials,native embolism level(PLC) and xylem water source using stable isotope.We found that C.pergracile exhibited strong resistance to embolism,showing low P50leaf,P50stem,and turgor loss point,despite its rapid leaf water loss.Interestingly,its leaves displayed greater resistance to embolism than its stem,suggesting a lack of effective hydraulic vulnerability segmentation(HVS) to protect the stem from excessive xylem tension.During the dry season,approximately 49% of the water was absorbed from the upper 20-cm-deep soil layer.Consequently,significant diurnal variation in leaf water potentials and an increase in midday PLC from 5.87±2.33% in the wet season to 12.87±4.09%in the dry season were observed.In summary,this study demonstrated that the rapid leaf water loss,high reliance on surface water,and a lack of effective HVS in C.pergracile accelerated water depletion and increased xylem embolism even in the typical dry season,which may explain its high mortality rate during extreme drought events in 2019. 展开更多
关键词 Climate change DROUGHT Hydraulic safety Hydraulic vulnerability segmentation Stable isotope Tree mortality
下载PDF
An Intelligent Sensor Data Preprocessing Method for OCT Fundus Image Watermarking Using an RCNN
17
作者 Jialun Lin Qiong Chen 《Computer Modeling in Engineering & Sciences》 SCIE EI 2024年第2期1549-1561,共13页
Watermarks can provide reliable and secure copyright protection for optical coherence tomography(OCT)fundus images.The effective image segmentation is helpful for promoting OCT image watermarking.However,OCT images ha... Watermarks can provide reliable and secure copyright protection for optical coherence tomography(OCT)fundus images.The effective image segmentation is helpful for promoting OCT image watermarking.However,OCT images have a large amount of low-quality data,which seriously affects the performance of segmentationmethods.Therefore,this paper proposes an effective segmentation method for OCT fundus image watermarking using a rough convolutional neural network(RCNN).First,the rough-set-based feature discretization module is designed to preprocess the input data.Second,a dual attention mechanism for feature channels and spatial regions in the CNN is added to enable the model to adaptively select important information for fusion.Finally,the refinement module for enhancing the extraction power of multi-scale information is added to improve the edge accuracy in segmentation.RCNN is compared with CE-Net and MultiResUNet on 83 gold standard 3D retinal OCT data samples.The average dice similarly coefficient(DSC)obtained by RCNN is 6%higher than that of CE-Net.The average 95 percent Hausdorff distance(95HD)and average symmetric surface distance(ASD)obtained by RCNN are 32.4%and 33.3%lower than those of MultiResUNet,respectively.We also evaluate the effect of feature discretization,as well as analyze the initial learning rate of RCNN and conduct ablation experiments with the four different models.The experimental results indicate that our method can improve the segmentation accuracy of OCT fundus images,providing strong support for its application in medical image watermarking. 展开更多
关键词 Watermarks image segmentation rough convolutional neural network attentionmechanism feature discretization
下载PDF
Instance Segmentation of Characters Recognized in Palmyrene Aramaic Inscriptions
18
作者 Adéla Hamplová Alexey Lyavdansky +3 位作者 TomášNovák Ondrej Svojše David Franc Arnošt Veselý 《Computer Modeling in Engineering & Sciences》 SCIE EI 2024年第9期2869-2889,共21页
This study presents a single-class and multi-class instance segmentation approach applied to ancient Palmyrene inscriptions,employing two state-of-the-art deep learning algorithms,namely YOLOv8 and Roboflow 3.0.The go... This study presents a single-class and multi-class instance segmentation approach applied to ancient Palmyrene inscriptions,employing two state-of-the-art deep learning algorithms,namely YOLOv8 and Roboflow 3.0.The goal is to contribute to the preservation and understanding of historical texts,showcasing the potential of modern deep learning methods in archaeological research.Our research culminates in several key findings and scientific contributions.We comprehensively compare the performance of YOLOv8 and Roboflow 3.0 in the context of Palmyrene character segmentation—this comparative analysis mainly focuses on the strengths and weaknesses of each algorithm in this context.We also created and annotated an extensive dataset of Palmyrene inscriptions,a crucial resource for further research in the field.The dataset serves for training and evaluating the segmentation models.We employ comparative evaluation metrics to quantitatively assess the segmentation results,ensuring the reliability and reproducibility of our findings and we present custom visualization tools for predicted segmentation masks.Our study advances the state of the art in semi-automatic reading of Palmyrene inscriptions and establishes a benchmark for future research.The availability of the Palmyrene dataset and the insights into algorithm performance contribute to the broader understanding of historical text analysis. 展开更多
关键词 Optical character recognition instance segmentation Palmyrene ancient languages computer vision
下载PDF
ProNet Adaptive Retinal Vessel Segmentation Algorithm Based on Improved UperNet Network
19
作者 Sijia Zhu Pinxiu Wang Ke Shen 《Computers, Materials & Continua》 SCIE EI 2024年第1期283-302,共20页
This paper proposes a new network structure,namely the ProNet network.Retinal medical image segmentation can help clinical diagnosis of related eye diseases and is essential for subsequent rational treatment.The basel... This paper proposes a new network structure,namely the ProNet network.Retinal medical image segmentation can help clinical diagnosis of related eye diseases and is essential for subsequent rational treatment.The baseline model of the ProNet network is UperNet(Unified perceptual parsing Network),and the backbone network is ConvNext(Convolutional Network).A network structure based on depth-separable convolution and 1×1 convolution is used,which has good performance and robustness.We further optimise ProNet mainly in two aspects.One is data enhancement using increased noise and slight angle rotation,which can significantly increase the diversity of data and help the model better learn the patterns and features of the data and improve the model’s performance.Meanwhile,it can effectively expand the training data set,reduce the influence of noise and abnormal data in the data set on the model,and improve the accuracy and reliability of the model.Another is the loss function aspect,and we finally use the focal loss function.The focal loss function is well suited for complex tasks such as object detection.The function will penalise the loss carried by samples that the model misclassifies,thus enabling better training of the model to avoid these errors while solving the category imbalance problem as a way to improve image segmentation density and segmentation accuracy.From the experimental results,the evaluation metrics mIoU(mean Intersection over Union)enhanced by 4.47%,and mDice enhanced by 2.92% compared to the baseline network.Better generalization effects and more accurate image segmentation are achieved. 展开更多
关键词 Retinal segmentation multifaceted optimization cross-fusion data enhancement focal loss
下载PDF
Multilevel Attention Unet Segmentation Algorithmfor Lung Cancer Based on CT Images
20
作者 Huan Wang Shi Qiu +1 位作者 Benyue Zhang Lixuan Xiao 《Computers, Materials & Continua》 SCIE EI 2024年第2期1569-1589,共21页
Lung cancer is a malady of the lungs that gravely jeopardizes human health.Therefore,early detection and treatment are paramount for the preservation of human life.Lung computed tomography(CT)image sequences can expli... Lung cancer is a malady of the lungs that gravely jeopardizes human health.Therefore,early detection and treatment are paramount for the preservation of human life.Lung computed tomography(CT)image sequences can explicitly delineate the pathological condition of the lungs.To meet the imperative for accurate diagnosis by physicians,expeditious segmentation of the region harboring lung cancer is of utmost significance.We utilize computer-aided methods to emulate the diagnostic process in which physicians concentrate on lung cancer in a sequential manner,erect an interpretable model,and attain segmentation of lung cancer.The specific advancements can be encapsulated as follows:1)Concentration on the lung parenchyma region:Based on 16-bit CT image capturing and the luminance characteristics of lung cancer,we proffer an intercept histogram algorithm.2)Focus on the specific locus of lung malignancy:Utilizing the spatial interrelation of lung cancer,we propose a memory-based Unet architecture and incorporate skip connections.3)Data Imbalance:In accordance with the prevalent situation of an overabundance of negative samples and a paucity of positive samples,we scrutinize the existing loss function and suggest a mixed loss function.Experimental results with pre-existing publicly available datasets and assembled datasets demonstrate that the segmentation efficacy,measured as Area Overlap Measure(AOM)is superior to 0.81,which markedly ameliorates in comparison with conventional algorithms,thereby facilitating physicians in diagnosis. 展开更多
关键词 Lung cancer computed tomography computer-aided diagnosis Unet SEGMENTATION
下载PDF
上一页 1 2 105 下一页 到第
使用帮助 返回顶部