The real-time detection and instance segmentation of strawberries constitute fundamental components in the development of strawberry harvesting robots.Real-time identification of strawberries in an unstructured envi-r...The real-time detection and instance segmentation of strawberries constitute fundamental components in the development of strawberry harvesting robots.Real-time identification of strawberries in an unstructured envi-ronment is a challenging task.Current instance segmentation algorithms for strawberries suffer from issues such as poor real-time performance and low accuracy.To this end,the present study proposes an Efficient YOLACT(E-YOLACT)algorithm for strawberry detection and segmentation based on the YOLACT framework.The key enhancements of the E-YOLACT encompass the development of a lightweight attention mechanism,pyramid squeeze shuffle attention(PSSA),for efficient feature extraction.Additionally,an attention-guided context-feature pyramid network(AC-FPN)is employed instead of FPN to optimize the architecture’s performance.Furthermore,a feature-enhanced model(FEM)is introduced to enhance the prediction head’s capabilities,while efficient fast non-maximum suppression(EF-NMS)is devised to improve non-maximum suppression.The experimental results demonstrate that the E-YOLACT achieves a Box-mAP and Mask-mAP of 77.9 and 76.6,respectively,on the custom dataset.Moreover,it exhibits an impressive category accuracy of 93.5%.Notably,the E-YOLACT also demonstrates a remarkable real-time detection capability with a speed of 34.8 FPS.The method proposed in this article presents an efficient approach for the vision system of a strawberry-picking robot.展开更多
The paper puts forward a variance-time plots method based on slide-window mechanism tocalculate the Hurst parameter to detect Distribute Denial of Service(DDoS)attack in real time.Basedon fuzzy logic technology that c...The paper puts forward a variance-time plots method based on slide-window mechanism tocalculate the Hurst parameter to detect Distribute Denial of Service(DDoS)attack in real time.Basedon fuzzy logic technology that can adjust itself dynamically under the fuzzy rules,an intelligent DDoSjudgment mechanism is designed.This new method calculates the Hurst parameter quickly and detectsDDoS attack in real time.Through comparing the detecting technologies based on statistics andfeature-packet respectively under different experiments,it is found that the new method can identifythe change of the Hurst parameter resulting from DDoS attack traffic with different intensities,andintelligently judge DDoS attack self-adaptively in real time.展开更多
Lightning is a typical example of an instantaneous random point source target. It has close connection with severe convective phenomena such as a thunderstorm, whose distribution, variation, position and forecasting c...Lightning is a typical example of an instantaneous random point source target. It has close connection with severe convective phenomena such as a thunderstorm, whose distribution, variation, position and forecasting can be acquired through lightning observation. In this paper, we discuss the way to achieve instantaneous lightning signal intensification and detection from geostationary orbit by using the differences between the lightning signal and the slowly changing background noise such as that of cloud, land and ocean, combining three methods, spectral filtering, spatial filtering and background noise, enabling removal between frames. After six months of operation in orbit, lightning within the coverage of the Geostationary Lightning Imager was effectively detected, strongly supporting the case for shorttime and real-time early warning, forecasting and tracking of severe convective phenomena in China.展开更多
This paper presents a vision-based crack detection approach for concrete bridge decks using an integrated one-dimensional convolutional neural network(1D-CNN)and long short-term memory(LSTM)method in the image frequen...This paper presents a vision-based crack detection approach for concrete bridge decks using an integrated one-dimensional convolutional neural network(1D-CNN)and long short-term memory(LSTM)method in the image frequency domain.The so-called 1D-CNN-LSTM algorithm is trained using thousands of images of cracked and non-cracked concrete bridge decks.In order to improve the training efficiency,images are first transformed into the frequency domain during a preprocessing phase.The algorithm is then calibrated using the flattened frequency data.LSTM is used to improve the performance of the developed network for long sequence data.The accuracy of the developed model is 99.05%,98.9%,and 99.25%,respectively,for training,validation,and testing data.An implementation framework is further developed for future application of the trained model for large-scale images.The proposed 1D-CNN-LSTM method exhibits superior performance in comparison with existing deep learning methods in terms of accuracy and computation time.The fast implementation of the 1D-CNN-LSTM algorithm makes it a promising tool for real-time crack detection.展开更多
Implantable bioelectronics for analyzing physiological biomarkers has recently been recognized as a promising technique in medical treatment or diagnostics. In this study, we developed a self-powered implantable skinl...Implantable bioelectronics for analyzing physiological biomarkers has recently been recognized as a promising technique in medical treatment or diagnostics. In this study, we developed a self-powered implantable skinlike glucometer for real-time detection of blood glucose level in vivo. Based on the piezo-enzymatic-reaction coupling effect of GOx@ZnO nanowire, the device under an applied deformation can actively output piezoelectric signal containing the glucose-detecting information. No external electricity power source or battery is needed for this device, and the outputting piezoelectric voltage acts as both the biosensing signal and electricity power. A practical application of the skin-like glucometer implanted in mouse body for detecting blood glucose level has been simply demonstrated. These results provide a new technique path for diabetes prophylaxis and treatment.展开更多
Cervical cancer remains a critically important problem for women, especially those women in the developing world where the case-fatality rate is high. There are an estimated 528,000 cases and 266,000 deaths worldwide....Cervical cancer remains a critically important problem for women, especially those women in the developing world where the case-fatality rate is high. There are an estimated 528,000 cases and 266,000 deaths worldwide. Established screening and detection programs in the developed world have lowered the mortality from 40/100,000 to 2/100,000 over the last 60 years. The standard of care has been and continues to be: a screening Papanicolaou smear with or without Human Papilloma Virus (HPV) testing;followed by colposcopy and biopsies and if the smear is abnormal;and followed by treatment if the biopsies show high grade disease (cervical intraepithelial neoplasia (CIN) grades 2 and 3 and Carcinoma-in-situ). Low grade lesions (Pap smears with Atypical Cells of Uncertain Significance (ASCUS), Low Grade Squamous Intraepithelial Lesions (LGSIL), biopsies showing HPV changes or showing CIN 1);are usually followed for two years and then treated if persistent. Treatment can be performed with loop excision, LASER, or cryotherapy. Loop excision yields a specimen which can be reviewed to establish the diagnosis more accurately. LASER vaporizes the lesion and cryotherapy leads to tissue destruction. Under long term study;loop excision, LASER, and cryotherapy have the same rate of cure. The standard of care is expensive and takes 6 - 12 weeks for the individual patient. During the last twenty years, new technologies that can view the cervix and even image the cervix with cellular resolution have been developed. These technologies could lead to a new paradigm in which diagnosis and treatment occurs at a single visit. These technologies include fluorescence and reflectance spectroscopy (probe or wide-field, whole cervix scanning approaches) and fluorescence confocal endomicroscopy or high resolution micro-endoscopy. Both technologies have received Federal Drug Administration (FDA) and have been commercialized. Research trials continue to show their remarkable performance. These technologies are reviewed and clinical trials are summarized. Emerging technologies are coming along that may compete with those already approved and include optical coherence tomography, optical coherence tomography with autofluorescence, diffuse optical microscopy, and dual mode micro-endoscopy. These technologies are also reviewed and where available, clinical data is reported. Optical technologies are ready to diffuse into clinical practice because they will save money and 3 or 4 visits in the developed world and offer the same standard of care to the developing world where more cervical cancer exists.展开更多
An approach to detection of moving objects in video sequences, with application to video surveillance is presented. The algorithm combines two kinds of change points, which are detected from the region-based frame dif...An approach to detection of moving objects in video sequences, with application to video surveillance is presented. The algorithm combines two kinds of change points, which are detected from the region-based frame difference and adjusted background subtraction. An adaptive threshold technique is employed to automatically choose the threshold value to segment the moving objects from the still background. And experiment results show that the algorithm is effective and efficient in practical situations. Furthermore, the algorithm is robust to the effects of the changing of lighting condition and can be applied for video surveillance system.展开更多
For the purpose of monitoring apple fruits effectively throughout the entire growth period in smart orchards.A lightweight model named YOLOv8n-ShuffleNetv2-Ghost-SE was proposed.The ShuffleNetv2 basic modules and down...For the purpose of monitoring apple fruits effectively throughout the entire growth period in smart orchards.A lightweight model named YOLOv8n-ShuffleNetv2-Ghost-SE was proposed.The ShuffleNetv2 basic modules and down-sampling modules were alternately connected,replacing the Backbone of YOLOv8n model.The Ghost modules replaced the Conv modules and the C2fGhost modules replaced the C2f modules in the Neck part of the YOLOv8n.ShuffleNetv2 reduced the memory access cost through channel splitting operations.The Ghost module combined linear and non-linear convolutions to reduce the network computation cost.The Wise-IoU(WIoU)replaced the CIoU for calculating the bounding box regression loss,which dynamically adjusted the anchor box quality threshold and gradient gain allocation strategy,optimizing the size and position of predicted bounding boxes.The Squeeze-and-Excitation(SE)was embedded in the Backbone and Neck part of YOLOv8n to enhance the representation ability of feature maps.The algorithm ensured high precision while having small model size and fast detection speed,which facilitated model migration and deployment.Using 9652 images validated the effectiveness of the model.The YOLOv8n-ShuffleNetv2-Ghost-SE model achieved Precision of 94.1%,Recall of 82.6%,mean Average Precision of 91.4%,model size of 2.6 MB,parameters of 1.18 M,FLOPs of 3.9 G,and detection speed of 39.37 fps.The detection speeds on the Jetson Xavier NX development board were 3.17 fps.Comparisons with advanced models including Faster R-CNN,SSD,YOLOv5s,YOLOv7‑tiny,YOLOv8s,YOLOv8n,MobileNetv3_small-Faster,MobileNetv3_small-Ghost,ShuflleNetv2-Faster,ShuflleNetv2-Ghost,ShuflleNetv2-Ghost-CBAM,ShuflleNetv2-Ghost-ECA,and ShuflleNetv2-Ghost-CA demonstrated that the method achieved smaller model and faster detection speed.The research can provide reference for the development of smart devices in apple orchards.展开更多
Active noise controls are used in a wide field of applications to cancel out unwanted surrounding noise. Control systems based on the feedback structure however have the disadvantage that they may become unstable duri...Active noise controls are used in a wide field of applications to cancel out unwanted surrounding noise. Control systems based on the feedback structure however have the disadvantage that they may become unstable during run-time due to changes in the control path—in this context including the listener’s ear. Especially when applied to active noise cancellation (ANC) headphones, the risk of instability is associated with the risk of harmful influence on the listener’s ear, which is exposed to the speaker in striking distance. This paper discusses several methods to enable the analysis of a feedback ANC system during run-time to immediately detect instability. Finally, a solution is proposed, which identifies the open loop behavior parametrically by means of an adaptive filter to subsequently evaluate the coefficients regarding stability.展开更多
In this paper, we proposed a new concept: depth of drowsiness, which can more precisely describe the drowsiness than existing binary description. A set of effective markers for drowsiness: normalized band norm was suc...In this paper, we proposed a new concept: depth of drowsiness, which can more precisely describe the drowsiness than existing binary description. A set of effective markers for drowsiness: normalized band norm was successfully developed. These markers are invariant from voltage amplitude of brain waves, eliminating the need for calibrating the voltage output of the brain-computer interface devices. A new polling algorithm was designed and implemented for computing the depth of drowsiness. The time cost of data acquisition and processing for each estimate is about one second, which is well suited for real-time applications. Test results with a portable brain-computer interface device show that the depth of drowsiness computed by the method in this paper is generally invariant from ages of test subjects and sensor channels (P3 and C4). The comparison between experiment and computing results indicate that the new method is noticeably better than one of the recent methods in terms of accuracy for predicting the drowsiness.展开更多
The moisture content(MC)of municipal sludge is the key factor affecting sludge treatment and disposal technologies,while the vast majority of existing measurement methods are off-line and time-consuming.To realize rap...The moisture content(MC)of municipal sludge is the key factor affecting sludge treatment and disposal technologies,while the vast majority of existing measurement methods are off-line and time-consuming.To realize rapid online detection for the MC of sludge,a detection method based on the microwave reflection principle is proposed:experiments are carried out and the MC computation model of the sludge is derived using the resonant frequency and the permittivity(e').The results reveal that the detection accuracy of granular sludge with a thickness of 10 mm is higher.The theoretical model between the MC and the real part of e'is developed,and the relationship between the resonant frequency and e'is expressed by a cubic polynomial.The average error and the root mean square error(RMSE)of sludge are 2.06%and 2.49%,respectively.The prediction model for the MC of sludge is also given,and the determination coefficient and RMSE are 0.981 and 2.06%,respectively.展开更多
The advancement of navigation systems for the visually impaired has significantly enhanced their mobility by mitigating the risk of encountering obstacles and guiding them along safe,navigable routes.Traditional appro...The advancement of navigation systems for the visually impaired has significantly enhanced their mobility by mitigating the risk of encountering obstacles and guiding them along safe,navigable routes.Traditional approaches primarily focus on broad applications such as wayfinding,obstacle detection,and fall prevention.However,there is a notable discrepancy in applying these technologies to more specific scenarios,like identifying distinct food crop types or recognizing faces.This study proposes a real-time application designed for visually impaired individuals,aiming to bridge this research-application gap.It introduces a system capable of detecting 20 different food crop types and recognizing faces with impressive accuracies of 83.27%and 95.64%,respectively.These results represent a significant contribution to the field of assistive technologies,providing visually impaired users with detailed and relevant information about their surroundings,thereby enhancing their mobility and ensuring their safety.Additionally,it addresses the vital aspects of social engagements,acknowledging the challenges faced by visually impaired individuals in recognizing acquaintances without auditory or tactile signals,and highlights recent developments in prototype systems aimed at assisting with face recognition tasks.This comprehensive approach not only promises enhanced navigational aids but also aims to enrich the social well-being and safety of visually impaired communities.展开更多
To address the challenges of high complexity,poor real-time performance,and low detection rates for small target vehicles in existing vehicle object detection algorithms,this paper proposes a real-time lightweight arc...To address the challenges of high complexity,poor real-time performance,and low detection rates for small target vehicles in existing vehicle object detection algorithms,this paper proposes a real-time lightweight architecture based on You Only Look Once(YOLO)v5m.Firstly,a lightweight upsampling operator called Content-Aware Reassembly of Features(CARAFE)is introduced in the feature fusion layer of the network to maximize the extraction of deep-level features for small target vehicles,reducing the missed detection rate and false detection rate.Secondly,a new prediction layer for tiny targets is added,and the feature fusion network is redesigned to enhance the detection capability for small targets.Finally,this paper applies L1 regularization to train the improved network,followed by pruning and fine-tuning operations to remove redundant channels,reducing computational and parameter complexity and enhancing the detection efficiency of the network.Training is conducted on the VisDrone2019-DET dataset.The experimental results show that the proposed algorithmreduces parameters and computation by 63.8% and 65.8%,respectively.The average detection accuracy improves by 5.15%,and the detection speed reaches 47 images per second,satisfying real-time requirements.Compared with existing approaches,including YOLOv5m and classical vehicle detection algorithms,our method achieves higher accuracy and faster speed for real-time detection of small target vehicles in edge computing.展开更多
Timely acquisition of rescue target information is critical for emergency response after a flood disaster.Unmanned Aerial Vehicles(UAVs)equipped with remote sensing capabilities offer distinct advantages,including hig...Timely acquisition of rescue target information is critical for emergency response after a flood disaster.Unmanned Aerial Vehicles(UAVs)equipped with remote sensing capabilities offer distinct advantages,including high-resolution imagery and exceptional mobility,making them well suited for monitoring flood extent and identifying rescue targets during floods.However,there are some challenges in interpreting rescue information in real time from flood images captured by UAVs,such as the complexity of the scenarios of UAV images,the lack of flood rescue target detection datasets and the limited real-time processing capabilities of the airborne on-board platform.Thus,we propose a real-time rescue target detection method for UAVs that is capable of efficiently delineating flood extent and identifying rescue targets(i.e.,pedestrians and vehicles trapped by floods).The proposed method achieves real-time rescue information extraction for UAV platforms by lightweight processing and fusion of flood extent extraction model and target detection model.The flood inundation range is extracted by the proposed method in real time and detects targets such as people and vehicles to be rescued based on this layer.Our experimental results demonstrate that the Intersection over Union(IoU)for flood water extraction reaches an impressive 80%,and the IoU for real-time flood water extraction stands at a commendable 76.4%.The information on flood stricken targets extracted by this method in real time can be used for flood emergency rescue.展开更多
Real-time indoor camera localization is a significant problem in indoor robot navigation and surveillance systems.The scene can change during the image sequence and plays a vital role in the localization performance o...Real-time indoor camera localization is a significant problem in indoor robot navigation and surveillance systems.The scene can change during the image sequence and plays a vital role in the localization performance of robotic applications in terms of accuracy and speed.This research proposed a real-time indoor camera localization system based on a recurrent neural network that detects scene change during the image sequence.An annotated image dataset trains the proposed system and predicts the camera pose in real-time.The system mainly improved the localization performance of indoor cameras by more accurately predicting the camera pose.It also recognizes the scene changes during the sequence and evaluates the effects of these changes.This system achieved high accuracy and real-time performance.The scene change detection process was performed using visual rhythm and the proposed recurrent deep architecture,which performed camera pose prediction and scene change impact evaluation.Overall,this study proposed a novel real-time localization system for indoor cameras that detects scene changes and shows how they affect localization performance.展开更多
In order to address the challenge of non-destructive detection of tomato fruit ripeness in controlled environments,this study proposed a real-time instance segmentation method based on the edge device.This method comb...In order to address the challenge of non-destructive detection of tomato fruit ripeness in controlled environments,this study proposed a real-time instance segmentation method based on the edge device.This method combined the principles of phenotype robots and machine vision based on deep learning.A compact and remotely controllable phenotype detection robot was employed to acquire precise data on tomato ripeness.The video data were then processed by using an efficient backbone and the FeatFlowNet structure for feature extraction and analysis of key-frame to non-key-frame mapping from video data.To enhance the diversity of training datasets and the generalization of the model,an innovative approach was chosen by using random enhancement techniques.Besides,the PolyLoss optimization technique was applied to further improve the accuracy of the ripeness multi-class detection tasks.Through validation,the method of this study achieved real-time processing speeds of 90.1 fps(RTX 3070Ti)and 65.5 fps(RTX 2060 S),with an average detection accuracy of 97%compared to manually measured results.This is more accurate and efficient than other instance segmentation models according to actual testing in a greenhouse.Therefore,the results of this research can be deployed in edge devices and provide technical support for unmanned greenhouse monitoring devices or fruit-picking robots in facility environments.展开更多
Aiming at the problem of low accuracy of traditional target detection methods for target detection in endoscopes in substation environments, a CNN-based real-time detection method for masked targets is proposed. The m...Aiming at the problem of low accuracy of traditional target detection methods for target detection in endoscopes in substation environments, a CNN-based real-time detection method for masked targets is proposed. The method adopts the overall design of backbone network, detection network and algorithmic parameter optimisation method, completes the model training on the self-constructed occlusion target dataset, and adopts the multi-scale perception method for target detection. The HNM algorithm is used to screen positive and negative samples during the training process, and the NMS algorithm is used to post-process the prediction results during the detection process to improve the detection efficiency. After experimental validation, the obtained model has the multi-class average predicted value (mAP) of the dataset. It has general advantages over traditional target detection methods. The detection time of a single target on FDDB dataset is 39 ms, which can meet the need of real-time target detection. In addition, the project team has successfully deployed the method into substations and put it into use in many places in Beijing, which is important for achieving the anomaly of occlusion target detection.展开更多
Credit card fraud remains a significant challenge, with financial losses and consumer protection at stake. This study addresses the need for practical, real-time fraud detection methodologies. Using a Kaggle credit ca...Credit card fraud remains a significant challenge, with financial losses and consumer protection at stake. This study addresses the need for practical, real-time fraud detection methodologies. Using a Kaggle credit card dataset, I tackle class imbalance using the Synthetic Minority Oversampling Technique (SMOTE) to enhance modeling efficiency. I compare several machine learning algorithms, including Logistic Regression, Linear Discriminant Analysis, K-nearest Neighbors, Classification and Regression Tree, Naive Bayes, Support Vector, Random Forest, XGBoost, and Light Gradient-Boosting Machine to classify transactions as fraud or genuine. Rigorous evaluation metrics, such as AUC, PRAUC, F1, KS, Recall, and Precision, identify the Random Forest as the best performer in detecting fraudulent activities. The Random Forest model successfully identifies approximately 92% of transactions scoring 90 and above as fraudulent, equating to a detection rate of over 70% for all fraudulent transactions in the test dataset. Moreover, the model captures more than half of the fraud in each bin of the test dataset. SHAP values provide model explainability, with the SHAP summary plot highlighting the global importance of individual features, such as “V12” and “V14”. SHAP force plots offer local interpretability, revealing the impact of specific features on individual predictions. This study demonstrates the potential of machine learning, particularly the Random Forest model, for real-time credit card fraud detection, offering a promising approach to mitigate financial losses and protect consumers.展开更多
Being cheap,nondestructive,and easy to use,gas sensors play important roles in the food industry.However,most gas sensors are suitable more for laboratory-quality fast testing rather than for cold-chain continuous and...Being cheap,nondestructive,and easy to use,gas sensors play important roles in the food industry.However,most gas sensors are suitable more for laboratory-quality fast testing rather than for cold-chain continuous and cumulative testing.Also,an ideal electronic nose(E-nose)in a cold chain should be stable to its surroundings and remain highly accurate and portable.In this work,a portable film bulk acoustic resonator(FBAR)-based E-nose was built for real-time measurement of banana shelf time.The sensor chamber to contain the portable circuit of the E-nose is as small as a smartphone,and by introducing an air-tight FBAR as a reference,the E-nose can avoid most of the drift caused by surroundings.With the help of porous layer by layer(LBL)coating of the FBAR,the sensitivity of the E-nose is 5 ppm to ethylene and 0.5 ppm to isoamyl acetate and isoamyl butyrate,while the detection range is large enough to cover a relative humidity of 0.8.In this regard,the E-nose can easily discriminate between yellow bananas with green necks and entirely yellow bananas while allowing the bananas to maintain their biological activities in their normal storage state,thereby showing the possibility of real-time shelf time detection.This portable FBAR-based E-nose has a large testing scale,high sensitivity,good humidity tolerance,and low frequency drift to its surroundings,thereby meeting the needs of cold-chain usage.展开更多
Real-time health data monitoring is pivotal for bolstering road services’safety,intelligence,and efficiency within the Internet of Health Things(IoHT)framework.Yet,delays in data retrieval can markedly hinder the eff...Real-time health data monitoring is pivotal for bolstering road services’safety,intelligence,and efficiency within the Internet of Health Things(IoHT)framework.Yet,delays in data retrieval can markedly hinder the efficacy of big data awareness detection systems.We advocate for a collaborative caching approach involving edge devices and cloud networks to combat this.This strategy is devised to streamline the data retrieval path,subsequently diminishing network strain.Crafting an adept cache processing scheme poses its own set of challenges,especially given the transient nature of monitoring data and the imperative for swift data transmission,intertwined with resource allocation tactics.This paper unveils a novel mobile healthcare solution that harnesses the power of our collaborative caching approach,facilitating nuanced health monitoring via edge devices.The system capitalizes on cloud computing for intricate health data analytics,especially in pinpointing health anomalies.Given the dynamic locational shifts and possible connection disruptions,we have architected a hierarchical detection system,particularly during crises.This system caches data efficiently and incorporates a detection utility to assess data freshness and potential lag in response times.Furthermore,we introduce the Cache-Assisted Real-Time Detection(CARD)model,crafted to optimize utility.Addressing the inherent complexity of the NP-hard CARD model,we have championed a greedy algorithm as a solution.Simulations reveal that our collaborative caching technique markedly elevates the Cache Hit Ratio(CHR)and data freshness,outshining its contemporaneous benchmark algorithms.The empirical results underscore the strength and efficiency of our innovative IoHT-based health monitoring solution.To encapsulate,this paper tackles the nuances of real-time health data monitoring in the IoHT landscape,presenting a joint edge-cloud caching strategy paired with a hierarchical detection system.Our methodology yields enhanced cache efficiency and data freshness.The corroborative numerical data accentuates the feasibility and relevance of our model,casting a beacon for the future trajectory of real-time health data monitoring systems.展开更多
基金funded by Anhui Provincial Natural Science Foundation(No.2208085ME128)the Anhui University-Level Special Project of Anhui University of Science and Technology(No.XCZX2021-01)+1 种基金the Research and the Development Fund of the Institute of Environmental Friendly Materials and Occupational Health,Anhui University of Science and Technology(No.ALW2022YF06)Anhui Province New Era Education Quality Project(Graduate Education)(No.2022xscx073).
文摘The real-time detection and instance segmentation of strawberries constitute fundamental components in the development of strawberry harvesting robots.Real-time identification of strawberries in an unstructured envi-ronment is a challenging task.Current instance segmentation algorithms for strawberries suffer from issues such as poor real-time performance and low accuracy.To this end,the present study proposes an Efficient YOLACT(E-YOLACT)algorithm for strawberry detection and segmentation based on the YOLACT framework.The key enhancements of the E-YOLACT encompass the development of a lightweight attention mechanism,pyramid squeeze shuffle attention(PSSA),for efficient feature extraction.Additionally,an attention-guided context-feature pyramid network(AC-FPN)is employed instead of FPN to optimize the architecture’s performance.Furthermore,a feature-enhanced model(FEM)is introduced to enhance the prediction head’s capabilities,while efficient fast non-maximum suppression(EF-NMS)is devised to improve non-maximum suppression.The experimental results demonstrate that the E-YOLACT achieves a Box-mAP and Mask-mAP of 77.9 and 76.6,respectively,on the custom dataset.Moreover,it exhibits an impressive category accuracy of 93.5%.Notably,the E-YOLACT also demonstrates a remarkable real-time detection capability with a speed of 34.8 FPS.The method proposed in this article presents an efficient approach for the vision system of a strawberry-picking robot.
基金the Six Heights of Talent in Jiangsu Prov-ince(No.06-E-044).
文摘The paper puts forward a variance-time plots method based on slide-window mechanism tocalculate the Hurst parameter to detect Distribute Denial of Service(DDoS)attack in real time.Basedon fuzzy logic technology that can adjust itself dynamically under the fuzzy rules,an intelligent DDoSjudgment mechanism is designed.This new method calculates the Hurst parameter quickly and detectsDDoS attack in real time.Through comparing the detecting technologies based on statistics andfeature-packet respectively under different experiments,it is found that the new method can identifythe change of the Hurst parameter resulting from DDoS attack traffic with different intensities,andintelligently judge DDoS attack self-adaptively in real time.
文摘Lightning is a typical example of an instantaneous random point source target. It has close connection with severe convective phenomena such as a thunderstorm, whose distribution, variation, position and forecasting can be acquired through lightning observation. In this paper, we discuss the way to achieve instantaneous lightning signal intensification and detection from geostationary orbit by using the differences between the lightning signal and the slowly changing background noise such as that of cloud, land and ocean, combining three methods, spectral filtering, spatial filtering and background noise, enabling removal between frames. After six months of operation in orbit, lightning within the coverage of the Geostationary Lightning Imager was effectively detected, strongly supporting the case for shorttime and real-time early warning, forecasting and tracking of severe convective phenomena in China.
文摘This paper presents a vision-based crack detection approach for concrete bridge decks using an integrated one-dimensional convolutional neural network(1D-CNN)and long short-term memory(LSTM)method in the image frequency domain.The so-called 1D-CNN-LSTM algorithm is trained using thousands of images of cracked and non-cracked concrete bridge decks.In order to improve the training efficiency,images are first transformed into the frequency domain during a preprocessing phase.The algorithm is then calibrated using the flattened frequency data.LSTM is used to improve the performance of the developed network for long sequence data.The accuracy of the developed model is 99.05%,98.9%,and 99.25%,respectively,for training,validation,and testing data.An implementation framework is further developed for future application of the trained model for large-scale images.The proposed 1D-CNN-LSTM method exhibits superior performance in comparison with existing deep learning methods in terms of accuracy and computation time.The fast implementation of the 1D-CNN-LSTM algorithm makes it a promising tool for real-time crack detection.
基金supported by the National Natural Science Foundation of China (11674048)the Fundamental Research Funds for the Central Universities (N160502002)Liaoning BaiQianWan Talents Program (2014921017)
文摘Implantable bioelectronics for analyzing physiological biomarkers has recently been recognized as a promising technique in medical treatment or diagnostics. In this study, we developed a self-powered implantable skinlike glucometer for real-time detection of blood glucose level in vivo. Based on the piezo-enzymatic-reaction coupling effect of GOx@ZnO nanowire, the device under an applied deformation can actively output piezoelectric signal containing the glucose-detecting information. No external electricity power source or battery is needed for this device, and the outputting piezoelectric voltage acts as both the biosensing signal and electricity power. A practical application of the skin-like glucometer implanted in mouse body for detecting blood glucose level has been simply demonstrated. These results provide a new technique path for diabetes prophylaxis and treatment.
文摘Cervical cancer remains a critically important problem for women, especially those women in the developing world where the case-fatality rate is high. There are an estimated 528,000 cases and 266,000 deaths worldwide. Established screening and detection programs in the developed world have lowered the mortality from 40/100,000 to 2/100,000 over the last 60 years. The standard of care has been and continues to be: a screening Papanicolaou smear with or without Human Papilloma Virus (HPV) testing;followed by colposcopy and biopsies and if the smear is abnormal;and followed by treatment if the biopsies show high grade disease (cervical intraepithelial neoplasia (CIN) grades 2 and 3 and Carcinoma-in-situ). Low grade lesions (Pap smears with Atypical Cells of Uncertain Significance (ASCUS), Low Grade Squamous Intraepithelial Lesions (LGSIL), biopsies showing HPV changes or showing CIN 1);are usually followed for two years and then treated if persistent. Treatment can be performed with loop excision, LASER, or cryotherapy. Loop excision yields a specimen which can be reviewed to establish the diagnosis more accurately. LASER vaporizes the lesion and cryotherapy leads to tissue destruction. Under long term study;loop excision, LASER, and cryotherapy have the same rate of cure. The standard of care is expensive and takes 6 - 12 weeks for the individual patient. During the last twenty years, new technologies that can view the cervix and even image the cervix with cellular resolution have been developed. These technologies could lead to a new paradigm in which diagnosis and treatment occurs at a single visit. These technologies include fluorescence and reflectance spectroscopy (probe or wide-field, whole cervix scanning approaches) and fluorescence confocal endomicroscopy or high resolution micro-endoscopy. Both technologies have received Federal Drug Administration (FDA) and have been commercialized. Research trials continue to show their remarkable performance. These technologies are reviewed and clinical trials are summarized. Emerging technologies are coming along that may compete with those already approved and include optical coherence tomography, optical coherence tomography with autofluorescence, diffuse optical microscopy, and dual mode micro-endoscopy. These technologies are also reviewed and where available, clinical data is reported. Optical technologies are ready to diffuse into clinical practice because they will save money and 3 or 4 visits in the developed world and offer the same standard of care to the developing world where more cervical cancer exists.
文摘An approach to detection of moving objects in video sequences, with application to video surveillance is presented. The algorithm combines two kinds of change points, which are detected from the region-based frame difference and adjusted background subtraction. An adaptive threshold technique is employed to automatically choose the threshold value to segment the moving objects from the still background. And experiment results show that the algorithm is effective and efficient in practical situations. Furthermore, the algorithm is robust to the effects of the changing of lighting condition and can be applied for video surveillance system.
基金supported by the National Key Research and Development Program of China(2019YFD1002401)the National Natural Science Foundation of China(31701326).
文摘For the purpose of monitoring apple fruits effectively throughout the entire growth period in smart orchards.A lightweight model named YOLOv8n-ShuffleNetv2-Ghost-SE was proposed.The ShuffleNetv2 basic modules and down-sampling modules were alternately connected,replacing the Backbone of YOLOv8n model.The Ghost modules replaced the Conv modules and the C2fGhost modules replaced the C2f modules in the Neck part of the YOLOv8n.ShuffleNetv2 reduced the memory access cost through channel splitting operations.The Ghost module combined linear and non-linear convolutions to reduce the network computation cost.The Wise-IoU(WIoU)replaced the CIoU for calculating the bounding box regression loss,which dynamically adjusted the anchor box quality threshold and gradient gain allocation strategy,optimizing the size and position of predicted bounding boxes.The Squeeze-and-Excitation(SE)was embedded in the Backbone and Neck part of YOLOv8n to enhance the representation ability of feature maps.The algorithm ensured high precision while having small model size and fast detection speed,which facilitated model migration and deployment.Using 9652 images validated the effectiveness of the model.The YOLOv8n-ShuffleNetv2-Ghost-SE model achieved Precision of 94.1%,Recall of 82.6%,mean Average Precision of 91.4%,model size of 2.6 MB,parameters of 1.18 M,FLOPs of 3.9 G,and detection speed of 39.37 fps.The detection speeds on the Jetson Xavier NX development board were 3.17 fps.Comparisons with advanced models including Faster R-CNN,SSD,YOLOv5s,YOLOv7‑tiny,YOLOv8s,YOLOv8n,MobileNetv3_small-Faster,MobileNetv3_small-Ghost,ShuflleNetv2-Faster,ShuflleNetv2-Ghost,ShuflleNetv2-Ghost-CBAM,ShuflleNetv2-Ghost-ECA,and ShuflleNetv2-Ghost-CA demonstrated that the method achieved smaller model and faster detection speed.The research can provide reference for the development of smart devices in apple orchards.
文摘Active noise controls are used in a wide field of applications to cancel out unwanted surrounding noise. Control systems based on the feedback structure however have the disadvantage that they may become unstable during run-time due to changes in the control path—in this context including the listener’s ear. Especially when applied to active noise cancellation (ANC) headphones, the risk of instability is associated with the risk of harmful influence on the listener’s ear, which is exposed to the speaker in striking distance. This paper discusses several methods to enable the analysis of a feedback ANC system during run-time to immediately detect instability. Finally, a solution is proposed, which identifies the open loop behavior parametrically by means of an adaptive filter to subsequently evaluate the coefficients regarding stability.
文摘In this paper, we proposed a new concept: depth of drowsiness, which can more precisely describe the drowsiness than existing binary description. A set of effective markers for drowsiness: normalized band norm was successfully developed. These markers are invariant from voltage amplitude of brain waves, eliminating the need for calibrating the voltage output of the brain-computer interface devices. A new polling algorithm was designed and implemented for computing the depth of drowsiness. The time cost of data acquisition and processing for each estimate is about one second, which is well suited for real-time applications. Test results with a portable brain-computer interface device show that the depth of drowsiness computed by the method in this paper is generally invariant from ages of test subjects and sensor channels (P3 and C4). The comparison between experiment and computing results indicate that the new method is noticeably better than one of the recent methods in terms of accuracy for predicting the drowsiness.
基金the financial support from the Science and Technology Innovation Leading Talent Project of Special Support Plan For High-Level Talents of Zhejiang Province(Grant No.201R52056)Zhejiang Provincial Natural Science Foundation(Grant Nos.LZ23E080001,LR19E060001)+1 种基金Major Science and Technology Projects of Jinhua City(Grant No.2022-1-014)Jiangsu Province International Cooperative Research Project(Grant No.BZ2021008).
文摘The moisture content(MC)of municipal sludge is the key factor affecting sludge treatment and disposal technologies,while the vast majority of existing measurement methods are off-line and time-consuming.To realize rapid online detection for the MC of sludge,a detection method based on the microwave reflection principle is proposed:experiments are carried out and the MC computation model of the sludge is derived using the resonant frequency and the permittivity(e').The results reveal that the detection accuracy of granular sludge with a thickness of 10 mm is higher.The theoretical model between the MC and the real part of e'is developed,and the relationship between the resonant frequency and e'is expressed by a cubic polynomial.The average error and the root mean square error(RMSE)of sludge are 2.06%and 2.49%,respectively.The prediction model for the MC of sludge is also given,and the determination coefficient and RMSE are 0.981 and 2.06%,respectively.
基金supported by theKorea Industrial Technology Association(KOITA)Grant Funded by the Korean government(MSIT)(No.KOITA-2023-3-003)supported by the MSIT(Ministry of Science and ICT),Korea,under the ITRC(Information Technology Research Center)Support Program(IITP-2024-2020-0-01808)Supervised by the IITP(Institute of Information&Communications Technology Planning&Evaluation)。
文摘The advancement of navigation systems for the visually impaired has significantly enhanced their mobility by mitigating the risk of encountering obstacles and guiding them along safe,navigable routes.Traditional approaches primarily focus on broad applications such as wayfinding,obstacle detection,and fall prevention.However,there is a notable discrepancy in applying these technologies to more specific scenarios,like identifying distinct food crop types or recognizing faces.This study proposes a real-time application designed for visually impaired individuals,aiming to bridge this research-application gap.It introduces a system capable of detecting 20 different food crop types and recognizing faces with impressive accuracies of 83.27%and 95.64%,respectively.These results represent a significant contribution to the field of assistive technologies,providing visually impaired users with detailed and relevant information about their surroundings,thereby enhancing their mobility and ensuring their safety.Additionally,it addresses the vital aspects of social engagements,acknowledging the challenges faced by visually impaired individuals in recognizing acquaintances without auditory or tactile signals,and highlights recent developments in prototype systems aimed at assisting with face recognition tasks.This comprehensive approach not only promises enhanced navigational aids but also aims to enrich the social well-being and safety of visually impaired communities.
基金funded by the General Project of Key Research and Develop-ment Plan of Shaanxi Province(No.2022NY-087).
文摘To address the challenges of high complexity,poor real-time performance,and low detection rates for small target vehicles in existing vehicle object detection algorithms,this paper proposes a real-time lightweight architecture based on You Only Look Once(YOLO)v5m.Firstly,a lightweight upsampling operator called Content-Aware Reassembly of Features(CARAFE)is introduced in the feature fusion layer of the network to maximize the extraction of deep-level features for small target vehicles,reducing the missed detection rate and false detection rate.Secondly,a new prediction layer for tiny targets is added,and the feature fusion network is redesigned to enhance the detection capability for small targets.Finally,this paper applies L1 regularization to train the improved network,followed by pruning and fine-tuning operations to remove redundant channels,reducing computational and parameter complexity and enhancing the detection efficiency of the network.Training is conducted on the VisDrone2019-DET dataset.The experimental results show that the proposed algorithmreduces parameters and computation by 63.8% and 65.8%,respectively.The average detection accuracy improves by 5.15%,and the detection speed reaches 47 images per second,satisfying real-time requirements.Compared with existing approaches,including YOLOv5m and classical vehicle detection algorithms,our method achieves higher accuracy and faster speed for real-time detection of small target vehicles in edge computing.
基金National Natural Science Foundation of China(No.42271416)Guangxi Science and Technology Major Project(No.AA22068072)Shennongjia National Park Resources Comprehensive Investigation Research Project(No.SNJNP2023015).
文摘Timely acquisition of rescue target information is critical for emergency response after a flood disaster.Unmanned Aerial Vehicles(UAVs)equipped with remote sensing capabilities offer distinct advantages,including high-resolution imagery and exceptional mobility,making them well suited for monitoring flood extent and identifying rescue targets during floods.However,there are some challenges in interpreting rescue information in real time from flood images captured by UAVs,such as the complexity of the scenarios of UAV images,the lack of flood rescue target detection datasets and the limited real-time processing capabilities of the airborne on-board platform.Thus,we propose a real-time rescue target detection method for UAVs that is capable of efficiently delineating flood extent and identifying rescue targets(i.e.,pedestrians and vehicles trapped by floods).The proposed method achieves real-time rescue information extraction for UAV platforms by lightweight processing and fusion of flood extent extraction model and target detection model.The flood inundation range is extracted by the proposed method in real time and detects targets such as people and vehicles to be rescued based on this layer.Our experimental results demonstrate that the Intersection over Union(IoU)for flood water extraction reaches an impressive 80%,and the IoU for real-time flood water extraction stands at a commendable 76.4%.The information on flood stricken targets extracted by this method in real time can be used for flood emergency rescue.
文摘Real-time indoor camera localization is a significant problem in indoor robot navigation and surveillance systems.The scene can change during the image sequence and plays a vital role in the localization performance of robotic applications in terms of accuracy and speed.This research proposed a real-time indoor camera localization system based on a recurrent neural network that detects scene change during the image sequence.An annotated image dataset trains the proposed system and predicts the camera pose in real-time.The system mainly improved the localization performance of indoor cameras by more accurately predicting the camera pose.It also recognizes the scene changes during the sequence and evaluates the effects of these changes.This system achieved high accuracy and real-time performance.The scene change detection process was performed using visual rhythm and the proposed recurrent deep architecture,which performed camera pose prediction and scene change impact evaluation.Overall,this study proposed a novel real-time localization system for indoor cameras that detects scene changes and shows how they affect localization performance.
基金funded by the National Key R&D Program(Grant No.2022YFD2002305)Beijing Nova Program(Grant No.Z211100002121065,20220484202)Collaborative Innovation Center of Beijing Academy of Agricultural and Forestry Sciences(Grant No.KJCX201917).
文摘In order to address the challenge of non-destructive detection of tomato fruit ripeness in controlled environments,this study proposed a real-time instance segmentation method based on the edge device.This method combined the principles of phenotype robots and machine vision based on deep learning.A compact and remotely controllable phenotype detection robot was employed to acquire precise data on tomato ripeness.The video data were then processed by using an efficient backbone and the FeatFlowNet structure for feature extraction and analysis of key-frame to non-key-frame mapping from video data.To enhance the diversity of training datasets and the generalization of the model,an innovative approach was chosen by using random enhancement techniques.Besides,the PolyLoss optimization technique was applied to further improve the accuracy of the ripeness multi-class detection tasks.Through validation,the method of this study achieved real-time processing speeds of 90.1 fps(RTX 3070Ti)and 65.5 fps(RTX 2060 S),with an average detection accuracy of 97%compared to manually measured results.This is more accurate and efficient than other instance segmentation models according to actual testing in a greenhouse.Therefore,the results of this research can be deployed in edge devices and provide technical support for unmanned greenhouse monitoring devices or fruit-picking robots in facility environments.
文摘Aiming at the problem of low accuracy of traditional target detection methods for target detection in endoscopes in substation environments, a CNN-based real-time detection method for masked targets is proposed. The method adopts the overall design of backbone network, detection network and algorithmic parameter optimisation method, completes the model training on the self-constructed occlusion target dataset, and adopts the multi-scale perception method for target detection. The HNM algorithm is used to screen positive and negative samples during the training process, and the NMS algorithm is used to post-process the prediction results during the detection process to improve the detection efficiency. After experimental validation, the obtained model has the multi-class average predicted value (mAP) of the dataset. It has general advantages over traditional target detection methods. The detection time of a single target on FDDB dataset is 39 ms, which can meet the need of real-time target detection. In addition, the project team has successfully deployed the method into substations and put it into use in many places in Beijing, which is important for achieving the anomaly of occlusion target detection.
文摘Credit card fraud remains a significant challenge, with financial losses and consumer protection at stake. This study addresses the need for practical, real-time fraud detection methodologies. Using a Kaggle credit card dataset, I tackle class imbalance using the Synthetic Minority Oversampling Technique (SMOTE) to enhance modeling efficiency. I compare several machine learning algorithms, including Logistic Regression, Linear Discriminant Analysis, K-nearest Neighbors, Classification and Regression Tree, Naive Bayes, Support Vector, Random Forest, XGBoost, and Light Gradient-Boosting Machine to classify transactions as fraud or genuine. Rigorous evaluation metrics, such as AUC, PRAUC, F1, KS, Recall, and Precision, identify the Random Forest as the best performer in detecting fraudulent activities. The Random Forest model successfully identifies approximately 92% of transactions scoring 90 and above as fraudulent, equating to a detection rate of over 70% for all fraudulent transactions in the test dataset. Moreover, the model captures more than half of the fraud in each bin of the test dataset. SHAP values provide model explainability, with the SHAP summary plot highlighting the global importance of individual features, such as “V12” and “V14”. SHAP force plots offer local interpretability, revealing the impact of specific features on individual predictions. This study demonstrates the potential of machine learning, particularly the Random Forest model, for real-time credit card fraud detection, offering a promising approach to mitigate financial losses and protect consumers.
基金supported financially by the National Natural Science Foundation of China (Grant Nos.22078051 and U1801258)the Fundamental Research Funds for the Central Universities (Grant No.DUT22LAB610).
文摘Being cheap,nondestructive,and easy to use,gas sensors play important roles in the food industry.However,most gas sensors are suitable more for laboratory-quality fast testing rather than for cold-chain continuous and cumulative testing.Also,an ideal electronic nose(E-nose)in a cold chain should be stable to its surroundings and remain highly accurate and portable.In this work,a portable film bulk acoustic resonator(FBAR)-based E-nose was built for real-time measurement of banana shelf time.The sensor chamber to contain the portable circuit of the E-nose is as small as a smartphone,and by introducing an air-tight FBAR as a reference,the E-nose can avoid most of the drift caused by surroundings.With the help of porous layer by layer(LBL)coating of the FBAR,the sensitivity of the E-nose is 5 ppm to ethylene and 0.5 ppm to isoamyl acetate and isoamyl butyrate,while the detection range is large enough to cover a relative humidity of 0.8.In this regard,the E-nose can easily discriminate between yellow bananas with green necks and entirely yellow bananas while allowing the bananas to maintain their biological activities in their normal storage state,thereby showing the possibility of real-time shelf time detection.This portable FBAR-based E-nose has a large testing scale,high sensitivity,good humidity tolerance,and low frequency drift to its surroundings,thereby meeting the needs of cold-chain usage.
基金supported by National Natural Science Foundation of China(NSFC)under Grant Number T2350710232.
文摘Real-time health data monitoring is pivotal for bolstering road services’safety,intelligence,and efficiency within the Internet of Health Things(IoHT)framework.Yet,delays in data retrieval can markedly hinder the efficacy of big data awareness detection systems.We advocate for a collaborative caching approach involving edge devices and cloud networks to combat this.This strategy is devised to streamline the data retrieval path,subsequently diminishing network strain.Crafting an adept cache processing scheme poses its own set of challenges,especially given the transient nature of monitoring data and the imperative for swift data transmission,intertwined with resource allocation tactics.This paper unveils a novel mobile healthcare solution that harnesses the power of our collaborative caching approach,facilitating nuanced health monitoring via edge devices.The system capitalizes on cloud computing for intricate health data analytics,especially in pinpointing health anomalies.Given the dynamic locational shifts and possible connection disruptions,we have architected a hierarchical detection system,particularly during crises.This system caches data efficiently and incorporates a detection utility to assess data freshness and potential lag in response times.Furthermore,we introduce the Cache-Assisted Real-Time Detection(CARD)model,crafted to optimize utility.Addressing the inherent complexity of the NP-hard CARD model,we have championed a greedy algorithm as a solution.Simulations reveal that our collaborative caching technique markedly elevates the Cache Hit Ratio(CHR)and data freshness,outshining its contemporaneous benchmark algorithms.The empirical results underscore the strength and efficiency of our innovative IoHT-based health monitoring solution.To encapsulate,this paper tackles the nuances of real-time health data monitoring in the IoHT landscape,presenting a joint edge-cloud caching strategy paired with a hierarchical detection system.Our methodology yields enhanced cache efficiency and data freshness.The corroborative numerical data accentuates the feasibility and relevance of our model,casting a beacon for the future trajectory of real-time health data monitoring systems.