Road traffic monitoring is an imperative topic widely discussed among researchers.Systems used to monitor traffic frequently rely on cameras mounted on bridges or roadsides.However,aerial images provide the flexibilit...Road traffic monitoring is an imperative topic widely discussed among researchers.Systems used to monitor traffic frequently rely on cameras mounted on bridges or roadsides.However,aerial images provide the flexibility to use mobile platforms to detect the location and motion of the vehicle over a larger area.To this end,different models have shown the ability to recognize and track vehicles.However,these methods are not mature enough to produce accurate results in complex road scenes.Therefore,this paper presents an algorithm that combines state-of-the-art techniques for identifying and tracking vehicles in conjunction with image bursts.The extracted frames were converted to grayscale,followed by the application of a georeferencing algorithm to embed coordinate information into the images.The masking technique eliminated irrelevant data and reduced the computational cost of the overall monitoring system.Next,Sobel edge detection combined with Canny edge detection and Hough line transform has been applied for noise reduction.After preprocessing,the blob detection algorithm helped detect the vehicles.Vehicles of varying sizes have been detected by implementing a dynamic thresholding scheme.Detection was done on the first image of every burst.Then,to track vehicles,the model of each vehicle was made to find its matches in the succeeding images using the template matching algorithm.To further improve the tracking accuracy by incorporating motion information,Scale Invariant Feature Transform(SIFT)features have been used to find the best possible match among multiple matches.An accuracy rate of 87%for detection and 80%accuracy for tracking in the A1 Motorway Netherland dataset has been achieved.For the Vehicle Aerial Imaging from Drone(VAID)dataset,an accuracy rate of 86%for detection and 78%accuracy for tracking has been achieved.展开更多
Object detection in unmanned aerial vehicle(UAV)aerial images has become increasingly important in military and civil applications.General object detection models are not robust enough against interclass similarity an...Object detection in unmanned aerial vehicle(UAV)aerial images has become increasingly important in military and civil applications.General object detection models are not robust enough against interclass similarity and intraclass variability of small objects,and UAV-specific nuisances such as uncontrolledweather conditions.Unlike previous approaches focusing on high-level semantic information,we report the importance of underlying features to improve detection accuracy and robustness fromthe information-theoretic perspective.Specifically,we propose a robust and discriminative feature learning approach through mutual information maximization(RD-MIM),which can be integrated into numerous object detection methods for aerial images.Firstly,we present the rank sample mining method to reduce underlying feature differences between the natural image domain and the aerial image domain.Then,we design a momentum contrast learning strategy to make object features similar to the same category and dissimilar to different categories.Finally,we construct a transformer-based global attention mechanism to boost object location semantics by leveraging the high interrelation of different receptive fields.We conduct extensive experiments on the VisDrone and Unmanned Aerial Vehicle Benchmark Object Detection and Tracking(UAVDT)datasets to prove the effectiveness of the proposed method.The experimental results show that our approach brings considerable robustness gains to basic detectors and advanced detection methods,achieving relative growth rates of 51.0%and 39.4%in corruption robustness,respectively.Our code is available at https://github.com/cq100/RD-MIM(accessed on 2 August 2024).展开更多
There are two types of methods for image segmentation.One is traditional image processing methods,which are sensitive to details and boundaries,yet fail to recognize semantic information.The other is deep learning met...There are two types of methods for image segmentation.One is traditional image processing methods,which are sensitive to details and boundaries,yet fail to recognize semantic information.The other is deep learning methods,which can locate and identify different objects,but boundary identifications are not accurate enough.Both of them cannot generate entire segmentation information.In order to obtain accurate edge detection and semantic information,an Adaptive Boundary and Semantic Composite Segmentation method(ABSCS)is proposed.This method can precisely semantic segment individual objects in large-size aerial images with limited GPU performances.It includes adaptively dividing and modifying the aerial images with the proposed principles and methods,using the deep learning method to semantic segment and preprocess the small divided pieces,using three traditional methods to segment and preprocess original-size aerial images,adaptively selecting traditional results tomodify the boundaries of individual objects in deep learning results,and combining the results of different objects.Individual object semantic segmentation experiments are conducted by using the AeroScapes dataset,and their results are analyzed qualitatively and quantitatively.The experimental results demonstrate that the proposed method can achieve more promising object boundaries than the original deep learning method.This work also demonstrates the advantages of the proposed method in applications of point cloud semantic segmentation and image inpainting.展开更多
In this paper,based on a bidirectional parallel multi-branch feature pyramid network(BPMFPN),a novel one-stage object detector called BPMFPN Det is proposed for real-time detection of ground multi-scale targets by swa...In this paper,based on a bidirectional parallel multi-branch feature pyramid network(BPMFPN),a novel one-stage object detector called BPMFPN Det is proposed for real-time detection of ground multi-scale targets by swarm unmanned aerial vehicles(UAVs).First,the bidirectional parallel multi-branch convolution modules are used to construct the feature pyramid to enhance the feature expression abilities of different scale feature layers.Next,the feature pyramid is integrated into the single-stage object detection framework to ensure real-time performance.In order to validate the effectiveness of the proposed algorithm,experiments are conducted on four datasets.For the PASCAL VOC dataset,the proposed algorithm achieves the mean average precision(mAP)of 85.4 on the VOC 2007 test set.With regard to the detection in optical remote sensing(DIOR)dataset,the proposed algorithm achieves 73.9 mAP.For vehicle detection in aerial imagery(VEDAI)dataset,the detection accuracy of small land vehicle(slv)targets reaches 97.4 mAP.For unmanned aerial vehicle detection and tracking(UAVDT)dataset,the proposed BPMFPN Det achieves the mAP of 48.75.Compared with the previous state-of-the-art methods,the results obtained by the proposed algorithm are more competitive.The experimental results demonstrate that the proposed algorithm can effectively solve the problem of real-time detection of ground multi-scale targets in aerial images of swarm UAVs.展开更多
Because of quick development of cities, the update of urban GIS data is very important. Change detection is the base of automatic or semi-automatic data update. One way of change detections in urban area is based on o...Because of quick development of cities, the update of urban GIS data is very important. Change detection is the base of automatic or semi-automatic data update. One way of change detections in urban area is based on old and new aerial images acquired in different durations. The corresponding theory and experiments are introduced and analyzed in this paper. The main procedure includes four stages. The new and old images have to be registered firstly. Then image matching, based on the maximum correlation coefficient, is performed between registered images after the low contrast areas have been removed. The regions with low matching quality are extracted as candidate changed areas. Thirdly, the Gaussian-Laplacian operator is used to detect edges in candidate changed areas on both the registered images, and the straight lines are detected by Hough transformation. Finally, the changed houses and roads can be detected on the basis of straight line matching in candidate changed areas between registered images. Some experimental results show that the method introduced in this paper is effective.展开更多
This paper develops a method which can be used to assist aerial navigation by determining the spatial position and posture of the aerial photographic plane. After the method, aerial images match known DEM to capture t...This paper develops a method which can be used to assist aerial navigation by determining the spatial position and posture of the aerial photographic plane. After the method, aerial images match known DEM to capture the spatial position and posture. Some aerial images and terrain data are used to testify our method. Compared with those of analytic and stereo mappers, the results by our method are correspondent to real measurements well.展开更多
Desertification has become a global threat and caused a crisis,especially in Middle Eastern countries,such as Saudi Arabia.Makkah is one of the most important cities in Saudi Arabia that needs to be protected from des...Desertification has become a global threat and caused a crisis,especially in Middle Eastern countries,such as Saudi Arabia.Makkah is one of the most important cities in Saudi Arabia that needs to be protected from desertification.The vegetation area in Makkah has been damaged because of desertification through wind,floods,overgrazing,and global climate change.The damage caused by desertification can be recovered provided urgent action is taken to prevent further degradation of the vegetation area.In this paper,we propose an automatic desertification detection system based on Deep Learning techniques.Aerial images are classified using Convolutional Neural Networks(CNN)to detect land state variation in real-time.CNNs have been widely used for computer vision applications,such as image classification,image segmentation,and quality enhancement.The proposed CNN model was trained and evaluated on the Arial Image Dataset(AID).Compared to state-of-the-art methods,the proposed model has better performance while being suitable for embedded implementation.It has achieved high efficiency with 96.47% accuracy.In light of the current research,we assert the appropriateness of the proposed CNN model in detecting desertification from aerial images.展开更多
Three-dimensional(3D)reconstruction based on aerial images has broad prospects,and feature matching is an important step of it.However,for high-resolution aerial images,there are usually problems such as long time,mis...Three-dimensional(3D)reconstruction based on aerial images has broad prospects,and feature matching is an important step of it.However,for high-resolution aerial images,there are usually problems such as long time,mismatching and sparse feature pairs using traditional algorithms.Therefore,an algorithm is proposed to realize fast,accurate and dense feature matching.The algorithm consists of four steps.Firstly,we achieve a balance between the feature matching time and the number of matching pairs by appropriately reducing the image resolution.Secondly,to realize further screening of the mismatches,a feature screening algorithm based on similarity judgment or local optimization is proposed.Thirdly,to make the algorithm more widely applicable,we combine the results of different algorithms to get dense results.Finally,all matching feature pairs in the low-resolution images are restored to the original images.Comparisons between the original algorithms and our algorithm show that the proposed algorithm can effectively reduce the matching time,screen out the mismatches,and improve the number of matches.展开更多
A method for shadow detection and compensation for color aerial images is presented. It is considered that the intensity value of each image pixel is the product of illumination function and ground object reflection, ...A method for shadow detection and compensation for color aerial images is presented. It is considered that the intensity value of each image pixel is the product of illumination function and ground object reflection, and the shadowed regions on the image are mainly caused by the short of illumination, so the information compensation for the shadowed regions should concentrate on the illumination adjustment of concerned area on the basis of the analysis of whole image. The shadow detection and compensation procedure proposed by this paper consists of four steps.展开更多
Sugarcane planting is an important and growing activity in Brazil.Thereupon,several techniques have been developed over the years to maximize crop productivity and profit,amongst them,processing of sugarcane field ima...Sugarcane planting is an important and growing activity in Brazil.Thereupon,several techniques have been developed over the years to maximize crop productivity and profit,amongst them,processing of sugarcane field images.In this sense,this research aims to identify and analyze crop rows and measure their gaps from aerial images of sugarcane fields.For this,a small Remotely Piloted Aircraft captured the images,generating orthomosaics of the areas for analysis.Then,each orthomosaic is classified with the K-Nearest Neighbor algorithm to segment regions of interest.Planting row orientation is estimated using the RGB gradient filter.Morphological operations and computational geometry models are then used to detect and map rows and gaps along the planting row segment.To evaluate the results,crop rows are mapped and compared to manually taken measurements.Our technique obtained an error smaller than 2%when compared to gap length in crop rows from an orthomosaic with the area of 8.05 ha(ha).The proposed approach can map the positioning of the automatically generated row segments appropriately onto manually created segments.Moreover,our method also achieved similar results when confronted with a manual technique for differing growth stages(40 and 80 days after harvest)of the sugarcane crop.The proposed method presents a great potential to be adopted in sugarcane planting monitoring。展开更多
The existing UAV aerial image de-fog methods have low image contrast after de-fog,the difference between light and dark image is not obvious,leading to poor de-fog effect.Therefore,an aerial image de-fog enhancement m...The existing UAV aerial image de-fog methods have low image contrast after de-fog,the difference between light and dark image is not obvious,leading to poor de-fog effect.Therefore,an aerial image de-fog enhancement method based on dark channel a priori is proposed.The image variance and absolute gradient mean are combined to get the weight coefficients,and the edge pixels are smoothed by using the multiple decomposition form.The image intensity is calculated and the noise is reduced.A convolution neural network is introduced to calculate the atmospheric transmittance in haze.Based on this,dark channel prior algorithm is used to enhance the light and shade difference of aerial photography image and realise the de-fog enhancement of aerial photography image.To verify the performance of the proposed method,simulation experiments are designed which were compared with the existing methods results in better fog-removing effect,higher contrast and shorter time.展开更多
This paper proposes a new deep learning framework for the location of broken insulators(in particular the self-blast glass insulator)in aerial images.We address the broken insulators location problem in a low signal-n...This paper proposes a new deep learning framework for the location of broken insulators(in particular the self-blast glass insulator)in aerial images.We address the broken insulators location problem in a low signal-noise-ratio(SNR)setting.We deal with two modules:1)object detection based on Faster R-CNN,and 2)classification of pixels based on U-net.For the first time,our paper combines the above two modules.This combination is motivated as follows:Faster R-CNN is used to improve SNR,while the U-net is used for classification of pixels.A diverse aerial image set measured by a power grid in China is tested to validate the proposed approach.Furthermore,a comparison is made among different methods and the result shows that our approach is accurate in real time.展开更多
In aerial images,near-specular and specular reflection often appear in water bodies.They often lead to irregular brightness or color changes in water bodies and even produce hot spots,harmful to radiometric normalizat...In aerial images,near-specular and specular reflection often appear in water bodies.They often lead to irregular brightness or color changes in water bodies and even produce hot spots,harmful to radiometric normalization.Therefore,water bodies must be eliminated when calculating radiometric differences during radiometric normalization of aerial images.In this paper,a simple method to detect water bodies in aerial images based on texture features is presented,an improved seeded region growing(SRG)method.A texture feature is calculated using the relative standard deviation index(RSDI)and a coarse-to-fine procedure is employed.The proposed method includes a multiple partition strategy and a refinement in gradient image that improves the reliability and accuracy of water body detection.By fusing water bodies detected in multiple images,hot spots in these water bodies are also detected.Experiments validate the feasibility and effectiveness of the proposed method.展开更多
Due to improper acquisition settings and other noise artifacts,the image degraded to yield poor mean preservation in brightness.The simplest way to improve the preservation is the implementation of histogram equalizat...Due to improper acquisition settings and other noise artifacts,the image degraded to yield poor mean preservation in brightness.The simplest way to improve the preservation is the implementation of histogram equalization.Because of over-enhancement,it failed to preserve the mean brightness and produce the poor quality of the image.This paper proposes a multi-scale decomposi-tion for brightness preservation using gamma correction.After transformation to hue,saturation and intensity(HSI)channel,the 2D-discrete wavelet transform decomposed the intensity component into low and high-pass coefficients.At the next phase,gamma correction is used by auto-tuning the scale value.The scale is the modified constant value used in the logarithmic function.Further,the scale value is optimized to obtain better visual quality in the image.The optimized value is the weighted distribution of standard deviation-mean of low pass coefficients.Finally,the experimental result is estimated in terms of quality assessment measures used as absolute mean brightness error,the measure of information detail,signal to noise ratio and patch-based contrast quality in the image.By comparison,the proposed method proved to be suitably remarkable in retaining the mean brightness and better visual quality of the image.展开更多
The recent adoption of satellite technologies,unmanned aerial vehicles(UAVs)and 5G has encouraged telecom networking to evolve into more stable service to remote areas and render higher quality.But,security concerns w...The recent adoption of satellite technologies,unmanned aerial vehicles(UAVs)and 5G has encouraged telecom networking to evolve into more stable service to remote areas and render higher quality.But,security concerns with drones were increasing as drone nodes have been striking targets for cyberattacks because of immensely weak inbuilt and growing poor security volumes.This study presents an Archimedes Optimization with Deep Learning based Aerial Image Classification and Intrusion Detection(AODL-AICID)technique in secure UAV networks.The presented AODLAICID technique concentrates on two major processes:image classification and intrusion detection.For aerial image classification,the AODL-AICID technique encompasses MobileNetv2 feature extraction,Archimedes Optimization Algorithm(AOA)based hyperparameter optimizer,and backpropagation neural network(BPNN)based classifier.In addition,the AODLAICID technique employs a stacked bi-directional long short-term memory(SBLSTM)model to accomplish intrusion detection for cybersecurity in UAV networks.At the final stage,the Nadam optimizer is utilized for parameter tuning of the SBLSTM approach.The experimental validation of the AODLAICID technique is tested and the obtained values reported the improved performance of the AODL-AICID technique over other models.展开更多
The use of drones in construction engineering has gained increasing attention in recent years due to its potential to revolutionize the industry. Drones, offer the ability to capture high-resolution aerial imagery and...The use of drones in construction engineering has gained increasing attention in recent years due to its potential to revolutionize the industry. Drones, offer the ability to capture high-resolution aerial imagery and collect data that was previously difficult or impossible to obtain. The integration drones in construction engineering presents opportunities for accurate data collection, analysis and visualization, which can improve decision-making processes and improve project outcomes. For example, drones equipped with GIS technology can be used to capture high-resolution aerial images of construction sites, allowing engineers to monitor progress, identify potential issues, and make informed adjustments as needed. By harnessing drones, civil engineers in the civil engineering field can potentially optimize project planning, design and execution while minimizing risks and costs. The work of this topic examines the case of the use of Drones combined with GIS in construction engineering. During this study, aerial photography of a certain segment of the Pristina-Gjilan Highway was taken. The results generated by the processing of aerial photos have been compared with the project. However, further research is needed to fully understand the capabilities and limitations of these technologies in this specific context, as well as to explore any potential challenges and barriers to their widespread adoption.展开更多
Bridges are an important part of railway infrastructure and need regular inspection and maintenance.Using unmanned aerial vehicle(UAV)technology to inspect railway infrastructure is an active research issue.However,du...Bridges are an important part of railway infrastructure and need regular inspection and maintenance.Using unmanned aerial vehicle(UAV)technology to inspect railway infrastructure is an active research issue.However,due to the large size of UAV images,flight distance,and height changes,the object scale changes dramatically.At the same time,the elements of interest in railway bridges,such as bolts and corrosion,are small and dense objects,and the sample data set is seriously unbalanced,posing great challenges to the accurate detection of defects.In this paper,an adaptive cropping shallow attention network(ACSANet)is proposed,which includes an adaptive cropping strategy for large UAV images and a shallow attention network for small object detection in limited samples.To enhance the accuracy and generalization of the model,the shallow attention network model integrates a coordinate attention(CA)mechanism module and an alpha intersection over union(α-IOU)loss function,and then carries out defect detection on the bolts,steel surfaces,and railings of railway bridges.The test results show that the ACSANet model outperforms the YOLOv5s model using adaptive cropping strategy in terms of the total mAP(an evaluation index)and missing bolt mAP by 5%and 30%,respectively.Also,compared with the YOLOv5s model that adopts the common cropping strategy,the total mAP and missing bolt mAP are improved by 10%and 60%,respectively.Compared with the YOLOv5s model without any cropping strategy,the total mAP and missing bolt mAP are improved by 40%and 67%,respectively.展开更多
This paper investigates the problem of retrieving aerial scene images by using semantic sketches, since the state-of-the-art retrieval systems turn out to be invalid when there is no exemplar query aerial image availa...This paper investigates the problem of retrieving aerial scene images by using semantic sketches, since the state-of-the-art retrieval systems turn out to be invalid when there is no exemplar query aerial image available. However, due to the complex surface structures and huge variations of resolutions of aerial images, it is very challenging to retrieve aerial images with sketches and few studies have been devoted to this task. In this article, for the first time to our knowledge, we propose a framework to bridge the gap between sketches and aerial images. First, an aerial sketch-image database is collected, and the images and sketches it contains are augmented to various levels of details. We then train a multi-scale deep model by the new dataset. The fully-connected layers of the network in each scale are finally connected and used as cross-domain features, and the Euclidean distance is used to measure the cross-domain similarity between aerial images and sketches. Experiments on several commonly used aerial image datasets demonstrate the superiority of the proposed method compared with the traditional approaches.展开更多
Inventory maps of earthquake-triggered landslides can be constructed using several methods,which are often subject to obvious differences due to lack of commonly accepted criteria or principles,To solve this problem,t...Inventory maps of earthquake-triggered landslides can be constructed using several methods,which are often subject to obvious differences due to lack of commonly accepted criteria or principles,To solve this problem,the author describes the principles for preparing inventory maps of earthquake-triggered landslides,focusing on varied methods and their criteria,The principles include the following key points:all landslides should be mapped as long as they can be recognized from images;both the boundary and source area position of landslides should be mapped;spatial distribution pattern of earthquake-triggered landslides should be continuous;complex landslides should be divided into distinct groups;three types of errors such as precision of the location and boundary of landslides,false positive errors,and false negative errors of earthquake-triggered landslide inventories should be controlled and reduced;and inventories of co-seismic landslides should be constructed by the visual interpretation method rather than automatic extraction of satellite images or/and aerial photographs,In addition,selection of remote sensing images and creation of landslides attribute database are also discussed in this paper,Then the author applies these principles to produce inventory maps of four events:the 12 May 2008 Wenchuan,China M_w 7.9,14 April 2010 Yushu,China M_w 6,9,12 January 2010 Haiti M_w7.0,and 2007 Aysen Fjord,Chile M_w 6.2,The results show obvious differences in comparison with previous studies by other researchers,which again attest to the necessity of establishment of unified principles for preparation of inventory maps of earthquake-triggered landslides.展开更多
In this paper, a study and evaluation of the combination of GPS/GNSS techniques and advanced image processing algorithms for distressed human detection, positioning and tracking, from a fully autonomous Unmanned Aeria...In this paper, a study and evaluation of the combination of GPS/GNSS techniques and advanced image processing algorithms for distressed human detection, positioning and tracking, from a fully autonomous Unmanned Aerial Vehicle (UAV)-based rescue support system, </span><span style="font-family:Verdana;">are</span><span style="font-family:Verdana;"> presented. In particular, the issue of human detection both on terrestrial and marine environment under several illumination and background conditions, as the human silhouette in water differs significantly from a terrestrial one</span><span style="font-family:Verdana;">,</span><span style="font-family:Verdana;"> is addressed. A robust approach, including an adaptive distressed human detection algorithm running every N input image frames combined with a much faster tracking algorithm, is proposed. Real time or near-real-time distressed human detection rates achieved, using a single, low cost day/night NIR camera mounted onboard a fully autonomous UAV for Search and Rescue (SAR) operations. Moreover, the generation of our own dataset, for the image processing algorithms training is also presented. Details about both hardware and software configuration as well as the assessment of the proposed approach performance are fully discussed. Last, a comparison of the proposed approach to other human detection methods used in the literature is presented.展开更多
基金supported by a grant from the Basic Science Research Program through the National Research Foundation(NRF)(2021R1F1A1063634)funded by the Ministry of Science and ICT(MSIT),Republic of KoreaThe authors are thankful to the Deanship of Scientific Research at Najran University for funding this work under the Research Group Funding Program Grant Code(NU/RG/SERC/13/40)+2 种基金Also,the authors are thankful to Prince Satam bin Abdulaziz University for supporting this study via funding from Prince Satam bin Abdulaziz University project number(PSAU/2024/R/1445)This work was also supported by Princess Nourah bint Abdulrahman University Researchers Supporting Project Number(PNURSP2023R54)Princess Nourah bint Abdulrahman University,Riyadh,Saudi Arabia.
文摘Road traffic monitoring is an imperative topic widely discussed among researchers.Systems used to monitor traffic frequently rely on cameras mounted on bridges or roadsides.However,aerial images provide the flexibility to use mobile platforms to detect the location and motion of the vehicle over a larger area.To this end,different models have shown the ability to recognize and track vehicles.However,these methods are not mature enough to produce accurate results in complex road scenes.Therefore,this paper presents an algorithm that combines state-of-the-art techniques for identifying and tracking vehicles in conjunction with image bursts.The extracted frames were converted to grayscale,followed by the application of a georeferencing algorithm to embed coordinate information into the images.The masking technique eliminated irrelevant data and reduced the computational cost of the overall monitoring system.Next,Sobel edge detection combined with Canny edge detection and Hough line transform has been applied for noise reduction.After preprocessing,the blob detection algorithm helped detect the vehicles.Vehicles of varying sizes have been detected by implementing a dynamic thresholding scheme.Detection was done on the first image of every burst.Then,to track vehicles,the model of each vehicle was made to find its matches in the succeeding images using the template matching algorithm.To further improve the tracking accuracy by incorporating motion information,Scale Invariant Feature Transform(SIFT)features have been used to find the best possible match among multiple matches.An accuracy rate of 87%for detection and 80%accuracy for tracking in the A1 Motorway Netherland dataset has been achieved.For the Vehicle Aerial Imaging from Drone(VAID)dataset,an accuracy rate of 86%for detection and 78%accuracy for tracking has been achieved.
基金supported by the National Natural Science Foundation of China under Grant 61671219.
文摘Object detection in unmanned aerial vehicle(UAV)aerial images has become increasingly important in military and civil applications.General object detection models are not robust enough against interclass similarity and intraclass variability of small objects,and UAV-specific nuisances such as uncontrolledweather conditions.Unlike previous approaches focusing on high-level semantic information,we report the importance of underlying features to improve detection accuracy and robustness fromthe information-theoretic perspective.Specifically,we propose a robust and discriminative feature learning approach through mutual information maximization(RD-MIM),which can be integrated into numerous object detection methods for aerial images.Firstly,we present the rank sample mining method to reduce underlying feature differences between the natural image domain and the aerial image domain.Then,we design a momentum contrast learning strategy to make object features similar to the same category and dissimilar to different categories.Finally,we construct a transformer-based global attention mechanism to boost object location semantics by leveraging the high interrelation of different receptive fields.We conduct extensive experiments on the VisDrone and Unmanned Aerial Vehicle Benchmark Object Detection and Tracking(UAVDT)datasets to prove the effectiveness of the proposed method.The experimental results show that our approach brings considerable robustness gains to basic detectors and advanced detection methods,achieving relative growth rates of 51.0%and 39.4%in corruption robustness,respectively.Our code is available at https://github.com/cq100/RD-MIM(accessed on 2 August 2024).
基金funded in part by the Equipment Pre-Research Foundation of China,Grant No.61400010203in part by the Independent Project of the State Key Laboratory of Virtual Reality Technology and Systems.
文摘There are two types of methods for image segmentation.One is traditional image processing methods,which are sensitive to details and boundaries,yet fail to recognize semantic information.The other is deep learning methods,which can locate and identify different objects,but boundary identifications are not accurate enough.Both of them cannot generate entire segmentation information.In order to obtain accurate edge detection and semantic information,an Adaptive Boundary and Semantic Composite Segmentation method(ABSCS)is proposed.This method can precisely semantic segment individual objects in large-size aerial images with limited GPU performances.It includes adaptively dividing and modifying the aerial images with the proposed principles and methods,using the deep learning method to semantic segment and preprocess the small divided pieces,using three traditional methods to segment and preprocess original-size aerial images,adaptively selecting traditional results tomodify the boundaries of individual objects in deep learning results,and combining the results of different objects.Individual object semantic segmentation experiments are conducted by using the AeroScapes dataset,and their results are analyzed qualitatively and quantitatively.The experimental results demonstrate that the proposed method can achieve more promising object boundaries than the original deep learning method.This work also demonstrates the advantages of the proposed method in applications of point cloud semantic segmentation and image inpainting.
文摘In this paper,based on a bidirectional parallel multi-branch feature pyramid network(BPMFPN),a novel one-stage object detector called BPMFPN Det is proposed for real-time detection of ground multi-scale targets by swarm unmanned aerial vehicles(UAVs).First,the bidirectional parallel multi-branch convolution modules are used to construct the feature pyramid to enhance the feature expression abilities of different scale feature layers.Next,the feature pyramid is integrated into the single-stage object detection framework to ensure real-time performance.In order to validate the effectiveness of the proposed algorithm,experiments are conducted on four datasets.For the PASCAL VOC dataset,the proposed algorithm achieves the mean average precision(mAP)of 85.4 on the VOC 2007 test set.With regard to the detection in optical remote sensing(DIOR)dataset,the proposed algorithm achieves 73.9 mAP.For vehicle detection in aerial imagery(VEDAI)dataset,the detection accuracy of small land vehicle(slv)targets reaches 97.4 mAP.For unmanned aerial vehicle detection and tracking(UAVDT)dataset,the proposed BPMFPN Det achieves the mAP of 48.75.Compared with the previous state-of-the-art methods,the results obtained by the proposed algorithm are more competitive.The experimental results demonstrate that the proposed algorithm can effectively solve the problem of real-time detection of ground multi-scale targets in aerial images of swarm UAVs.
基金Project supported by the National Science Foundation of China (No.49771063)
文摘Because of quick development of cities, the update of urban GIS data is very important. Change detection is the base of automatic or semi-automatic data update. One way of change detections in urban area is based on old and new aerial images acquired in different durations. The corresponding theory and experiments are introduced and analyzed in this paper. The main procedure includes four stages. The new and old images have to be registered firstly. Then image matching, based on the maximum correlation coefficient, is performed between registered images after the low contrast areas have been removed. The regions with low matching quality are extracted as candidate changed areas. Thirdly, the Gaussian-Laplacian operator is used to detect edges in candidate changed areas on both the registered images, and the straight lines are detected by Hough transformation. Finally, the changed houses and roads can be detected on the basis of straight line matching in candidate changed areas between registered images. Some experimental results show that the method introduced in this paper is effective.
基金uisition of Directional Funded by the National 973 Project (No.2003CB415205).
文摘This paper develops a method which can be used to assist aerial navigation by determining the spatial position and posture of the aerial photographic plane. After the method, aerial images match known DEM to capture the spatial position and posture. Some aerial images and terrain data are used to testify our method. Compared with those of analytic and stereo mappers, the results by our method are correspondent to real measurements well.
基金by Makkah Digital Gate Initiative under grant no.(MDP-IRI-3-2020).
文摘Desertification has become a global threat and caused a crisis,especially in Middle Eastern countries,such as Saudi Arabia.Makkah is one of the most important cities in Saudi Arabia that needs to be protected from desertification.The vegetation area in Makkah has been damaged because of desertification through wind,floods,overgrazing,and global climate change.The damage caused by desertification can be recovered provided urgent action is taken to prevent further degradation of the vegetation area.In this paper,we propose an automatic desertification detection system based on Deep Learning techniques.Aerial images are classified using Convolutional Neural Networks(CNN)to detect land state variation in real-time.CNNs have been widely used for computer vision applications,such as image classification,image segmentation,and quality enhancement.The proposed CNN model was trained and evaluated on the Arial Image Dataset(AID).Compared to state-of-the-art methods,the proposed model has better performance while being suitable for embedded implementation.It has achieved high efficiency with 96.47% accuracy.In light of the current research,we assert the appropriateness of the proposed CNN model in detecting desertification from aerial images.
基金This work was supported by the Equipment Pre-Research Foundation of China(6140001020310).
文摘Three-dimensional(3D)reconstruction based on aerial images has broad prospects,and feature matching is an important step of it.However,for high-resolution aerial images,there are usually problems such as long time,mismatching and sparse feature pairs using traditional algorithms.Therefore,an algorithm is proposed to realize fast,accurate and dense feature matching.The algorithm consists of four steps.Firstly,we achieve a balance between the feature matching time and the number of matching pairs by appropriately reducing the image resolution.Secondly,to realize further screening of the mismatches,a feature screening algorithm based on similarity judgment or local optimization is proposed.Thirdly,to make the algorithm more widely applicable,we combine the results of different algorithms to get dense results.Finally,all matching feature pairs in the low-resolution images are restored to the original images.Comparisons between the original algorithms and our algorithm show that the proposed algorithm can effectively reduce the matching time,screen out the mismatches,and improve the number of matches.
文摘A method for shadow detection and compensation for color aerial images is presented. It is considered that the intensity value of each image pixel is the product of illumination function and ground object reflection, and the shadowed regions on the image are mainly caused by the short of illumination, so the information compensation for the shadowed regions should concentrate on the illumination adjustment of concerned area on the basis of the analysis of whole image. The shadow detection and compensation procedure proposed by this paper consists of four steps.
基金Sao Paulo Research Founda-tion(FAPESP grant#2017/12646-3)National Council for Scien-tific and Technological Development(CNPq grant#309330/2018-1)Coordination for the Improvement of Higher Education Personnel(CAPES Finance Code#001)for their financial support.
文摘Sugarcane planting is an important and growing activity in Brazil.Thereupon,several techniques have been developed over the years to maximize crop productivity and profit,amongst them,processing of sugarcane field images.In this sense,this research aims to identify and analyze crop rows and measure their gaps from aerial images of sugarcane fields.For this,a small Remotely Piloted Aircraft captured the images,generating orthomosaics of the areas for analysis.Then,each orthomosaic is classified with the K-Nearest Neighbor algorithm to segment regions of interest.Planting row orientation is estimated using the RGB gradient filter.Morphological operations and computational geometry models are then used to detect and map rows and gaps along the planting row segment.To evaluate the results,crop rows are mapped and compared to manually taken measurements.Our technique obtained an error smaller than 2%when compared to gap length in crop rows from an orthomosaic with the area of 8.05 ha(ha).The proposed approach can map the positioning of the automatically generated row segments appropriately onto manually created segments.Moreover,our method also achieved similar results when confronted with a manual technique for differing growth stages(40 and 80 days after harvest)of the sugarcane crop.The proposed method presents a great potential to be adopted in sugarcane planting monitoring。
基金The National Key Research and Development Program of China[grant numbers 2020YFC2004003 and 2020YFC2004002].
文摘The existing UAV aerial image de-fog methods have low image contrast after de-fog,the difference between light and dark image is not obvious,leading to poor de-fog effect.Therefore,an aerial image de-fog enhancement method based on dark channel a priori is proposed.The image variance and absolute gradient mean are combined to get the weight coefficients,and the edge pixels are smoothed by using the multiple decomposition form.The image intensity is calculated and the noise is reduced.A convolution neural network is introduced to calculate the atmospheric transmittance in haze.Based on this,dark channel prior algorithm is used to enhance the light and shade difference of aerial photography image and realise the de-fog enhancement of aerial photography image.To verify the performance of the proposed method,simulation experiments are designed which were compared with the existing methods results in better fog-removing effect,higher contrast and shorter time.
基金This work was supported in part by the National Natural Science Foundation of China(No.61571296)the National Science Foundation of USA(No.CNS-1619250).
文摘This paper proposes a new deep learning framework for the location of broken insulators(in particular the self-blast glass insulator)in aerial images.We address the broken insulators location problem in a low signal-noise-ratio(SNR)setting.We deal with two modules:1)object detection based on Faster R-CNN,and 2)classification of pixels based on U-net.For the first time,our paper combines the above two modules.This combination is motivated as follows:Faster R-CNN is used to improve SNR,while the U-net is used for classification of pixels.A diverse aerial image set measured by a power grid in China is tested to validate the proposed approach.Furthermore,a comparison is made among different methods and the result shows that our approach is accurate in real time.
基金This work was supported by the National Basic Research Program of China(973 Program)[grant number 2012CB719901]the National Natural Science Foundation of China(grant number 91438112)the Foundation for the Author of National Excellent Doctoral Dissertation of China(FANEDD)[grant number 201249].
文摘In aerial images,near-specular and specular reflection often appear in water bodies.They often lead to irregular brightness or color changes in water bodies and even produce hot spots,harmful to radiometric normalization.Therefore,water bodies must be eliminated when calculating radiometric differences during radiometric normalization of aerial images.In this paper,a simple method to detect water bodies in aerial images based on texture features is presented,an improved seeded region growing(SRG)method.A texture feature is calculated using the relative standard deviation index(RSDI)and a coarse-to-fine procedure is employed.The proposed method includes a multiple partition strategy and a refinement in gradient image that improves the reliability and accuracy of water body detection.By fusing water bodies detected in multiple images,hot spots in these water bodies are also detected.Experiments validate the feasibility and effectiveness of the proposed method.
文摘Due to improper acquisition settings and other noise artifacts,the image degraded to yield poor mean preservation in brightness.The simplest way to improve the preservation is the implementation of histogram equalization.Because of over-enhancement,it failed to preserve the mean brightness and produce the poor quality of the image.This paper proposes a multi-scale decomposi-tion for brightness preservation using gamma correction.After transformation to hue,saturation and intensity(HSI)channel,the 2D-discrete wavelet transform decomposed the intensity component into low and high-pass coefficients.At the next phase,gamma correction is used by auto-tuning the scale value.The scale is the modified constant value used in the logarithmic function.Further,the scale value is optimized to obtain better visual quality in the image.The optimized value is the weighted distribution of standard deviation-mean of low pass coefficients.Finally,the experimental result is estimated in terms of quality assessment measures used as absolute mean brightness error,the measure of information detail,signal to noise ratio and patch-based contrast quality in the image.By comparison,the proposed method proved to be suitably remarkable in retaining the mean brightness and better visual quality of the image.
基金funded by Institutional Fund Projects under Grant No.(IFPIP:511-611-1443).
文摘The recent adoption of satellite technologies,unmanned aerial vehicles(UAVs)and 5G has encouraged telecom networking to evolve into more stable service to remote areas and render higher quality.But,security concerns with drones were increasing as drone nodes have been striking targets for cyberattacks because of immensely weak inbuilt and growing poor security volumes.This study presents an Archimedes Optimization with Deep Learning based Aerial Image Classification and Intrusion Detection(AODL-AICID)technique in secure UAV networks.The presented AODLAICID technique concentrates on two major processes:image classification and intrusion detection.For aerial image classification,the AODL-AICID technique encompasses MobileNetv2 feature extraction,Archimedes Optimization Algorithm(AOA)based hyperparameter optimizer,and backpropagation neural network(BPNN)based classifier.In addition,the AODLAICID technique employs a stacked bi-directional long short-term memory(SBLSTM)model to accomplish intrusion detection for cybersecurity in UAV networks.At the final stage,the Nadam optimizer is utilized for parameter tuning of the SBLSTM approach.The experimental validation of the AODLAICID technique is tested and the obtained values reported the improved performance of the AODL-AICID technique over other models.
文摘The use of drones in construction engineering has gained increasing attention in recent years due to its potential to revolutionize the industry. Drones, offer the ability to capture high-resolution aerial imagery and collect data that was previously difficult or impossible to obtain. The integration drones in construction engineering presents opportunities for accurate data collection, analysis and visualization, which can improve decision-making processes and improve project outcomes. For example, drones equipped with GIS technology can be used to capture high-resolution aerial images of construction sites, allowing engineers to monitor progress, identify potential issues, and make informed adjustments as needed. By harnessing drones, civil engineers in the civil engineering field can potentially optimize project planning, design and execution while minimizing risks and costs. The work of this topic examines the case of the use of Drones combined with GIS in construction engineering. During this study, aerial photography of a certain segment of the Pristina-Gjilan Highway was taken. The results generated by the processing of aerial photos have been compared with the project. However, further research is needed to fully understand the capabilities and limitations of these technologies in this specific context, as well as to explore any potential challenges and barriers to their widespread adoption.
基金supported by the National Natural Science Foundation of China(No.61833002).
文摘Bridges are an important part of railway infrastructure and need regular inspection and maintenance.Using unmanned aerial vehicle(UAV)technology to inspect railway infrastructure is an active research issue.However,due to the large size of UAV images,flight distance,and height changes,the object scale changes dramatically.At the same time,the elements of interest in railway bridges,such as bolts and corrosion,are small and dense objects,and the sample data set is seriously unbalanced,posing great challenges to the accurate detection of defects.In this paper,an adaptive cropping shallow attention network(ACSANet)is proposed,which includes an adaptive cropping strategy for large UAV images and a shallow attention network for small object detection in limited samples.To enhance the accuracy and generalization of the model,the shallow attention network model integrates a coordinate attention(CA)mechanism module and an alpha intersection over union(α-IOU)loss function,and then carries out defect detection on the bolts,steel surfaces,and railings of railway bridges.The test results show that the ACSANet model outperforms the YOLOv5s model using adaptive cropping strategy in terms of the total mAP(an evaluation index)and missing bolt mAP by 5%and 30%,respectively.Also,compared with the YOLOv5s model that adopts the common cropping strategy,the total mAP and missing bolt mAP are improved by 10%and 60%,respectively.Compared with the YOLOv5s model without any cropping strategy,the total mAP and missing bolt mAP are improved by 40%and 67%,respectively.
文摘This paper investigates the problem of retrieving aerial scene images by using semantic sketches, since the state-of-the-art retrieval systems turn out to be invalid when there is no exemplar query aerial image available. However, due to the complex surface structures and huge variations of resolutions of aerial images, it is very challenging to retrieve aerial images with sketches and few studies have been devoted to this task. In this article, for the first time to our knowledge, we propose a framework to bridge the gap between sketches and aerial images. First, an aerial sketch-image database is collected, and the images and sketches it contains are augmented to various levels of details. We then train a multi-scale deep model by the new dataset. The fully-connected layers of the network in each scale are finally connected and used as cross-domain features, and the Euclidean distance is used to measure the cross-domain similarity between aerial images and sketches. Experiments on several commonly used aerial image datasets demonstrate the superiority of the proposed method compared with the traditional approaches.
基金supported by the National Natural Science Foundation of China(Grant No.41202235)
文摘Inventory maps of earthquake-triggered landslides can be constructed using several methods,which are often subject to obvious differences due to lack of commonly accepted criteria or principles,To solve this problem,the author describes the principles for preparing inventory maps of earthquake-triggered landslides,focusing on varied methods and their criteria,The principles include the following key points:all landslides should be mapped as long as they can be recognized from images;both the boundary and source area position of landslides should be mapped;spatial distribution pattern of earthquake-triggered landslides should be continuous;complex landslides should be divided into distinct groups;three types of errors such as precision of the location and boundary of landslides,false positive errors,and false negative errors of earthquake-triggered landslide inventories should be controlled and reduced;and inventories of co-seismic landslides should be constructed by the visual interpretation method rather than automatic extraction of satellite images or/and aerial photographs,In addition,selection of remote sensing images and creation of landslides attribute database are also discussed in this paper,Then the author applies these principles to produce inventory maps of four events:the 12 May 2008 Wenchuan,China M_w 7.9,14 April 2010 Yushu,China M_w 6,9,12 January 2010 Haiti M_w7.0,and 2007 Aysen Fjord,Chile M_w 6.2,The results show obvious differences in comparison with previous studies by other researchers,which again attest to the necessity of establishment of unified principles for preparation of inventory maps of earthquake-triggered landslides.
文摘In this paper, a study and evaluation of the combination of GPS/GNSS techniques and advanced image processing algorithms for distressed human detection, positioning and tracking, from a fully autonomous Unmanned Aerial Vehicle (UAV)-based rescue support system, </span><span style="font-family:Verdana;">are</span><span style="font-family:Verdana;"> presented. In particular, the issue of human detection both on terrestrial and marine environment under several illumination and background conditions, as the human silhouette in water differs significantly from a terrestrial one</span><span style="font-family:Verdana;">,</span><span style="font-family:Verdana;"> is addressed. A robust approach, including an adaptive distressed human detection algorithm running every N input image frames combined with a much faster tracking algorithm, is proposed. Real time or near-real-time distressed human detection rates achieved, using a single, low cost day/night NIR camera mounted onboard a fully autonomous UAV for Search and Rescue (SAR) operations. Moreover, the generation of our own dataset, for the image processing algorithms training is also presented. Details about both hardware and software configuration as well as the assessment of the proposed approach performance are fully discussed. Last, a comparison of the proposed approach to other human detection methods used in the literature is presented.