Funding: Supported in part by the National Natural Science Foundation of China under Grants 62201452, 2271296 and 62201453; in part by the Natural Science Basic Research Programme of Shaanxi under Grant 2022JQ-592; in part by the Special Construction Fund for Key Disciplines of Shaanxi Provincial Higher Education; in part by the Natural Science Basic Research Program of Shaanxi under Grant 2021JC-47; and in part by the Scientific Research Program Funded by Shaanxi Provincial Education Department under Grant 22JK0568.
Abstract: Aerial scene recognition (ASR) has attracted great attention due to its increasingly essential applications. Most ASR methods adopt a multi-scale architecture because both global and local features play important roles in ASR. However, existing multi-scale methods neglect effective interactions among different scales and spatial locations when fusing global and local features, which limits their ability to handle the large-scale variation and complex backgrounds found in aerial scene images. In addition, existing methods may generalise poorly because of millions of to-be-learnt parameters and inconsistent predictions between global and local features. To tackle these problems, this study proposes a scale-wise interaction fusion and knowledge distillation (SIF-KD) network for learning robust and discriminative features that are scale-invariant and background-independent. The main highlights of this study include two aspects. On the one hand, a global-local feature collaborative learning scheme is devised to extract scale-invariant features and so tackle the large-scale variation problem in aerial scene images. Specifically, a plug-and-play multi-scale context attention fusion module is proposed for collaboratively fusing the context information between global and local features. On the other hand, a scale-wise knowledge distillation scheme is proposed to produce more consistent predictions by distilling the predictive distribution between different scales during training. Comprehensive experimental results show that the proposed SIF-KD network achieves the best overall accuracies of 99.68%, 98.74% and 95.47% on the UCM, AID and NWPU-RESISC45 datasets, respectively, compared with state-of-the-art methods.
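The idea of distilling the predictive distribution between scales can be illustrated with a small sketch. This is not the SIF-KD loss itself, which the abstract does not specify; it assumes, purely for illustration, that a global branch and a local branch each emit class logits and that consistency is encouraged with a temperature-softened symmetric KL divergence. The names `scale_wise_kd_loss`, `global_logits`, `local_logits` and `temperature` are hypothetical.

```python
import math


def softmax(logits, temperature=1.0):
    """Temperature-softened softmax over a list of class logits."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(z - peak) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))


def scale_wise_kd_loss(global_logits, local_logits, temperature=2.0):
    """Symmetric KL between the two branches' softened predictions.

    Assumption: a symmetric form is used so neither scale is a fixed
    teacher; the paper may define the direction differently.
    """
    p_global = softmax(global_logits, temperature)
    p_local = softmax(local_logits, temperature)
    sym_kl = 0.5 * (kl_divergence(p_global, p_local)
                    + kl_divergence(p_local, p_global))
    # The T^2 factor is the usual rescaling in temperature-based distillation.
    return sym_kl * temperature ** 2


if __name__ == "__main__":
    print(scale_wise_kd_loss([2.0, 0.5, -1.0], [1.5, 0.7, -0.8]))
```

In training, a term of this kind would typically be added to the ordinary classification loss, so that the branches are pushed towards agreeing on the same scene class.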
Funding: Science and Technology Plan Project of Xi'an, Grant/Award Number: GXYD17.12; Open Fund of Shaanxi Key Laboratory of Network Data Intelligent Processing, Grant/Award Number: XUPT-KLND(201802, 201803); Key Research and Development Program of Shaanxi, Grant/Award Number: 2019GY-021.
Abstract: Fracture is one of the most common and unexpected traumas. If not treated in time, it may cause serious consequences such as joint stiffness, traumatic arthritis, and nerve injury. Using computer vision technology to detect fractures can reduce clinicians' workload and the rate of misdiagnosis, and can also improve detection speed. However, some problems remain in sternum fracture detection, such as the low detection rate of small and occult fractures. In this work, the authors constructed a dataset of 1227 labelled X-ray images for sternum fracture detection and designed a fully automatic fracture detection model based on a deep convolutional neural network (CNN). The authors used cascade R-CNN, an attention mechanism, and atrous convolution to optimise the detection of small fractures in a large X-ray image with big local variations. The authors compared the detection results of the YOLOv5 model, cascade R-CNN and other state-of-the-art models. They found that the convolutional neural network based on cascade and attention mechanisms has a better detection effect and reaches an mAP of 0.71, which is much better than the YOLOv5 model (mAP = 0.44) and cascade R-CNN (mAP = 0.55).
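Atrous (dilated) convolution, one of the components named above, can be sketched in one dimension to show how it widens the receptive field without adding weights. This is a generic illustration, not the authors' detection model; the function name `atrous_conv1d` and the toy signal are assumptions made for the example.

```python
def atrous_conv1d(signal, kernel, dilation=1):
    """Valid-mode 1-D atrous convolution (cross-correlation, no kernel flip).

    Kernel taps are spaced `dilation` samples apart, so a kernel with k taps
    covers (k - 1) * dilation + 1 input samples while still using k weights.
    """
    span = (len(kernel) - 1) * dilation
    return [
        sum(w * signal[start + tap * dilation] for tap, w in enumerate(kernel))
        for start in range(len(signal) - span)
    ]


if __name__ == "__main__":
    ramp = [1, 2, 3, 4, 5, 6]
    edge = [1, 0, -1]
    print(atrous_conv1d(ramp, edge, dilation=1))  # [-2, -2, -2, -2]
    print(atrous_conv1d(ramp, edge, dilation=2))  # [-4, -4]
```

With `dilation=2` the same three weights compare samples four positions apart instead of two, which is why dilated kernels are often used when small structures must be related to wider context in a large image.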