为了探究不同的深度卷积神经网络在行人检测任务中的性能差异,基于Faster-R-CNN深度学习算法框架,在Caltech行人数据集上对VGG-Net(Visual Geometry Group Net)和Res-Net(Residual Net)的性能进行了比较。通过改变数据集、改变训练数据...为了探究不同的深度卷积神经网络在行人检测任务中的性能差异,基于Faster-R-CNN深度学习算法框架,在Caltech行人数据集上对VGG-Net(Visual Geometry Group Net)和Res-Net(Residual Net)的性能进行了比较。通过改变数据集、改变训练数据的数量、对比训练过程中各阶段的检测率,对两个网络的泛化能力、学习能力以及收敛速度进行了对比。实验结果表明,Res-Net相比于VGG-Net网络具有更快的收敛速度和更强的泛化能力;Res-Net的学习能力更强,随着训练数据的扩展,其性能提升更大。在行人检测任务中,Res-Net具有更好的性能。展开更多
Aiming at the difficulty of fault identification caused by manual extraction of fault features of rotating machinery,a one-dimensional multi-scale convolutional auto-encoder fault diagnosis model is proposed,based on ...Aiming at the difficulty of fault identification caused by manual extraction of fault features of rotating machinery,a one-dimensional multi-scale convolutional auto-encoder fault diagnosis model is proposed,based on the standard convolutional auto-encoder.In this model,the parallel convolutional and deconvolutional kernels of different scales are used to extract the features from the input signal and reconstruct the input signal;then the feature map extracted by multi-scale convolutional kernels is used as the input of the classifier;and finally the parameters of the whole model are fine-tuned using labeled data.Experiments on one set of simulation fault data and two sets of rolling bearing fault data are conducted to validate the proposed method.The results show that the model can achieve 99.75%,99.3%and 100%diagnostic accuracy,respectively.In addition,the diagnostic accuracy and reconstruction error of the one-dimensional multi-scale convolutional auto-encoder are compared with traditional machine learning,convolutional neural networks and a traditional convolutional auto-encoder.The final results show that the proposed model has a better recognition effect for rolling bearing fault data.展开更多
With the continuous increase in the number of flights,the use of airport collaborative decision-making(ACDM)systems has been more and more widely spread.The accuracy of the taxi time prediction has an important effect...With the continuous increase in the number of flights,the use of airport collaborative decision-making(ACDM)systems has been more and more widely spread.The accuracy of the taxi time prediction has an important effect on the A-CDM calculation of the departure aircraft’s take-off queue and the accurate time for the aircraft blockout.The spatial-temporal-environment deep learning(STEDL)model is presented to improve the prediction accuracy of departure aircraft taxi-out time.The model is composed of time-flow sub-model(airport capacity,number of taxiing aircraft,and different time periods),spatial sub-model(taxiing distance)and environmental sub-model(weather,air traffic control,runway configuration,and aircraft category).The STEDL model is used to predict the taxi time of departure aircraft at Hong Kong Airport and the results show that the STEDL method has a prediction accuracy of 95.4%.The proposed model also greatly reduces the prediction error rate compared with the other machine learning methods.展开更多
Objective To propose two novel methods based on deep learning for computer-aided tongue diagnosis,including tongue image segmentation and tongue color classification,improving their diagnostic accuracy.Methods LabelMe...Objective To propose two novel methods based on deep learning for computer-aided tongue diagnosis,including tongue image segmentation and tongue color classification,improving their diagnostic accuracy.Methods LabelMe was used to label the tongue mask and Snake model to optimize the labeling results.A new dataset was constructed for tongue image segmentation.Tongue color was marked to build a classified dataset for network training.In this research,the Inception+Atrous Spatial Pyramid Pooling(ASPP)+UNet(IAUNet)method was proposed for tongue image segmentation,based on the existing UNet,Inception,and atrous convolution.Moreover,the Tongue Color Classification Net(TCCNet)was constructed with reference to ResNet,Inception,and Triple-Loss.Several important measurement indexes were selected to evaluate and compare the effects of the novel and existing methods for tongue segmentation and tongue color classification.IAUNet was compared with existing mainstream methods such as UNet and DeepLabV3+for tongue segmentation.TCCNet for tongue color classification was compared with VGG16 and GoogLeNet.Results IAUNet can accurately segment the tongue from original images.The results showed that the Mean Intersection over Union(MIoU)of IAUNet reached 96.30%,and its Mean Pixel Accuracy(MPA),mean Average Precision(mAP),F1-Score,G-Score,and Area Under Curve(AUC)reached 97.86%,99.18%,96.71%,96.82%,and 99.71%,respectively,suggesting IAUNet produced better segmentation than other methods,with fewer parameters.Triplet-Loss was applied in the proposed TCCNet to separate different embedded colors.The experiment yielded ideal results,with F1-Score and mAP of the TCCNet reached 88.86% and 93.49%,respectively.Conclusion IAUNet based on deep learning for tongue segmentation is better than traditional ones.IAUNet can not only produce ideal tongue segmentation,but have better effects than those of PSPNet,SegNet,UNet,and DeepLabV3+,the traditional networks.As for tongue color classification,the proposed network,TCCNet,had better F1-Score and mAP values as compared with other neural networks such as VGG16 and GoogLeNet.展开更多
In order to improve the accuracy of threaded hole object detection,combining a dual camera vision system with the Hough transform circle detection,we propose an object detection method of artifact threaded hole based ...In order to improve the accuracy of threaded hole object detection,combining a dual camera vision system with the Hough transform circle detection,we propose an object detection method of artifact threaded hole based on Faster region-ased convolutional neural network(Faster R-CNN).First,a dual camera image acquisition system is established.One industrial camera placed at a high position is responsible for collecting the whole image of the workpiece,and the suspected screw hole position on the workpiece can be preliminarily selected by Hough transform detection algorithm.Then,the other industrial camera is responsible for collecting the local images of the suspected screw holes that have been detected by Hough transform one by one.After that,ResNet50-based Faster R-CNN object detection model is trained on the self-built screw hole data set.Finally,the local image of the threaded hole is input into the trained Faster R-CNN object detection model for further identification and location.The experimental results show that the proposed method can effectively avoid small object detection of threaded holes,and compared with the method that only uses Hough transform or Faster RCNN object detection alone,it has high recognition and positioning accuracy.展开更多
A data-driven method for arrival pattern recognition and prediction is proposed to provide air traffic controllers(ATCOs)with decision support. For arrival pattern recognition,a clustering-based method is proposed to ...A data-driven method for arrival pattern recognition and prediction is proposed to provide air traffic controllers(ATCOs)with decision support. For arrival pattern recognition,a clustering-based method is proposed to cluster arrival patterns by control intentions. For arrival pattern prediction,two predictors are trained to estimate the most possible command issued by the ATCOs in a particular traffic situation. Training the arrival pattern predictor could be regarded as building an ATCOs simulator. The simulator can assign an appropriate arrival pattern for each arrival aircraft,just like real ATCOs do. Therefore,the simulator is considered to be able to provide effective advice for part of the work of ATCOs. Finally,a case study is carried out and demonstrates that the convolutional neural network(CNN)-based predictor performs better than the radom forest(RF)-based one.展开更多
Objective We developed a universal lesion detector(ULDor)which showed good performance in in-lab experiments.The study aims to evaluate the performance and its ability to generalize in clinical setting via both extern...Objective We developed a universal lesion detector(ULDor)which showed good performance in in-lab experiments.The study aims to evaluate the performance and its ability to generalize in clinical setting via both external and internal validation.Methods The ULDor system consists of a convolutional neural network(CNN)trained on around 80 K lesion annotations from about 12 K CT studies in the DeepLesion dataset and 5 other public organ-specific datasets.During the validation process,the test sets include two parts:the external validation dataset which was comprised of 164 sets of non-contrasted chest and upper abdomen CT scans from a comprehensive hospital,and the internal validation dataset which was comprised of 187 sets of low-dose helical CT scans from the National Lung Screening Trial(NLST).We ran the model on the two test sets to output lesion detection.Three board-certified radiologists read the CT scans and verified the detection results of ULDor.We used positive predictive value(PPV)and sensitivity to evaluate the performance of the model in detecting space-occupying lesions at all extra-pulmonary organs visualized on CT images,including liver,kidney,pancreas,adrenal,spleen,esophagus,thyroid,lymph nodes,body wall,thoracic spine,etc.Results In the external validation,the lesion-level PPV and sensitivity of the model were 57.9%and 67.0%,respectively.On average,the model detected 2.1 findings per set,and among them,0.9 were false positives.ULDor worked well for detecting liver lesions,with a PPV of 78.9%and a sensitivity of 92.7%,followed by kidney,with a PPV of 70.0%and a sensitivity of 58.3%.In internal validation with NLST test set,ULDor obtained a PPV of 75.3%and a sensitivity of 52.0%despite the relatively high noise level of soft tissue on images.Conclusions The performance tests of ULDor with the external real-world data have shown its high effectiveness in multiple-purposed detection for lesions in certain organs.With further optimisation and iterative upgrades,ULDor may be well suited for extensive application to external data.展开更多
In recent years,deep learning methods have gradually come to be used in hyperspectral imaging domains.Because of the peculiarity of hyperspectral imaging,a mass of information is contained in the spectral dimensions o...In recent years,deep learning methods have gradually come to be used in hyperspectral imaging domains.Because of the peculiarity of hyperspectral imaging,a mass of information is contained in the spectral dimensions of hyperspectral images.Also,different ob jects on a land surface are sensitive to different ranges of wavelength.To achieve higher accuracy in classification,we propose a structure that combines spectral sensitivity with a convolutional neural network by adding spectral weights derived from predicted outcomes before the final classification layer.First,samples are divided into visible light and infrared,with a portion of the samples fed into networks during training.Then,two key parameters,unrecognized rate(δ)and wrongly recognized rate(γ),are calculated from the predicted outcome of the whole scene.Next,the spectral weight,derived from these two parameters,is calculated.Finally,the spectral weight is added and an improved structure is constructed.The improved structure not only combines the features in spatial and spectral dimensions,but also gives spectral sensitivity a primary status.Compared with inputs from the whole spectrum,the improved structure attains a nearly 2%higher prediction accuracy.When applied to public data sets,compared with the whole spectrum,on the average we achieve approximately 1%higher accuracy.展开更多
文摘星系的光谱包含其内部恒星的年龄和金属丰度等信息,从观测光谱数据中测量这些信息对于深入了解星系的形成和演化至关重要.LAMOST(Large Sky Area Multi-Object Fiber Spectroscopic Telescope)巡天发布了大量的星系光谱,这些高维光谱与它们的物理参数之间存在着高度的非线性关系.而深度学习适合于处理多维、海量的非线性数据,因此基于深度学习技术构建了一个8个卷积层+4个池化层+1个全连接层的卷积神经网络,对LAMOST Data Release 7(DR7)星系的年龄和金属丰度进行自动估计.实验结果表明,使用卷积神经网络通过星系光谱预测的星族参数与传统方法基本一致,误差在0.18dex以内,并且随着光谱信噪比的增大,预测误差越来越小.实验还对比了卷积神经网络与随机森林回归模型、深度神经网络的参数测量结果,结果表明卷积神经网络的结果优于其他两种回归模型.
文摘为了探究不同的深度卷积神经网络在行人检测任务中的性能差异,基于Faster-R-CNN深度学习算法框架,在Caltech行人数据集上对VGG-Net(Visual Geometry Group Net)和Res-Net(Residual Net)的性能进行了比较。通过改变数据集、改变训练数据的数量、对比训练过程中各阶段的检测率,对两个网络的泛化能力、学习能力以及收敛速度进行了对比。实验结果表明,Res-Net相比于VGG-Net网络具有更快的收敛速度和更强的泛化能力;Res-Net的学习能力更强,随着训练数据的扩展,其性能提升更大。在行人检测任务中,Res-Net具有更好的性能。
基金The National Natural Science Foundation of China(No.51675098)
文摘Aiming at the difficulty of fault identification caused by manual extraction of fault features of rotating machinery,a one-dimensional multi-scale convolutional auto-encoder fault diagnosis model is proposed,based on the standard convolutional auto-encoder.In this model,the parallel convolutional and deconvolutional kernels of different scales are used to extract the features from the input signal and reconstruct the input signal;then the feature map extracted by multi-scale convolutional kernels is used as the input of the classifier;and finally the parameters of the whole model are fine-tuned using labeled data.Experiments on one set of simulation fault data and two sets of rolling bearing fault data are conducted to validate the proposed method.The results show that the model can achieve 99.75%,99.3%and 100%diagnostic accuracy,respectively.In addition,the diagnostic accuracy and reconstruction error of the one-dimensional multi-scale convolutional auto-encoder are compared with traditional machine learning,convolutional neural networks and a traditional convolutional auto-encoder.The final results show that the proposed model has a better recognition effect for rolling bearing fault data.
基金This work was supported by the National Natural Science Foundation of China(Nos.U1833103,71801215)the China Civil Aviation Environment and Sustainable Development Research Center Open Fund(No.CESCA2019Y04).
文摘With the continuous increase in the number of flights,the use of airport collaborative decision-making(ACDM)systems has been more and more widely spread.The accuracy of the taxi time prediction has an important effect on the A-CDM calculation of the departure aircraft’s take-off queue and the accurate time for the aircraft blockout.The spatial-temporal-environment deep learning(STEDL)model is presented to improve the prediction accuracy of departure aircraft taxi-out time.The model is composed of time-flow sub-model(airport capacity,number of taxiing aircraft,and different time periods),spatial sub-model(taxiing distance)and environmental sub-model(weather,air traffic control,runway configuration,and aircraft category).The STEDL model is used to predict the taxi time of departure aircraft at Hong Kong Airport and the results show that the STEDL method has a prediction accuracy of 95.4%.The proposed model also greatly reduces the prediction error rate compared with the other machine learning methods.
基金Scientific Research Project of the Education Department of Hunan Province(20C1435)Open Fund Project for Computer Science and Technology of Hunan University of Chinese Medicine(2018JK05).
文摘Objective To propose two novel methods based on deep learning for computer-aided tongue diagnosis,including tongue image segmentation and tongue color classification,improving their diagnostic accuracy.Methods LabelMe was used to label the tongue mask and Snake model to optimize the labeling results.A new dataset was constructed for tongue image segmentation.Tongue color was marked to build a classified dataset for network training.In this research,the Inception+Atrous Spatial Pyramid Pooling(ASPP)+UNet(IAUNet)method was proposed for tongue image segmentation,based on the existing UNet,Inception,and atrous convolution.Moreover,the Tongue Color Classification Net(TCCNet)was constructed with reference to ResNet,Inception,and Triple-Loss.Several important measurement indexes were selected to evaluate and compare the effects of the novel and existing methods for tongue segmentation and tongue color classification.IAUNet was compared with existing mainstream methods such as UNet and DeepLabV3+for tongue segmentation.TCCNet for tongue color classification was compared with VGG16 and GoogLeNet.Results IAUNet can accurately segment the tongue from original images.The results showed that the Mean Intersection over Union(MIoU)of IAUNet reached 96.30%,and its Mean Pixel Accuracy(MPA),mean Average Precision(mAP),F1-Score,G-Score,and Area Under Curve(AUC)reached 97.86%,99.18%,96.71%,96.82%,and 99.71%,respectively,suggesting IAUNet produced better segmentation than other methods,with fewer parameters.Triplet-Loss was applied in the proposed TCCNet to separate different embedded colors.The experiment yielded ideal results,with F1-Score and mAP of the TCCNet reached 88.86% and 93.49%,respectively.Conclusion IAUNet based on deep learning for tongue segmentation is better than traditional ones.IAUNet can not only produce ideal tongue segmentation,but have better effects than those of PSPNet,SegNet,UNet,and DeepLabV3+,the traditional networks.As for tongue color classification,the proposed network,TCCNet,had better F1-Score and mAP values as compared with other neural networks such as VGG16 and GoogLeNet.
文摘In order to improve the accuracy of threaded hole object detection,combining a dual camera vision system with the Hough transform circle detection,we propose an object detection method of artifact threaded hole based on Faster region-ased convolutional neural network(Faster R-CNN).First,a dual camera image acquisition system is established.One industrial camera placed at a high position is responsible for collecting the whole image of the workpiece,and the suspected screw hole position on the workpiece can be preliminarily selected by Hough transform detection algorithm.Then,the other industrial camera is responsible for collecting the local images of the suspected screw holes that have been detected by Hough transform one by one.After that,ResNet50-based Faster R-CNN object detection model is trained on the self-built screw hole data set.Finally,the local image of the threaded hole is input into the trained Faster R-CNN object detection model for further identification and location.The experimental results show that the proposed method can effectively avoid small object detection of threaded holes,and compared with the method that only uses Hough transform or Faster RCNN object detection alone,it has high recognition and positioning accuracy.
基金supported by the National Natural Science Foundation of China (Nos. U1933117,61773202,52072174)。
文摘A data-driven method for arrival pattern recognition and prediction is proposed to provide air traffic controllers(ATCOs)with decision support. For arrival pattern recognition,a clustering-based method is proposed to cluster arrival patterns by control intentions. For arrival pattern prediction,two predictors are trained to estimate the most possible command issued by the ATCOs in a particular traffic situation. Training the arrival pattern predictor could be regarded as building an ATCOs simulator. The simulator can assign an appropriate arrival pattern for each arrival aircraft,just like real ATCOs do. Therefore,the simulator is considered to be able to provide effective advice for part of the work of ATCOs. Finally,a case study is carried out and demonstrates that the convolutional neural network(CNN)-based predictor performs better than the radom forest(RF)-based one.
文摘Objective We developed a universal lesion detector(ULDor)which showed good performance in in-lab experiments.The study aims to evaluate the performance and its ability to generalize in clinical setting via both external and internal validation.Methods The ULDor system consists of a convolutional neural network(CNN)trained on around 80 K lesion annotations from about 12 K CT studies in the DeepLesion dataset and 5 other public organ-specific datasets.During the validation process,the test sets include two parts:the external validation dataset which was comprised of 164 sets of non-contrasted chest and upper abdomen CT scans from a comprehensive hospital,and the internal validation dataset which was comprised of 187 sets of low-dose helical CT scans from the National Lung Screening Trial(NLST).We ran the model on the two test sets to output lesion detection.Three board-certified radiologists read the CT scans and verified the detection results of ULDor.We used positive predictive value(PPV)and sensitivity to evaluate the performance of the model in detecting space-occupying lesions at all extra-pulmonary organs visualized on CT images,including liver,kidney,pancreas,adrenal,spleen,esophagus,thyroid,lymph nodes,body wall,thoracic spine,etc.Results In the external validation,the lesion-level PPV and sensitivity of the model were 57.9%and 67.0%,respectively.On average,the model detected 2.1 findings per set,and among them,0.9 were false positives.ULDor worked well for detecting liver lesions,with a PPV of 78.9%and a sensitivity of 92.7%,followed by kidney,with a PPV of 70.0%and a sensitivity of 58.3%.In internal validation with NLST test set,ULDor obtained a PPV of 75.3%and a sensitivity of 52.0%despite the relatively high noise level of soft tissue on images.Conclusions The performance tests of ULDor with the external real-world data have shown its high effectiveness in multiple-purposed detection for lesions in certain organs.With further optimisation and iterative upgrades,ULDor may be well suited for extensive application to external data.
基金Project supported by the Strategic Priority Research Program of the Chinese Academy of Sciences(No.XDA23090203)the National Key Technologies Research and Development Program of China(No.2016YFB0502600)the Key Program of Sichuan Bureau of Science and Technology(No.2018SZ0350),China。
文摘In recent years,deep learning methods have gradually come to be used in hyperspectral imaging domains.Because of the peculiarity of hyperspectral imaging,a mass of information is contained in the spectral dimensions of hyperspectral images.Also,different ob jects on a land surface are sensitive to different ranges of wavelength.To achieve higher accuracy in classification,we propose a structure that combines spectral sensitivity with a convolutional neural network by adding spectral weights derived from predicted outcomes before the final classification layer.First,samples are divided into visible light and infrared,with a portion of the samples fed into networks during training.Then,two key parameters,unrecognized rate(δ)and wrongly recognized rate(γ),are calculated from the predicted outcome of the whole scene.Next,the spectral weight,derived from these two parameters,is calculated.Finally,the spectral weight is added and an improved structure is constructed.The improved structure not only combines the features in spatial and spectral dimensions,but also gives spectral sensitivity a primary status.Compared with inputs from the whole spectrum,the improved structure attains a nearly 2%higher prediction accuracy.When applied to public data sets,compared with the whole spectrum,on the average we achieve approximately 1%higher accuracy.