Surface small defects are often missed and incorrectly detected due to their small quantity and unapparent visual features.A method named CSYOLOv3,which is based on CutMix and YOLOv3,is proposed to solve such a proble...Surface small defects are often missed and incorrectly detected due to their small quantity and unapparent visual features.A method named CSYOLOv3,which is based on CutMix and YOLOv3,is proposed to solve such a problem.First,a four-image CutMix method is used to increase the small-defect quantity,and the process is dynamically adjusted based on the beta distribution.Then,the classic YOLOv3 is improved to detect small defects accurately.The shallow and large feature maps are split,and several of them are merged with the feature maps of the predicted branch to preserve the shallow features.The loss function of YOLOv3 is optimized and weighted to improve the attention to small defects.Finally,this method is used to detect 512×512 pixel images under RTX 2060Ti GPU,which can reach the speed of 14.09 frame/s,and the mAP is 71.80%,which is 5%-10%higher than that of other methods.For small defects below 64×64 pixels,the mAP of the method reaches 64.15%,which is 14%higher than that of YOLOv3-GIoU.The surface defects of the workpiece can be effectively detected by the proposed method,and the performance in detecting small defects is significantly improved.展开更多
In order to increase the accuracy rate of emotion recognition in voiceand video,the mixed convolutional neural network(CNN)and recurrent neural network(RNN)ae used to encode and integrate the two information sources.F...In order to increase the accuracy rate of emotion recognition in voiceand video,the mixed convolutional neural network(CNN)and recurrent neural network(RNN)ae used to encode and integrate the two information sources.For the audio signals,several frequency bands as well as some energy functions are extacted as low-level features by using a sophisticated audio technique,and then they are encoded w it a one-dimensional(I D)convolutional neural network to abstact high-level features.Finally,tiese are fed into a recurrent neural network for te sake of capturing dynamic tone changes in a temporal dimensionality.As a contrast,a two-dimensional(2D)convolutional neural network and a similar RNN are used to capture dynamic facial appearance changes of temporal sequences.The method was used in te Chinese Natral Audio-'Visual Emotion Database in te Chinese Conference on Pattern Recognition(CCPR)in2016.Experimental results demonstrate that te classification average precision of the proposed metiod is41.15%,which is increased by16.62%compaed with te baseline algorithm offered by the CCPR in2016.It is proved ta t te proposed method has higher accuracy in te identification of emotional information.展开更多
This study combined fault identification with a deep learning algorithm and applied a convolutional neural network(CNN)design based on an improved balanced crossentropy(BCE)loss function to address the low accuracy in...This study combined fault identification with a deep learning algorithm and applied a convolutional neural network(CNN)design based on an improved balanced crossentropy(BCE)loss function to address the low accuracy in the intelligent identification of seismic faults and the slow training speed of convolutional neural networks caused by unbalanced training sample sets.The network structure and optimal hyperparameters were determined by extracting feature maps layer by layer and by analyzing the results of seismic feature extraction.The BCE loss function was used to add the parameter which is the ratio of nonfaults to the total sample sets,thereby changing the loss function to find the reference of the minimum weight parameter and adjusting the ratio of fault to nonfault data.The method overcame the unbalanced number of sample sets and improved the iteration speed.After a brief training,the accuracy could reach more than 95%,and gradient descent was evident.The proposed method was applied to fault identification in an oilfield area.The trained model can predict faults clearly,and the prediction results are basically consistent with an actual case,verifying the effectiveness and adaptability of the method.展开更多
Though traditional methods could recognize some facies, e.g. lagoon facies, backshoal facies and foreshoal facies, they couldn't recognize reef facies and shoal facies well. To solve this problem, back propagation...Though traditional methods could recognize some facies, e.g. lagoon facies, backshoal facies and foreshoal facies, they couldn't recognize reef facies and shoal facies well. To solve this problem, back propagation neural network(BP-ANN) and an improved BP-ANN with better stability and suitability, optimized by a particle swarm optimizer(PSO) algorithm(PSO-BP-ANN) were proposed to solve the microfacies' auto discrimination of M formation from the R oil field in Iraq. Fourteen wells with complete core, borehole and log data were chosen as the standard wells and 120 microfacies samples were inferred from these 14 wells. Besides, the average value of gamma, neutron and density logs as well as the sum of squares of deviations of gamma were extracted as key parameters to build log facies(facies from log measurements)-microfacies transforming model. The total 120 log facies samples were divided into 12 kinds of log facies and 6 kinds of microfacies, e.g. lagoon bioclasts micrite limestone microfacies, shoal bioclasts grainstone microfacies, backshoal bioclasts packstone microfacies, foreshoal bioclasts micrite limestone microfacies, shallow continental micrite limestone microfacies and reef limestone microfacies. Furthermore, 68 samples of these 120 log facies samples were chosen as training samples and another 52 samples were gotten as testing samples to test the predicting ability of the discrimination template. Compared with conventional methods, like Bayes stepwise discrimination, both the BP-ANN and PSO-BP-ANN can integrate more log details with a correct rate higher than 85%. Furthermore, PSO-BP-ANN has more simple structure, smaller amount of weight and threshold and less iteration time.展开更多
基金The National Natural Science Foundation of China(No.52075095).
文摘Surface small defects are often missed and incorrectly detected due to their small quantity and unapparent visual features.A method named CSYOLOv3,which is based on CutMix and YOLOv3,is proposed to solve such a problem.First,a four-image CutMix method is used to increase the small-defect quantity,and the process is dynamically adjusted based on the beta distribution.Then,the classic YOLOv3 is improved to detect small defects accurately.The shallow and large feature maps are split,and several of them are merged with the feature maps of the predicted branch to preserve the shallow features.The loss function of YOLOv3 is optimized and weighted to improve the attention to small defects.Finally,this method is used to detect 512×512 pixel images under RTX 2060Ti GPU,which can reach the speed of 14.09 frame/s,and the mAP is 71.80%,which is 5%-10%higher than that of other methods.For small defects below 64×64 pixels,the mAP of the method reaches 64.15%,which is 14%higher than that of YOLOv3-GIoU.The surface defects of the workpiece can be effectively detected by the proposed method,and the performance in detecting small defects is significantly improved.
文摘In order to increase the accuracy rate of emotion recognition in voiceand video,the mixed convolutional neural network(CNN)and recurrent neural network(RNN)ae used to encode and integrate the two information sources.For the audio signals,several frequency bands as well as some energy functions are extacted as low-level features by using a sophisticated audio technique,and then they are encoded w it a one-dimensional(I D)convolutional neural network to abstact high-level features.Finally,tiese are fed into a recurrent neural network for te sake of capturing dynamic tone changes in a temporal dimensionality.As a contrast,a two-dimensional(2D)convolutional neural network and a similar RNN are used to capture dynamic facial appearance changes of temporal sequences.The method was used in te Chinese Natral Audio-'Visual Emotion Database in te Chinese Conference on Pattern Recognition(CCPR)in2016.Experimental results demonstrate that te classification average precision of the proposed metiod is41.15%,which is increased by16.62%compaed with te baseline algorithm offered by the CCPR in2016.It is proved ta t te proposed method has higher accuracy in te identification of emotional information.
基金supported by the Natural Science Foundation of Shandong Province(ZR202103050722).
文摘This study combined fault identification with a deep learning algorithm and applied a convolutional neural network(CNN)design based on an improved balanced crossentropy(BCE)loss function to address the low accuracy in the intelligent identification of seismic faults and the slow training speed of convolutional neural networks caused by unbalanced training sample sets.The network structure and optimal hyperparameters were determined by extracting feature maps layer by layer and by analyzing the results of seismic feature extraction.The BCE loss function was used to add the parameter which is the ratio of nonfaults to the total sample sets,thereby changing the loss function to find the reference of the minimum weight parameter and adjusting the ratio of fault to nonfault data.The method overcame the unbalanced number of sample sets and improved the iteration speed.After a brief training,the accuracy could reach more than 95%,and gradient descent was evident.The proposed method was applied to fault identification in an oilfield area.The trained model can predict faults clearly,and the prediction results are basically consistent with an actual case,verifying the effectiveness and adaptability of the method.
基金Project(41272137) supported by the National Natural Science Foundation of China
文摘Though traditional methods could recognize some facies, e.g. lagoon facies, backshoal facies and foreshoal facies, they couldn't recognize reef facies and shoal facies well. To solve this problem, back propagation neural network(BP-ANN) and an improved BP-ANN with better stability and suitability, optimized by a particle swarm optimizer(PSO) algorithm(PSO-BP-ANN) were proposed to solve the microfacies' auto discrimination of M formation from the R oil field in Iraq. Fourteen wells with complete core, borehole and log data were chosen as the standard wells and 120 microfacies samples were inferred from these 14 wells. Besides, the average value of gamma, neutron and density logs as well as the sum of squares of deviations of gamma were extracted as key parameters to build log facies(facies from log measurements)-microfacies transforming model. The total 120 log facies samples were divided into 12 kinds of log facies and 6 kinds of microfacies, e.g. lagoon bioclasts micrite limestone microfacies, shoal bioclasts grainstone microfacies, backshoal bioclasts packstone microfacies, foreshoal bioclasts micrite limestone microfacies, shallow continental micrite limestone microfacies and reef limestone microfacies. Furthermore, 68 samples of these 120 log facies samples were chosen as training samples and another 52 samples were gotten as testing samples to test the predicting ability of the discrimination template. Compared with conventional methods, like Bayes stepwise discrimination, both the BP-ANN and PSO-BP-ANN can integrate more log details with a correct rate higher than 85%. Furthermore, PSO-BP-ANN has more simple structure, smaller amount of weight and threshold and less iteration time.