Funding: This study was supported by the Basic Science Research Program through the National Research Foundation of Korea (NRF), funded by the Ministry of Science, ICT & Future Planning (NRF-2020R1A2C1014829), and by the Soonchunhyang University Research Fund.
Abstract: An otoscope is traditionally used to examine the eardrum and ear canal. A diagnosis of otitis media (OM) relies on the experience of the clinician; for an inexperienced examiner, the examination can be difficult and time-consuming. This paper presents an ear disease classification method for middle ear images based on a convolutional neural network (CNN). Specifically, segmentation and classification networks are used to classify an otoscopic image into six classes: normal, acute otitis media (AOM), otitis media with effusion (OME), chronic otitis media (COM), congenital cholesteatoma (CC), and traumatic perforations (TMPs). Mask R-CNN serves as the segmentation network and extracts the region of interest (ROI) from otoscopic images. The extracted ROIs are used as guiding features for classification. The classification is based on transfer learning with an ensemble of two CNN classifiers, EfficientNetB0 and Inception-V3. The proposed model was trained with 5-fold cross-validation and achieved a classification accuracy of 97.29%.
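The abstract does not say how the two classifiers are combined. One common choice is soft voting, which averages the per-class probabilities. The sketch below assumes that scheme; the function name `soft_vote` and the probability values are hypothetical and are not taken from the paper.

```python
def soft_vote(prob_a, prob_b, class_names):
    """Average two per-class probability vectors and return the winning class."""
    if not (len(prob_a) == len(prob_b) == len(class_names)):
        raise ValueError("probability vectors and class list must have equal length")
    averaged = [(a + b) / 2.0 for a, b in zip(prob_a, prob_b)]
    best = max(range(len(averaged)), key=averaged.__getitem__)
    return class_names[best], averaged


# The six classes named in the abstract.
CLASSES = ["normal", "AOM", "OME", "COM", "CC", "TMP"]

# Hypothetical softmax outputs for one ROI (illustrative values only).
p_efficientnet = [0.05, 0.55, 0.30, 0.05, 0.03, 0.02]
p_inception = [0.10, 0.30, 0.50, 0.05, 0.03, 0.02]

label, probs = soft_vote(p_efficientnet, p_inception, CLASSES)
print(label, [round(p, 3) for p in probs])
```

Here the two models disagree on the top class, and the averaged vector decides between them.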
Funding: Partially supported by the National Natural Science Foundation of China (Nos. 11590772, 11590770) and the Pre-research Project for Equipment of General Information System (No. JZX2017-0994/Y306).
Abstract: This paper presents a deep neural network (DNN)-based speech enhancement algorithm that uses soft audible noise masking for single-channel wind noise reduction. To reduce low-frequency residual noise, a psychoacoustic model is adopted to calculate the masking threshold from the estimated clean speech spectrum. The noise suppression gain is obtained through soft audible noise masking by comparing the estimated wind noise spectrum with the masking threshold. To handle abruptly time-varying noisy signals, two separate DNN models estimate the spectra of the clean speech and wind noise components. Subjective and objective quality tests show that the proposed algorithm performs better than the conventional DNN-based wind noise reduction method.
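The abstract does not give the gain rule, so the following is only a sketch of the general idea of comparing a noise spectrum with a masking threshold. It assumes power spectra per frequency bin, leaves bins untouched when the noise is already below the threshold, and otherwise attenuates so that the residual noise power equals the threshold. The gain floor and all numeric values are hypothetical.

```python
import math


def masking_gain(noise_power, mask_threshold, floor=0.1):
    """Per-bin amplitude gain from estimated noise power and masking threshold.

    Assumed rule (not from the paper): gain is 1 where the noise is inaudible,
    otherwise sqrt(threshold / noise), limited from below by `floor`.
    """
    gains = []
    for n, t in zip(noise_power, mask_threshold):
        if n <= t:
            gains.append(1.0)  # noise already masked by speech
        else:
            gains.append(max(floor, math.sqrt(t / n)))
    return gains


# Hypothetical per-bin values: wind noise power and masking threshold.
noise = [0.5, 4.0, 16.0, 1000.0]
threshold = [1.0, 1.0, 1.0, 1.0]
gains = masking_gain(noise, threshold)
print(gains)
```

The floor keeps very noisy bins from being driven to zero, which is a common way to limit musical-noise artifacts.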
Abstract: In pursuit of a safe high-tech society at a time when traffic accidents are frequent, traffic sign detection has become an essential research topic. The ultimate goal of this research is to identify and classify the types of traffic signs in a panoramic image. To this end, the paper proposes a new model for traffic sign detection that uses a Convolutional Neural Network for comprehensive traffic sign classification and a Mask Region-based Convolutional Neural Network (Mask R-CNN) for identifying and extracting signs in panoramic images. Data augmentation and image normalization are also applied to improve classification even when old traffic signs are degraded, and to considerably reduce the rate of spurious extra bounding boxes. The proposed model is tested on both the testing dataset and real-world images. It achieves a correct sign recognition rate of 94.5%; the classification rate for the detected signs is 99.41%, and the false sign rate is only about 0.11.
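Abstract: To address the difficulty of mining entity-relation semantics and the bias in predicted relations in the relation extraction (RE) task, an RE method based on Mask prompt and Gated Memory Network Calibration (MGMNC) is proposed. First, the masks in the prompt are used to learn the latent semantics between entities in the semantic space of a pre-trained language model (PLM), and a mask attention weight matrix is constructed to associate the discrete mask semantic spaces with one another. Second, a gated calibration network integrates the mask representations, which contain entity and relation semantics, into the global semantics of the sentence. Third, these representations serve as relation prompts to calibrate the relation information, and the final sentence representation is then mapped to the corresponding relation class. Finally, by making better use of the masks in the prompt and combining this with the strength of traditional fine-tuning in learning sentence-level global semantics, the potential of the PLM is fully exploited. Experimental results show that the proposed method reaches an F1 score of 91.4% on the SemEval (SemEval-2010 Task 8) dataset, 1.0 percentage points higher than the generative method RELA (Relation Extraction with Label Augmentation). On the SciERC (Entities, Relations, and Coreference for Scientific knowledge graph construction) and CLTC (Chinese Literature Text Corpus) datasets, the F1 scores reach 91.0% and 82.8%, respectively. The proposed method clearly outperforms the comparison methods on all three datasets, which verifies its effectiveness, and it achieves better extraction performance than generative methods.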
Funding: Supported by the National Natural Science Foundation of China (Nos. 51677171, 51637009, 51577166, and 51827810), the National Key R&D Program of China (No. 2018YFB0606000), the China Scholarship Council (No. 201708330502), the Fund of Shuohuang Railway Development Limited Liability Company (No. SHTL-2020-13), and the Fund of the State Key Laboratory of Industrial Control Technology (No. ICT2022B29), China.
Abstract: Rod insulators are vital parts of the catenary of high-speed railways (HSRs). There are many different catenary insulators, and the background of insulator images is complicated, so it is difficult to recognise insulators and detect defects automatically. In this paper, we propose an intelligent catenary defect detection algorithm based on the Mask region-based convolutional neural network (Mask R-CNN) and an image processing model. Vertical projection is used to position single sheds and cut the insulator precisely. Gradient, texture, and gray feature fusion (GTGFF) and a K-means clustering analysis model (KCAM) are proposed to detect broken insulators, dirt, foreign bodies, and flashover. With this model, insulator recognition and defect detection achieve high recall and accuracy, and defect detection is generalized. The algorithm is tested and verified on a dataset of realistic insulator images, and its accuracy and reliability satisfy current requirements for automatic inspection and intelligent maintenance of HSR catenaries.
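Vertical projection can be illustrated on a toy binary mask. The sketch below sums foreground pixels per column and reports column ranges where the profile exceeds a threshold, treating those ranges as shed candidates. The mask, the threshold, and the function names are hypothetical, and the paper's actual positioning and cutting procedure may differ.

```python
def vertical_projection(mask):
    """Column-wise count of foreground pixels in a binary mask (list of rows)."""
    return [sum(col) for col in zip(*mask)]


def find_segments(profile, threshold):
    """Return [start, end) column ranges where the profile exceeds the threshold."""
    segments, start = [], None
    for x, value in enumerate(profile):
        if value > threshold and start is None:
            start = x  # a run of tall columns begins
        elif value <= threshold and start is not None:
            segments.append((start, x))  # the run ended at the previous column
            start = None
    if start is not None:
        segments.append((start, len(profile)))
    return segments


# Toy mask: a thin horizontal rod (row 2) with three taller "sheds".
mask = [
    [0, 1, 1, 0, 0, 1, 1, 0, 0, 1],
    [0, 1, 1, 0, 0, 1, 1, 0, 0, 1],
    [1, 1, 1, 1, 1, 1, 1, 1, 1, 1],
    [0, 1, 1, 0, 0, 1, 1, 0, 0, 1],
]
profile = vertical_projection(mask)
sheds = find_segments(profile, threshold=1)
print(profile)
print(sheds)
```

Each returned range gives the columns at which one shed could be cropped from the insulator image.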
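Abstract: With the continuing development of intelligent construction machinery, traditional hydraulic breaking hammers, which are structurally complex and limited to a single function, are gradually being replaced by increasingly mature intelligent breaking hammers. Recognizing and locating multiple target rocks is essential for an intelligent breaker to improve the accuracy of its action output and complete the breaking task. A target rock recognition and localization method is proposed that fuses instance segmentation by the mask region-based convolutional neural network (Mask R-CNN) with lidar information. The Mask R-CNN instance segmentation algorithm quickly identifies the region of interest (RoI) of target rocks in complex working scenes. While maintaining rock detection precision, lidar data are fused through a Kalman filter to obtain the position of the breaking point, which guides the breaking hammer in positioning. Field tests show that the target rock detection model achieves an average recognition precision of 95.35% and a recall of 95.06% for rocks, and a recognition precision of 94.20% for rock breaking points. Against complex working backgrounds, the method recognizes multiple target rocks and locates breaking points, and it meets the real-time requirements of automatic breaking.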
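The abstract does not describe the state model of the Kalman filter. As a minimal sketch of the filtering step only, the code below assumes a single static coordinate of the breaking point observed repeatedly by a noisy range sensor. The measurement values, variances, and function names are hypothetical.

```python
def kalman_update(estimate, variance, measurement, meas_variance,
                  process_variance=0.0):
    """One predict/update cycle of a scalar Kalman filter for a static target."""
    variance = variance + process_variance       # predict: uncertainty grows
    gain = variance / (variance + meas_variance)  # Kalman gain
    estimate = estimate + gain * (measurement - estimate)
    variance = (1.0 - gain) * variance
    return estimate, variance


def fuse_measurements(measurements, meas_variance,
                      initial_estimate=0.0, initial_variance=1e6):
    """Fold a sequence of noisy measurements into one estimate and variance."""
    est, var = initial_estimate, initial_variance
    for z in measurements:
        est, var = kalman_update(est, var, z, meas_variance)
    return est, var


# Hypothetical lidar ranges (metres) to one breaking point.
ranges = [2.10, 1.95, 2.05, 1.98]
position, uncertainty = fuse_measurements(ranges, meas_variance=0.04)
print(round(position, 3), round(uncertainty, 4))
```

With a very large initial variance the estimate approaches the plain average of the measurements, and the variance shrinks as more measurements arrive.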
Abstract: This paper proposes a solution for the localization and classification of rice grains in an image. All existing related works rely on conventional machine learning approaches. However, those techniques do not perform well on the problem addressed in this paper because of the high similarity between different types of rice grains. A deep learning-based solution is therefore developed. It contains pre-processing steps of data annotation using the watershed algorithm, auto-alignment using the major axis orientation, and image enhancement using the contrast-limited adaptive histogram equalization (CLAHE) technique. A mask region-based convolutional neural network (Mask R-CNN) is then trained to localize and classify rice grains in an input image. Performance is enhanced by transfer learning and by dropout regularization to prevent overfitting. The proposed method is validated in many experimental scenarios, with results reported as mean average precision (mAP) and a confusion matrix. It achieves above 80% mAP in the main scenarios and is also shown to perform very well compared with human experts.
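The major axis orientation used for auto-alignment can be computed from second-order central moments with the standard formula theta = 0.5 * atan2(2 * mu11, mu20 - mu02). The sketch below applies it to a list of foreground pixel coordinates. The point sets are hypothetical, and the paper's own alignment step may be implemented differently.

```python
import math


def major_axis_angle(points):
    """Orientation (radians) of the major axis of a set of (x, y) points.

    Uses second-order central moments. In image coordinates, where y points
    down, the sign of the angle flips relative to the usual math convention.
    """
    n = len(points)
    cx = sum(x for x, _ in points) / n
    cy = sum(y for _, y in points) / n
    mu20 = sum((x - cx) ** 2 for x, _ in points)
    mu02 = sum((y - cy) ** 2 for _, y in points)
    mu11 = sum((x - cx) * (y - cy) for x, y in points)
    return 0.5 * math.atan2(2.0 * mu11, mu20 - mu02)


# Hypothetical grain pixels lying along a diagonal and along a horizontal line.
diagonal = [(0, 0), (1, 1), (2, 2), (3, 3)]
horizontal = [(0, 0), (1, 0), (2, 0)]
angle_diag = math.degrees(major_axis_angle(diagonal))
angle_horiz = math.degrees(major_axis_angle(horizontal))
print(angle_diag, angle_horiz)
```

Rotating each grain crop by the negative of this angle would bring its major axis to the horizontal before classification.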
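Abstract: Intelligent recognition of wheat ears has research and application value in yield estimation, seed screening, and field density estimation. Traditional wheat ear recognition methods are inefficient and labor-intensive. An automatic wheat ear recognition model is therefore proposed that uses high-definition unmanned aerial vehicle (UAV) images as the data source and builds a Mask-RCNN deep neural network based on convolutional neural networks, with mean average precision (MAP) and prediction precision (Precision) as the evaluation metrics. Experimental results show that the Mask-RCNN wheat ear recognition model reaches a MAP of 72.32% and a precision of 95.2%, indicating that the Mask-RCNN-based model is an efficient and accurate recognition method with practical value.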
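As a rough illustration of the precision metric, the sketch below matches predicted boxes to ground-truth boxes by intersection over union (IoU) and counts a prediction as correct when its IoU reaches a threshold. The 0.5 threshold, the greedy matching, and the boxes are assumptions for illustration; the abstract does not state the evaluation protocol.

```python
def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    union = ((a[2] - a[0]) * (a[3] - a[1])
             + (b[2] - b[0]) * (b[3] - b[1]) - inter)
    return inter / union if union > 0 else 0.0


def precision_at_iou(predictions, ground_truth, iou_threshold=0.5):
    """Fraction of predictions matched to a distinct ground-truth box.

    `predictions` is assumed to be sorted by descending confidence.
    """
    unmatched = list(ground_truth)
    true_positives = 0
    for box in predictions:
        best = max(unmatched, key=lambda gt: iou(box, gt), default=None)
        if best is not None and iou(box, best) >= iou_threshold:
            true_positives += 1
            unmatched.remove(best)  # each ground-truth box is used once
    return true_positives / len(predictions) if predictions else 0.0


# Hypothetical boxes: two good detections and one false positive.
truth = [(0, 0, 10, 10), (20, 20, 30, 30)]
preds = [(1, 1, 10, 10), (20, 20, 29, 31), (50, 50, 60, 60)]
precision = precision_at_iou(preds, truth)
print(round(precision, 4))
```

Mean average precision additionally sweeps the confidence threshold and averages precision over recall levels, which this sketch does not attempt.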