噪声环境下语音检测准确率偏低是短波通话面临的公开挑战。当前已有方法应用有限,其根源在于难以可靠地在噪音环境下提取准确且高效的语音特征。针对上述问题,提出了一个面向短波通信的低秩方向梯度直方图(Low-rank Histogram of Orient...噪声环境下语音检测准确率偏低是短波通话面临的公开挑战。当前已有方法应用有限,其根源在于难以可靠地在噪音环境下提取准确且高效的语音特征。针对上述问题,提出了一个面向短波通信的低秩方向梯度直方图(Low-rank Histogram of Oriented Gradient,LHOG)话音检测方法。首先,对目标音频源数据进行预处理,实现噪声环境下语音信息的可视化表征;然后,在HOG特征提取器中嵌入低秩化结构,缓解特征中的冗余信息,并降低噪声干扰,从而获得准确且高效的特征;最后,通过常用的SVM分类模型便可在噪声环境中准确快速地区分话音和噪声。测试结果表明,该方法的准确率达到了95.12%,误报率仅为0.96%,漏报率为13.14%。与现有主流方法的对比实验证明,该方法话音检测准确率高,资源占用少,能够有效提高短波通信侦控效率。展开更多
This article proposes a VGG network with histogram of oriented gradient(HOG) feature fusion(HOG-VGG) for polarization synthetic aperture radar(PolSAR) image terrain classification.VGG-Net has a strong ability of deep ...This article proposes a VGG network with histogram of oriented gradient(HOG) feature fusion(HOG-VGG) for polarization synthetic aperture radar(PolSAR) image terrain classification.VGG-Net has a strong ability of deep feature extraction,which can fully extract the global deep features of different terrains in PolSAR images,so it is widely used in PolSAR terrain classification.However,VGG-Net ignores the local edge & shape features,resulting in incomplete feature representation of the PolSAR terrains,as a consequence,the terrain classification accuracy is not promising.In fact,edge and shape features play an important role in PolSAR terrain classification.To solve this problem,a new VGG network with HOG feature fusion was specifically proposed for high-precision PolSAR terrain classification.HOG-VGG extracts both the global deep semantic features and the local edge & shape features of the PolSAR terrains,so the terrain feature representation completeness is greatly elevated.Moreover,HOG-VGG optimally fuses the global deep features and the local edge & shape features to achieve the best classification results.The superiority of HOG-VGG is verified on the Flevoland,San Francisco and Oberpfaffenhofen datasets.Experiments show that the proposed HOG-VGG achieves much better PolSAR terrain classification performance,with overall accuracies of 97.54%,94.63%,and 96.07%,respectively.展开更多
A solution is proposed for the real-time vehicle verification which is an important problem for numerous on- road vehicle applications. First, based on the vertical symmetry characteristics of vehicle images, a vertic...A solution is proposed for the real-time vehicle verification which is an important problem for numerous on- road vehicle applications. First, based on the vertical symmetry characteristics of vehicle images, a vertical symmetrical histograms of oriented gradients (VS-HOG) descriptor is proposed for extracting the image features. In the classification stage, an extreme learning machine (ELM) is used to improve the real-time performance. Experimental data demonstrate that, compared with other classical methods, the vehicle verification algorithm based on VS-HOG and ELM achieves a better trade-off between cost and performance. The computational cost is reduced by using the algorithm, while keeping the performance loss as low as possible. Furthermore, experimental results further show that the proposed vehicle verification method is suitable for on-road vehicle applications due to its better performance both in efficiency and accuracy.展开更多
行人检测在机器人、驾驶辅助系统和视频监控等领域有广泛的应用,该文提出一种基于显著性检测与方向梯度直方图-非负矩阵分解(Histogram of Oriented Gradient-Non-negative Matrix Factorization,HOG-NMF)特征的快速行人检测方法。采用...行人检测在机器人、驾驶辅助系统和视频监控等领域有广泛的应用,该文提出一种基于显著性检测与方向梯度直方图-非负矩阵分解(Histogram of Oriented Gradient-Non-negative Matrix Factorization,HOG-NMF)特征的快速行人检测方法。采用频谱调谐显著性检测提取显著图,并基于熵值门限进行感兴趣区域的提取;组合非负矩阵分解和方向梯度直方图生成HOG-NMF特征;采用加性交叉核支持向量机方法(Intersection Kernel Support Vector Machine,IKSVM)。该算法显著降低了特征维数,在相同的计算复杂度下明显改善了线性支持向量机的检测率。在INRIA数据库的实验结果表明,该方法对比HOG/线性SVM和HOG/RBF-SVM显著减少了检测时间,并达到了满意的检测率。展开更多
针对高速铁路接触网支撑结构中旋转双耳耳片断裂故障难以检测的问题,提出一种HOG(histogram of oriented gradients,梯度方向直方图)特征与二维Gabor小波相结合的图像检测方法。为实现旋转双耳在待检测图像中的定位,利用其正负样本的HO...针对高速铁路接触网支撑结构中旋转双耳耳片断裂故障难以检测的问题,提出一种HOG(histogram of oriented gradients,梯度方向直方图)特征与二维Gabor小波相结合的图像检测方法。为实现旋转双耳在待检测图像中的定位,利用其正负样本的HOG特征对线性SVM分类器进行训练,对检测窗口内是否包含旋转双耳进行判别。为实现耳片断裂故障的可靠诊断,利用二维Gabor小波变换能量值对图像中的边缘信息进行筛选,进而对耳片断裂故障引起的故障裂痕进行识别。实验结果表明,本文提出的方法能在复杂的接触网支撑与悬挂装置图像中准确识别发生耳片断裂故障的旋转双耳部件,检测结果不受拍摄距离、拍摄角度以及曝光度等因素的影响,具有较高的使用价值。展开更多
文摘噪声环境下语音检测准确率偏低是短波通话面临的公开挑战。当前已有方法应用有限,其根源在于难以可靠地在噪音环境下提取准确且高效的语音特征。针对上述问题,提出了一个面向短波通信的低秩方向梯度直方图(Low-rank Histogram of Oriented Gradient,LHOG)话音检测方法。首先,对目标音频源数据进行预处理,实现噪声环境下语音信息的可视化表征;然后,在HOG特征提取器中嵌入低秩化结构,缓解特征中的冗余信息,并降低噪声干扰,从而获得准确且高效的特征;最后,通过常用的SVM分类模型便可在噪声环境中准确快速地区分话音和噪声。测试结果表明,该方法的准确率达到了95.12%,误报率仅为0.96%,漏报率为13.14%。与现有主流方法的对比实验证明,该方法话音检测准确率高,资源占用少,能够有效提高短波通信侦控效率。
基金Sponsored by the Fundamental Research Funds for the Central Universities of China(Grant No.PA2023IISL0098)the Hefei Municipal Natural Science Foundation(Grant No.202201)+1 种基金the National Natural Science Foundation of China(Grant No.62071164)the Open Fund of Information Materials and Intelligent Sensing Laboratory of Anhui Province(Anhui University)(Grant No.IMIS202214 and IMIS202102)。
文摘This article proposes a VGG network with histogram of oriented gradient(HOG) feature fusion(HOG-VGG) for polarization synthetic aperture radar(PolSAR) image terrain classification.VGG-Net has a strong ability of deep feature extraction,which can fully extract the global deep features of different terrains in PolSAR images,so it is widely used in PolSAR terrain classification.However,VGG-Net ignores the local edge & shape features,resulting in incomplete feature representation of the PolSAR terrains,as a consequence,the terrain classification accuracy is not promising.In fact,edge and shape features play an important role in PolSAR terrain classification.To solve this problem,a new VGG network with HOG feature fusion was specifically proposed for high-precision PolSAR terrain classification.HOG-VGG extracts both the global deep semantic features and the local edge & shape features of the PolSAR terrains,so the terrain feature representation completeness is greatly elevated.Moreover,HOG-VGG optimally fuses the global deep features and the local edge & shape features to achieve the best classification results.The superiority of HOG-VGG is verified on the Flevoland,San Francisco and Oberpfaffenhofen datasets.Experiments show that the proposed HOG-VGG achieves much better PolSAR terrain classification performance,with overall accuracies of 97.54%,94.63%,and 96.07%,respectively.
基金The National Natural Science Foundation of China(No.61203237)the Natural Science Foundation of Zhejiang Province(No.LQ12F03016)the China Postdoctoral Science Foundation(No.2011M500836)
文摘A solution is proposed for the real-time vehicle verification which is an important problem for numerous on- road vehicle applications. First, based on the vertical symmetry characteristics of vehicle images, a vertical symmetrical histograms of oriented gradients (VS-HOG) descriptor is proposed for extracting the image features. In the classification stage, an extreme learning machine (ELM) is used to improve the real-time performance. Experimental data demonstrate that, compared with other classical methods, the vehicle verification algorithm based on VS-HOG and ELM achieves a better trade-off between cost and performance. The computational cost is reduced by using the algorithm, while keeping the performance loss as low as possible. Furthermore, experimental results further show that the proposed vehicle verification method is suitable for on-road vehicle applications due to its better performance both in efficiency and accuracy.
文摘行人检测在机器人、驾驶辅助系统和视频监控等领域有广泛的应用,该文提出一种基于显著性检测与方向梯度直方图-非负矩阵分解(Histogram of Oriented Gradient-Non-negative Matrix Factorization,HOG-NMF)特征的快速行人检测方法。采用频谱调谐显著性检测提取显著图,并基于熵值门限进行感兴趣区域的提取;组合非负矩阵分解和方向梯度直方图生成HOG-NMF特征;采用加性交叉核支持向量机方法(Intersection Kernel Support Vector Machine,IKSVM)。该算法显著降低了特征维数,在相同的计算复杂度下明显改善了线性支持向量机的检测率。在INRIA数据库的实验结果表明,该方法对比HOG/线性SVM和HOG/RBF-SVM显著减少了检测时间,并达到了满意的检测率。
文摘针对高速铁路接触网支撑结构中旋转双耳耳片断裂故障难以检测的问题,提出一种HOG(histogram of oriented gradients,梯度方向直方图)特征与二维Gabor小波相结合的图像检测方法。为实现旋转双耳在待检测图像中的定位,利用其正负样本的HOG特征对线性SVM分类器进行训练,对检测窗口内是否包含旋转双耳进行判别。为实现耳片断裂故障的可靠诊断,利用二维Gabor小波变换能量值对图像中的边缘信息进行筛选,进而对耳片断裂故障引起的故障裂痕进行识别。实验结果表明,本文提出的方法能在复杂的接触网支撑与悬挂装置图像中准确识别发生耳片断裂故障的旋转双耳部件,检测结果不受拍摄距离、拍摄角度以及曝光度等因素的影响,具有较高的使用价值。