Inspired by the coarse-to-fine visual perception process of the human vision system, a new approach based on Gaussian multi-scale space is proposed for defect detection in industrial products. By selecting different scale parameters of the Gaussian kernel, a multi-scale representation of the original image data is obtained and used to constitute a multivariate image, in which each channel represents a perceptual observation of the original image at a different scale. Multivariate Image Analysis (MIA) techniques are used to extract defect feature information. MIA is combined with Principal Component Analysis (PCA) to obtain the principal component scores of the multivariate test image. The Q-statistic image, derived from the residuals after the extraction of the first principal component score and noise, can be used to efficiently reveal surface defects with an appropriate threshold value determined from training images. Experimental results show that the proposed method performs better than the gray histogram-based method: it is less sensitive to inhomogeneous illumination and detects defects more robustly and reliably, with a lower pseudo reject rate.
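The pipeline described in this abstract (Gaussian multi-scale channels, PCA, Q-statistic residual image, threshold from training images) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names, the scale values, and the choice of the maximum training Q value as the threshold are all hypothetical, and only the first principal component is removed (the separate noise term mentioned in the abstract is not modelled).

```python
import numpy as np

def gaussian_kernel(sigma):
    """1-D Gaussian kernel truncated at 3*sigma, normalised to sum to 1."""
    radius = max(1, int(3 * sigma))
    x = np.arange(-radius, radius + 1)
    k = np.exp(-x**2 / (2.0 * sigma**2))
    return k / k.sum()

def gaussian_blur(img, sigma):
    """Separable Gaussian blur with reflect padding (along rows, then columns)."""
    k = gaussian_kernel(sigma)
    r = len(k) // 2
    padded = np.pad(img, r, mode="reflect")
    rows = np.apply_along_axis(lambda v: np.convolve(v, k, mode="valid"), 1, padded)
    return np.apply_along_axis(lambda v: np.convolve(v, k, mode="valid"), 0, rows)

def multiscale_stack(img, sigmas):
    """Multivariate image: one blurred channel per scale, shape (H, W, n_scales)."""
    return np.stack([gaussian_blur(img, s) for s in sigmas], axis=-1)

def fit_pca(train_img, sigmas, n_components=1):
    """Fit PCA on the pixels of a defect-free training image."""
    X = multiscale_stack(train_img, sigmas).reshape(-1, len(sigmas))
    mean = X.mean(axis=0)
    _, _, vt = np.linalg.svd(X - mean, full_matrices=False)
    return mean, vt[:n_components].T  # loadings: (n_scales, n_components)

def q_image(img, sigmas, mean, loadings):
    """Q-statistic per pixel: squared residual after removing the PCA model."""
    X = multiscale_stack(img, sigmas).reshape(-1, len(sigmas)) - mean
    residual = X - (X @ loadings) @ loadings.T
    return (residual**2).sum(axis=1).reshape(img.shape)

if __name__ == "__main__":
    # Synthetic data (assumption): uneven illumination plus a small bright defect.
    rng = np.random.default_rng(0)
    _, xx = np.mgrid[0:48, 0:48]
    train = xx / 48.0 + 0.02 * rng.standard_normal((48, 48))
    test = xx / 48.0 + 0.02 * rng.standard_normal((48, 48))
    test[19:22, 19:22] += 1.0
    sigmas = (1.0, 2.0, 4.0)
    mean, loadings = fit_pca(train, sigmas)
    threshold = q_image(train, sigmas, mean, loadings).max()
    defect_mask = q_image(test, sigmas, mean, loadings) > threshold
    print("flagged pixels:", int(defect_mask.sum()))
```

Because the illumination gradient is common to all scales, it is absorbed by the first principal component, while a small defect responds differently at each scale and so shows up in the residual.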
A technique for estimating tropical cyclone (TC) intensity over the Western North Pacific using FY-3 Microwave Imager (MWRI) data is developed. As a first step, we investigated the relationship between the FY-3 MWRI brightness temperature (TB) parameters, which are computed in concentric circles or annuli of different radii at different MWRI frequencies, and the TC maximum wind speed (Vmax) from the TC best track data. We found that the parameters of the lower-frequency channels (minimum TB, mean TB, and the ratio of pixels over the threshold TB) within a radius of 1.0 or 1.5 degrees from the center give higher correlations. Then, by applying principal component analysis (PCA) and multiple regression, we established an estimation model and evaluated it using independent verification data, obtaining an RMSE of 13 kt. The estimated Vmax is always stronger in the early stages of development but slightly weaker toward the mature stage, with the bias changing sign at around 70 kt. TCs with larger errors often have less organized and asymmetric cloud patterns, so classifying TC cloud patterns should help improve the accuracy of the estimated TC intensity, and accuracy should also improve as the number of statistical samples increases.
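The PCA-plus-multiple-regression step can be illustrated with a principal component regression sketch. Everything here is an assumption for illustration: the helper names, the standardisation of the predictors, and the number of retained components are not given in the abstract, and the demo data are synthetic rather than MWRI brightness temperature parameters.

```python
import numpy as np

def fit_pcr(X, y, n_components):
    """Principal component regression: standardise, project on PCs, then least squares."""
    mean, std = X.mean(axis=0), X.std(axis=0)
    Z = (X - mean) / std
    _, _, vt = np.linalg.svd(Z, full_matrices=False)
    W = vt[:n_components].T                      # PC loadings
    A = np.column_stack([np.ones(len(Z)), Z @ W])  # intercept + PC scores
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    return {"mean": mean, "std": std, "W": W, "coef": coef}

def predict_pcr(model, X):
    """Apply the stored standardisation, projection and regression coefficients."""
    Z = (X - model["mean"]) / model["std"]
    return model["coef"][0] + (Z @ model["W"]) @ model["coef"][1:]

def rmse(y_true, y_pred):
    """Root-mean-square error, the verification score quoted in the abstract."""
    return float(np.sqrt(np.mean((y_true - y_pred) ** 2)))

if __name__ == "__main__":
    # Synthetic stand-in for TB parameters (columns) and best-track Vmax (target).
    rng = np.random.default_rng(1)
    X = rng.standard_normal((300, 6))
    y = 60.0 + X @ np.array([8.0, -5.0, 3.0, 0.0, 0.0, 1.0]) + rng.standard_normal(300)
    train, verify = slice(0, 200), slice(200, 300)
    model = fit_pcr(X[train], y[train], n_components=4)
    print("verification RMSE:", rmse(y[verify], predict_pcr(model, X[verify])))
```

Evaluating on rows held out from fitting mirrors the independent verification data mentioned in the abstract.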
A comprehensive understanding of the spatial distribution and clustering patterns of gravels is of great significance for ecological restoration and monitoring. However, traditional methods for studying gravels are inefficient and error-prone. This study investigated the spatial distribution and cluster characteristics of gravels in the grassland of the northern Tibetan Plateau using digital image processing technology combined with a self-organizing map (SOM) and multivariate statistical methods. Moreover, the correlation of morphological parameters of gravels between different cluster groups and the environmental factors affecting gravel distribution were analyzed. The results showed that the morphological characteristics of gravels in the northern region (cluster C) and southern region (cluster B) of the Tibetan Plateau were similar, with low gravel coverage, small gravel diameter, and elongated shape. These regions were mainly distributed in high mountainous areas with large topographic relief. The central region (cluster A) has high gravel coverage with larger diameters and is mainly distributed in high-altitude plains with smaller undulation. Principal component analysis (PCA) results showed that the gravel distribution of cluster A may be mainly affected by vegetation, while those of clusters B and C could be mainly affected by topography, climate, and soil. The study confirmed that the combination of digital image processing technology and SOM could effectively analyze the spatial distribution characteristics of gravels, providing a new approach for gravel research.
Internet of Things systems generate a large amount of sensor data that needs to be analyzed to extract useful insights into the health status of the machine under consideration. Sensor data covering all possible states of a system are used to build machine learning models. These models are then used to predict possible downtime so that proactive action can be taken on the system condition. Run-to-failure aircraft engine data are used in the current study. The run-to-failure data include states such as new installation, stable operation, first reported issue, erroneous operation, and final failure. In the present work, the non-linear multivariate sensor data are used to understand the health status and anomalous behavior. The methodology uses different sampling sizes to obtain optimum results with high accuracy. The time series of each sensor is converted to a 2D image with a specific time window. The converted images represent the health of a system in a higher-dimensional space. The created images are fed to a Convolutional Neural Network, which captures both the time variation and the space variation of each sensed parameter. Using these images, a model for estimating the remaining life of the aircraft is developed. Further, the proposed network is also used to predict the number of engines that would fail in a given time window. The current methodology avoids the need to generate a health index when predicting the remaining useful life of industrial components. Better accuracy in the classification of components is achieved using the TimeImagenet-based approach.
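The conversion of sensor time series into 2D images over a time window can be sketched as below. The abstract does not specify the image encoding, so this sketch makes explicit assumptions: each image is a plain (sensors x window) array of min-max scaled readings, windows slide with a fixed stride, and the remaining-useful-life label is the number of time steps left after the window ends. The function name and parameters are hypothetical, and the CNN itself is not shown.

```python
import numpy as np

def series_to_images(series, window, stride=1):
    """Cut a (time, sensors) run-to-failure series into (sensors, window) images.

    Returns the image stack and, for each image, the number of time steps
    remaining until the end of the series (a simple remaining-life label).
    """
    series = np.asarray(series, dtype=float)
    lo, hi = series.min(axis=0), series.max(axis=0)
    span = np.where(hi > lo, hi - lo, 1.0)   # avoid division by zero for flat sensors
    scaled = (series - lo) / span             # per-sensor min-max scaling to [0, 1]
    starts = range(0, len(series) - window + 1, stride)
    images = np.stack([scaled[s:s + window].T for s in starts])
    remaining = np.array([len(series) - (s + window) for s in starts])
    return images, remaining

if __name__ == "__main__":
    # Synthetic run-to-failure record: 4 sensors drifting over 50 cycles.
    rng = np.random.default_rng(2)
    t = np.arange(50)[:, None]
    series = 0.01 * t * np.arange(1, 5) + 0.05 * rng.standard_normal((50, 4))
    images, remaining = series_to_images(series, window=10, stride=5)
    print(images.shape, remaining)
```

Each image keeps time along one axis and the sensor index along the other, which is one way a convolutional network could see both kinds of variation at once.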
Chemical imaging (CI) is a powerful tool for pharmaceutical analysis. Its strength lies in integrating traditional spectroscopy (one dimension) with imaging techniques (two dimensions) to generate three-dimensional data hypercubes. Data pre-processing and processing methods have been proposed to analyze these vast data matrices and thereby achieve different research objectives. In this review paper, pharmaceutical quality-control applications from the past few years are summarized in two groups: final product testing and industrial utilization. The scope of "quality control" here includes traditional analytical use, process understanding, and manufacturing control. Finally, two major challenges, undesirable sample geometry and lengthy acquisition time, are discussed with a view to prospective commercial or industrial application.
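With the continuous development of recycling technologies for urban mineral resources, used mobile phone recycling has become a current research hotspot. Limited by the relative scarcity of computing and data resources, the recognition accuracy for used mobile phones on offline intelligent recycling equipment is not yet sufficient for practical application. To address this problem, an image recognition model based on multi-feature heterogeneous ensemble deep learning is proposed. First, the character-level text detection algorithm CRAFT (character region awareness for text detection) is used to extract the character region on the back of the phone, and a VGG19 model pre-trained on ImageNet is used as the image feature embedding model, applying transfer learning to extract the local character features and global image features of the phone to be recycled. Then, the local features are used to build a neural-network-based optical character recognition (OCR) model, and the global and local features are used to build a non-neural-network deep forest classification (DFC) model. Finally, the results and vectors output by the heterogeneous OCR and DFC recognition models are combined and fed into Softmax for ensembling, and the final recognition result is obtained according to the maximum weight-vector score criterion. The effectiveness of the proposed method is verified on real images from used mobile phone recycling equipment.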
Tip-enhanced Raman spectroscopy (TERS) is a powerful surface analysis technique that can provide subnanometer-resolved images of nanostructures with site-specific chemical fingerprints. However, because Raman signals are weak and TERS imaging with good signal-to-noise ratios (SNRs) is therefore difficult to achieve, conventional single-peak analysis is unsuitable for distinguishing complex molecular architectures at the subnanometer scale. Here we demonstrate that the combination of subnanometer-resolved TERS imaging and advanced multivariate analysis can provide an unbiased panoramic view of the chemical identity and spatial distribution of different molecules on surfaces, yielding high-quality chemical images despite limited SNRs in individual pixel-level spectra. This methodology allows us to exploit the full power of TERS imaging and unambiguously distinguish between adjacent molecules with a resolution of ~0.4 nm, as well as to resolve submolecular features and differences in molecular adsorption configurations. Our results provide a promising methodology that promotes TERS imaging as a routine analytical technique for the analysis of complex nanostructures on surfaces.
A new multivariate statistical strategy for analyzing the large datasets produced by imaging mass spectrometry (IMS) techniques is reported. The strategy divides the whole datacube of the sample into several subsets and analyzes them one by one to obtain the results. Compared with analyzing the whole datacube at once, the strategy makes the analysis easier and greatly decreases the computation time. In this report, the IMS data are produced by air flow-assisted ionization IMS (AFAI-IMS). The strategy can be used in combination with most multivariate statistical analysis methods. In this paper, the strategy was combined with principal component analysis (PCA) and partial least squares (PLS) analysis. It was shown to be effective by analyzing a handwriting sample. Using the strategy, the m/z values corresponding to specific lipids in rat brain tissue were distinguished successfully. Moreover, the analysis time grew linearly instead of exponentially as the sample size increased. The strategy developed in this study has great potential for searching for the m/z values of potential biomarkers quickly and effectively.
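The divide-and-analyze idea can be sketched with PCA applied to one spatial tile of the datacube at a time. The abstract does not say how the subsets are formed, so the square spatial tiling, the function name, and the per-tile centering here are assumptions made only for illustration; PLS is not shown.

```python
import numpy as np

def tilewise_pca_scores(cube, tile, n_components=1):
    """Run PCA separately on each spatial tile of an (H, W, n_mz) datacube.

    Returns an (H, W, n_components) array of score images. Each tile is
    centred and decomposed on its own, so only a tile-sized matrix is ever
    passed to the SVD.
    """
    h, w, c = cube.shape
    scores = np.zeros((h, w, n_components))
    for r in range(0, h, tile):
        for col in range(0, w, tile):
            block = cube[r:r + tile, col:col + tile]
            X = block.reshape(-1, c)
            Xc = X - X.mean(axis=0)
            _, _, vt = np.linalg.svd(Xc, full_matrices=False)
            s = Xc @ vt[:n_components].T
            scores[r:r + tile, col:col + tile] = s.reshape(
                block.shape[0], block.shape[1], n_components
            )
    return scores

if __name__ == "__main__":
    # Synthetic datacube: 32 x 32 pixels, 50 m/z channels.
    rng = np.random.default_rng(3)
    cube = rng.random((32, 32, 50))
    print(tilewise_pca_scores(cube, tile=8).shape)
```

One caveat of this simple version is that the sign and orientation of the components are chosen independently in each tile, so score images from different tiles are not directly comparable without an extra alignment step.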
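Objective: A multiple linear regression model preserves the spectral information and spatial characteristics of the input independent variables while obtaining, through a linear transformation, the spectral fitting relationship between the independent and dependent variables. This optimizes the spectral information of the original input variables and yields reconstructed data with high spatial resolution and rich spectral information. Methods: Using OLI (Operational Land Imager) and PMS (Panchromatic and Multispectral Scanner) multispectral remote sensing images acquired in the same period, a multiple linear regression model was built by the least squares method to reconstruct a remote sensing image with rich spectral and spatial features, and the quality of the reconstructed image was evaluated both subjectively and objectively. Results: In visual interpretation (subjective), the reconstructed image retained the spectral characteristics of the original OLI image to a certain extent and improved the clarity and distinguishability of the original PMS image. In quantitative terms (objective), the information content and average gradient of the reconstructed image were lower than the information content (in some bands) and average gradient of the corresponding bands of the original OLI image, but higher than those of the original PMS image, so the quality of the reconstructed image lies between that of the original PMS and OLI images. Conclusions: Taking different crops within cultivated land in Menyuan Hui Autonomous County, Qinghai Province, as a case study, the maximum likelihood method was used to obtain the spatial distribution of highland barley and rapeseed in Menyuan County. Validation against field-measured data from the study area showed that the extraction accuracy for highland barley and rapeseed within cultivated land was higher with the reconstructed image than with the original PMS and OLI multispectral images.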
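The least-squares multiple linear regression used for the reconstruction can be sketched as follows. The abstract does not state which sensor supplies the independent variables, so this sketch assumes that the PMS bands are the predictors and that one OLI band, already resampled to the same pixel grid, is the response. The function names and the synthetic arrays are hypothetical.

```python
import numpy as np

def design_matrix(predictors):
    """Flatten an (H, W, n_bands) image into rows [1, band_1, ..., band_n]."""
    h, w, b = predictors.shape
    return np.column_stack([np.ones(h * w), predictors.reshape(-1, b)])

def fit_band_regression(predictors, target):
    """Least-squares fit of one target band on the predictor bands."""
    coef, *_ = np.linalg.lstsq(design_matrix(predictors), target.reshape(-1), rcond=None)
    return coef  # intercept followed by one weight per predictor band

def reconstruct(predictors, coef):
    """Apply the fitted linear model pixel by pixel to get the reconstructed band."""
    h, w, _ = predictors.shape
    return (design_matrix(predictors) @ coef).reshape(h, w)

if __name__ == "__main__":
    # Synthetic stand-ins: 4 predictor bands and a target that is linear in them.
    rng = np.random.default_rng(4)
    pms = rng.random((16, 16, 4))
    oli_band = 0.5 + pms @ np.array([0.2, 0.1, 0.4, 0.3]) + 0.01 * rng.standard_normal((16, 16))
    coef = fit_band_regression(pms, oli_band)
    print("fitted coefficients:", np.round(coef, 2))
    print("reconstructed shape:", reconstruct(pms, coef).shape)
```

Repeating the fit for each target band would give one coefficient vector per reconstructed band.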
Funding (Gaussian multi-scale defect detection study): Supported in part by the Natural Science Foundation of China (NSFC) (Grant No. 50875240).
Funding (FY-3 MWRI tropical cyclone intensity study): National Key Research and Development Program of China (2016YFA0600101); National Basic Research Program of China (973 Program, 2010CB950802); National Natural Science Fund (41605028).
Funding (Tibetan Plateau gravel distribution study): Funded by the National Natural Science Foundation of China (41971226, 41871357), the Major Research and Development and Achievement Transformation Projects of Qinghai, China (2022-QY-224), and the Strategic Priority Research Program of the Chinese Academy of Sciences (XDA28110502, XDA19030303).
Funding (used mobile phone recognition study): National Natural Science Foundation of China (82104738); Postdoctoral Science Foundation of China (2023M732337); Capacity Building of Local Colleges and Universities of Shanghai Science and Technology Commission (21010504400).
Funding (TERS imaging study): Supported by the National Natural Science Foundation of China, the National Basic Research Program of China, and the Strategic Priority Research Program of the Chinese Academy of Sciences, with support from the NSFC's 1000 Young Talents Recruitment Plan for Global Experts.
Funding (imaging mass spectrometry study): Supported by the National Instrumentation Programme (Nos. 2011YQ17006702 and 2011YQ14015010), the National Natural Science Foundation of China (Nos. 81102413 and 21175121), and the Fundamental Research Program of Shenzhen (No. JC201005280634A).