We introduce first a sort of gray-scale morphological dilations and erosions, which might have some further applications in image analysis. Then we show that the dilation and the erosion defined here form adjunctive p...We introduce first a sort of gray-scale morphological dilations and erosions, which might have some further applications in image analysis. Then we show that the dilation and the erosion defined here form adjunctive pairs. The duality between the dilation and the erosion and some other properties, such as homothety, of these operators are discussed the Commuting property with translation and as well.展开更多
A new cellular neural network (CNN) with nonlinear templates is presented forextracting convex corners of objects in gray-scale images. Application examples showed that the newCNN can even detect convex corner charact...A new cellular neural network (CNN) with nonlinear templates is presented forextracting convex corners of objects in gray-scale images. Application examples showed that the newCNN can even detect convex corner characteristics of objects in images with Gaussian noise.展开更多
An adaptive digital image watermark algorithm with strong robustness based on gray-scale morphology is proposed in this paper.The embedded strategies include:The algorithm seeks and extracts adaptively the image stron...An adaptive digital image watermark algorithm with strong robustness based on gray-scale morphology is proposed in this paper.The embedded strategies include:The algorithm seeks and extracts adaptively the image strong texture regions.The algorithm maps the image strong texture region to the wavelet tree structures, and embeds adaptively watermark into the wavelet coefficients corresponding to the image's strong texture regions.According to the visual masking features, the algorithm adjusts adaptively the watermark-embedding intensity.Experimental results show the algorithm is robust to compression, filtering, noise as well as strong shear attacks.The algorithm is blind watermark scheme.The image strong texture region extraction method based on morphology in this algorithm is simple and effective and adaptive to various images.展开更多
Rail fastener positioning method has high requirements for image quality and positioning accuracy.Therefore,a rail fastener positioning method based on gray mutation is proposed.Firstly,the image is denoised by an imp...Rail fastener positioning method has high requirements for image quality and positioning accuracy.Therefore,a rail fastener positioning method based on gray mutation is proposed.Firstly,the image is denoised by an improved median filter.Then,according to the characteristics of image frequency domain,the image is decomposed by wavelet transfrom to extract low-frequency component,and the low-frequency component is filtered and processed by gamma transformation to reduce the influence of natural environment factors on image quality.After that,according to the change rule of gray-scale values of different regions in the image,the gray-scale mutation in column and line directions of the image is statistically analyzed,and then the rail surface and the sleeper are located.Finally,the fastener area is accurately located by using the position relationship of the rail surface,the sleeper and the fastener in the image.The experimental results show that the positioning accuracy of the proposed method is 93.19%,which can quickly and effectively locate the fastener region,and has strong environmental adaptability,robustness and practicability.展开更多
This paper deals with a novel local arc length estimator for curves in gray-scale images.The method first estimates a cubic spline curve fit for the boundary points using the gray-level information of the nearby pixel...This paper deals with a novel local arc length estimator for curves in gray-scale images.The method first estimates a cubic spline curve fit for the boundary points using the gray-level information of the nearby pixels,and then computes the sum of the spline segments’lengths.In this model,the second derivatives and y coordinates at the knots are required in the computation;the spline polynomial coefficients need not be computed explicitly.We provide the algorithm pseudo code for estimation and preprocessing,both taking linear time.Implementation shows that the proposed model gains a smaller relative error than other state-of-the-art methods.展开更多
In this paper, a new quantum images encoding scheme is proposed. The proposed scheme mainly consists of four different encoding algorithms. The idea behind of the scheme is a binary key generated randomly for each pix...In this paper, a new quantum images encoding scheme is proposed. The proposed scheme mainly consists of four different encoding algorithms. The idea behind of the scheme is a binary key generated randomly for each pixel of the original image. Afterwards, the employed encoding algorithm is selected corresponding to the qubit pair of the generated randomized binary key. The security analysis of the proposed scheme proved its enhancement through both randomization of the generated binary image key and altering the gray-scale value of the image pixels using the qubits of randomized binary key. The simulation of the proposed scheme assures that the final encoded image could not be recognized visually. Moreover, the histogram diagram of encoded image is flatter than the originM one. The Shannon entropies of the final encoded images are significantly higher than the original one, which indicates that the attacker can not gain any information about the encoded images.展开更多
Using the high-speed camera the time sequences of the classical flow patterns of horizontal gas-liquid pipe flow are recorded, from which the average gray-scale values of single-frame images are extracted. Thus obtain...Using the high-speed camera the time sequences of the classical flow patterns of horizontal gas-liquid pipe flow are recorded, from which the average gray-scale values of single-frame images are extracted. Thus obtained gray-scale time series is decomposed by the Empirical Mode Decomposition (EMD) method, the various scales of the signals are processed by Hurst exponent method, and then the dual-fractal characteristics are obtained. The scattered bubble and the bubble cluster theories are applied to the evolution analysis of two-phase flow patterns. At the same time the various signals are checked in the chaotic recursion chart by which the two typical characteristics (diagonal average length and Shannon entropy) are obtained. Resulting term of these properties, the dynamic characteristics of gas-liquid two-phase flow patterns are quantitatively analyzed. The results show that the evolution paths of gas-liquid two-phase flow patterns can be well characterized by the integrated analysis on the basis of the gray-scale time series of flowing images from EMD, Hurst exponents and Recurrence Plot (RP). In the middle frequency section (2nd, 3rd, 4th scales), three flow patterns decomposed by the EMD exhibit dual fractal characteristics which represent the dynamic features of bubble cluster, single bubble, slug bubble and scattered bubble. According to the change of diagonal average lengths and recursive Shannon entropy characteristic value, the structure deterministic of the slug flow is better than the other two patterns. After the decomposition by EMD the slug flow and the mist flow in the high frequency section have obvious peaks. Anyway, it is an effective way to understand and characterize the dynamic characteristics of two-phase flow patterns using the multi-scale non-linear analysis method based on image gray-scale fluctuation signals.展开更多
Carpal tunnel syndrome(CTS) is a common peripheral entrapment neuropathy of the median nerve at wrist level, and is thought to be caused by compression of the median nerve in the carpal tunnel. There is no standard qu...Carpal tunnel syndrome(CTS) is a common peripheral entrapment neuropathy of the median nerve at wrist level, and is thought to be caused by compression of the median nerve in the carpal tunnel. There is no standard quantitative reference for the diagnosis of CTS. Greyscale sonography and sonoelastography(SEL) have been used as diagnostic tools. The most commonly agreed findings in grey-scale sonography for the diagnosis of CTS is enlargement of the median nerve cross-sectional area(CSA). Several authors have assessed additional parameters. "Delta CSA" is the difference between the proximal median nerve CSA at the pronator quadratus and the maximal CSA within the carpal tunnel. The "CSA ratio" is the ratio of CSA in the carpal tunnel to the CSA at the mid forearm. These additional parameters showed better diagnostic accuracy than CSA measurement alone. Recently, a number of studies have investigated the elasticity of the median nerve using SEL, and have shown that this also has diagnostic value, as it was significantly stiffer in CTS patients compared to healthy volunteers. In this review, we summarize the usefulness of grey-scale sonography and SEL in diagnosing CTS.展开更多
The S/N of an underwater image is low and has a fuzzy edge.If using traditional methods to process it directly,the result is not satisfying.Though the traditional fuzzy C-means algorithm can sometimes divide the image...The S/N of an underwater image is low and has a fuzzy edge.If using traditional methods to process it directly,the result is not satisfying.Though the traditional fuzzy C-means algorithm can sometimes divide the image into object and background,its time-consuming computation is often an obstacle.The mission of the vision system of an autonomous underwater vehicle (AUV) is to rapidly and exactly deal with the information about the object in a complex environment for the AUV to use the obtained result to execute the next task.So,by using the statistical characteristics of the gray image histogram,a fast and effective fuzzy C-means underwater image segmentation algorithm was presented.With the weighted histogram modifying the fuzzy membership,the above algorithm can not only cut down on a large amount of data processing and storage during the computation process compared with the traditional algorithm,so as to speed up the efficiency of the segmentation,but also improve the quality of underwater image segmentation.Finally,particle swarm optimization (PSO) described by the sine function was introduced to the algorithm mentioned above.It made up for the shortcomings that the FCM algorithm can not get the global optimal solution.Thus,on the one hand,it considers the global impact and achieves the local optimal solution,and on the other hand,further greatly increases the computing speed.Experimental results indicate that the novel algorithm can reach a better segmentation quality and the processing time of each image is reduced.They enhance efficiency and satisfy the requirements of a highly effective,real-time AUV.展开更多
Camera calibration is the key technique in a C-arm based orthopaedic surgical navigation system. The extraction of marker location information is a necessary step in the calibration process. Ideal marker images should...Camera calibration is the key technique in a C-arm based orthopaedic surgical navigation system. The extraction of marker location information is a necessary step in the calibration process. Ideal marker images should possess uniform background and contain marker shadow only, but in fact marker images always possess nonuniform background and are contaminated by noise and unwanted anatomic information, making the extraction very difficult. A target-orientated marker shadow extraction method was proposed. With this method a proper threshold for marker image binarization can be determined.展开更多
A novel learning method for multi-valued associative memory network is introduced, which is based on Hebb rule, but utilizes more information. According to the current probe vector, the connection weights matrix could...A novel learning method for multi-valued associative memory network is introduced, which is based on Hebb rule, but utilizes more information. According to the current probe vector, the connection weights matrix could be chosen dynamically. Double-valued and multi-valued associative memory are all realized in our simulation experiment. The experimental results show that the method could enhance the associative success rate.展开更多
The properties of feldspar and quartze are studied in this article from a fractal point of view using gray-scale micro-images of granite samples collected at the Fangshan (房山) granite body in Hebei (河北) Provin...The properties of feldspar and quartze are studied in this article from a fractal point of view using gray-scale micro-images of granite samples collected at the Fangshan (房山) granite body in Hebei (河北) Province, China, which can be regarded as an ideal granite in the sense of Vistelius. We found that there exist power-law relationships between the eigenvalues of the gray-scale matrices and their ranks for the feldspar and quartz. The fractal model used here is a λ-R model similar to the N-λ model proposed by Qiuming Cheng in 2005. Meanwhile, we found that average variances for the gray-scale matrices of feldspar are larger than those of quartz on the same sections, and this may be useful for auto-identification of feldspar and quartz as well as other minerals.展开更多
Improved gray-scale (IGS) quantization is a known method for re-quantizing digital gray-scale images for data compression while producing halftones by adding a level of randomness to improve visual quality of the resu...Improved gray-scale (IGS) quantization is a known method for re-quantizing digital gray-scale images for data compression while producing halftones by adding a level of randomness to improve visual quality of the resultant images. In this paper, first, analyzing the IGS quantizing operations reveals the capability of conserving a DC signal level of a source image through the quantization. Then, a complete procedure for producing a multi-level halftone image by IGS quantization that can achieve the DC conservation is presented. Also, the procedure uses the scanning of source pixels in an order such that geometric patterns can be prevented from occurring in the resulting halftone image. Next, the performance of the multi-level IGS halftoning is evaluated by experiments conducted on 8-bit gray-scale test images in comparison with the halftoning by error diffusion. The experimental result demonstrates that a signal level to be quantized in the IGS halftoning varies more randomly than that in the error diffusion halftoning, but not entirely randomly. Also, visual quality of the resulting halftone images was measured by subjective evaluations of viewers. The result indicates that for 3 or more-bit, in other words, 8 or more-level halftones, the IGS halftoning achieves image quality comparable to that by the error diffusion.展开更多
Everyone knows that thousand of words are represented by a single image. As a result, image search has become a very popular mechanism for the Web-searchers. Image search means, the search results are produced by the ...Everyone knows that thousand of words are represented by a single image. As a result, image search has become a very popular mechanism for the Web-searchers. Image search means, the search results are produced by the search engine should be a set of images along with their Web-page Unified Resource Locator (URL). Now Web-searcher can perform two types of image search, they are “Text to Image” and “Image to Image” search. In “Text to Image” search, search query should be a text. Based on the input text data, system will generate a set of images along with their Web-page URL as an output. On the other hand, in “Image to Image” search, search query should be an image and based on this image, system will generate a set of images along with their Web-page URL as an output. According to the current scenarios, “Text to Image” search mechanism always not returns perfect result. It matches the text data and then displays the corresponding images as an output, which is not always perfect. To resolve this problem, Web researchers have introduced the “Image to Image” search mechanism. In this paper, we have also proposed an alternate approach of “Image to Image” search mechanism using Histogram.展开更多
The explosive growth ofmalware variants poses a major threat to information security. Traditional anti-virus systems based on signatures fail to classify unknown malware into their corresponding families and to detect...The explosive growth ofmalware variants poses a major threat to information security. Traditional anti-virus systems based on signatures fail to classify unknown malware into their corresponding families and to detect new kinds of malware pro- grams. Therefore, we propose a machine learning based malware analysis system, which is composed of three modules: data processing, decision making, and new malware detection. The data processing module deals with gray-scale images, Opcode n-gram, and import fimctions, which are employed to extract the features of the malware. The decision-making module uses the features to classify the malware and to identify suspicious malware. Finally, the detection module uses the shared nearest neighbor (SNN) clustering algorithm to discover new malware families. Our approach is evaluated on more than 20 000 malware instances, which were collected by Kingsoft, ESET NOD32, and Anubis. The results show that our system can effectively classify the un- known malware with a best accuracy of 98.9%, and successfully detects 86.7% of the new malware.展开更多
基金Supported by the National Natural Science Foundation of China(11671293, 11271282)
文摘We introduce first a sort of gray-scale morphological dilations and erosions, which might have some further applications in image analysis. Then we show that the dilation and the erosion defined here form adjunctive pairs. The duality between the dilation and the erosion and some other properties, such as homothety, of these operators are discussed the Commuting property with translation and as well.
基金This project is jointly supported by the National Nature Science Foundation of China(Nos.60074034,70271068),the Research Fund for the Doctoral Program of Higher Education(No.20020008004)and the Foundation for University Key Teacher by the Ministry of Ed
文摘A new cellular neural network (CNN) with nonlinear templates is presented forextracting convex corners of objects in gray-scale images. Application examples showed that the newCNN can even detect convex corner characteristics of objects in images with Gaussian noise.
基金Supported by the Technology Key Project of Shanxi Province (2007K04-13)the Application Development and Research Project of Xi’an (YF07017)
文摘An adaptive digital image watermark algorithm with strong robustness based on gray-scale morphology is proposed in this paper.The embedded strategies include:The algorithm seeks and extracts adaptively the image strong texture regions.The algorithm maps the image strong texture region to the wavelet tree structures, and embeds adaptively watermark into the wavelet coefficients corresponding to the image's strong texture regions.According to the visual masking features, the algorithm adjusts adaptively the watermark-embedding intensity.Experimental results show the algorithm is robust to compression, filtering, noise as well as strong shear attacks.The algorithm is blind watermark scheme.The image strong texture region extraction method based on morphology in this algorithm is simple and effective and adaptive to various images.
基金National Natural Science Foundation of China(Nos.61616202,61461203)Ministry of Education Innovation Team Development Plan(No.IRT_16R36)Plateau Information Engineering and Control Key Practice Laboratory Open Project Fund of Gansu Province(No.201611105)。
文摘Rail fastener positioning method has high requirements for image quality and positioning accuracy.Therefore,a rail fastener positioning method based on gray mutation is proposed.Firstly,the image is denoised by an improved median filter.Then,according to the characteristics of image frequency domain,the image is decomposed by wavelet transfrom to extract low-frequency component,and the low-frequency component is filtered and processed by gamma transformation to reduce the influence of natural environment factors on image quality.After that,according to the change rule of gray-scale values of different regions in the image,the gray-scale mutation in column and line directions of the image is statistically analyzed,and then the rail surface and the sleeper are located.Finally,the fastener area is accurately located by using the position relationship of the rail surface,the sleeper and the fastener in the image.The experimental results show that the positioning accuracy of the proposed method is 93.19%,which can quickly and effectively locate the fastener region,and has strong environmental adaptability,robustness and practicability.
基金Project supported by the National Natural Science Foundationof China(Nos.61170092,61133011,61272208,61103091,and61202308)the Fundamental Research Funds for the CentralUniversities,China(Nos.450060445674 and 450060481512)
文摘This paper deals with a novel local arc length estimator for curves in gray-scale images.The method first estimates a cubic spline curve fit for the boundary points using the gray-level information of the nearby pixels,and then computes the sum of the spline segments’lengths.In this model,the second derivatives and y coordinates at the knots are required in the computation;the spline polynomial coefficients need not be computed explicitly.We provide the algorithm pseudo code for estimation and preprocessing,both taking linear time.Implementation shows that the proposed model gains a smaller relative error than other state-of-the-art methods.
基金Supported by Kermanshah Branch,Islamic Azad University,Kermanshah,IRAN
文摘In this paper, a new quantum images encoding scheme is proposed. The proposed scheme mainly consists of four different encoding algorithms. The idea behind of the scheme is a binary key generated randomly for each pixel of the original image. Afterwards, the employed encoding algorithm is selected corresponding to the qubit pair of the generated randomized binary key. The security analysis of the proposed scheme proved its enhancement through both randomization of the generated binary image key and altering the gray-scale value of the image pixels using the qubits of randomized binary key. The simulation of the proposed scheme assures that the final encoded image could not be recognized visually. Moreover, the histogram diagram of encoded image is flatter than the originM one. The Shannon entropies of the final encoded images are significantly higher than the original one, which indicates that the attacker can not gain any information about the encoded images.
基金Supported by the National Natural Science Foundation of China (50976018) the Natural Science Foundation of JilinProvince (20101562)
文摘Using the high-speed camera the time sequences of the classical flow patterns of horizontal gas-liquid pipe flow are recorded, from which the average gray-scale values of single-frame images are extracted. Thus obtained gray-scale time series is decomposed by the Empirical Mode Decomposition (EMD) method, the various scales of the signals are processed by Hurst exponent method, and then the dual-fractal characteristics are obtained. The scattered bubble and the bubble cluster theories are applied to the evolution analysis of two-phase flow patterns. At the same time the various signals are checked in the chaotic recursion chart by which the two typical characteristics (diagonal average length and Shannon entropy) are obtained. Resulting term of these properties, the dynamic characteristics of gas-liquid two-phase flow patterns are quantitatively analyzed. The results show that the evolution paths of gas-liquid two-phase flow patterns can be well characterized by the integrated analysis on the basis of the gray-scale time series of flowing images from EMD, Hurst exponents and Recurrence Plot (RP). In the middle frequency section (2nd, 3rd, 4th scales), three flow patterns decomposed by the EMD exhibit dual fractal characteristics which represent the dynamic features of bubble cluster, single bubble, slug bubble and scattered bubble. According to the change of diagonal average lengths and recursive Shannon entropy characteristic value, the structure deterministic of the slug flow is better than the other two patterns. After the decomposition by EMD the slug flow and the mist flow in the high frequency section have obvious peaks. Anyway, it is an effective way to understand and characterize the dynamic characteristics of two-phase flow patterns using the multi-scale non-linear analysis method based on image gray-scale fluctuation signals.
文摘Carpal tunnel syndrome(CTS) is a common peripheral entrapment neuropathy of the median nerve at wrist level, and is thought to be caused by compression of the median nerve in the carpal tunnel. There is no standard quantitative reference for the diagnosis of CTS. Greyscale sonography and sonoelastography(SEL) have been used as diagnostic tools. The most commonly agreed findings in grey-scale sonography for the diagnosis of CTS is enlargement of the median nerve cross-sectional area(CSA). Several authors have assessed additional parameters. "Delta CSA" is the difference between the proximal median nerve CSA at the pronator quadratus and the maximal CSA within the carpal tunnel. The "CSA ratio" is the ratio of CSA in the carpal tunnel to the CSA at the mid forearm. These additional parameters showed better diagnostic accuracy than CSA measurement alone. Recently, a number of studies have investigated the elasticity of the median nerve using SEL, and have shown that this also has diagnostic value, as it was significantly stiffer in CTS patients compared to healthy volunteers. In this review, we summarize the usefulness of grey-scale sonography and SEL in diagnosing CTS.
基金Supported by the National Natural Science Foundation of China under Grant No.50909025/E091002the Open Research Foundation of SKLab AUV, HEU under Grant No.2008003
文摘The S/N of an underwater image is low and has a fuzzy edge.If using traditional methods to process it directly,the result is not satisfying.Though the traditional fuzzy C-means algorithm can sometimes divide the image into object and background,its time-consuming computation is often an obstacle.The mission of the vision system of an autonomous underwater vehicle (AUV) is to rapidly and exactly deal with the information about the object in a complex environment for the AUV to use the obtained result to execute the next task.So,by using the statistical characteristics of the gray image histogram,a fast and effective fuzzy C-means underwater image segmentation algorithm was presented.With the weighted histogram modifying the fuzzy membership,the above algorithm can not only cut down on a large amount of data processing and storage during the computation process compared with the traditional algorithm,so as to speed up the efficiency of the segmentation,but also improve the quality of underwater image segmentation.Finally,particle swarm optimization (PSO) described by the sine function was introduced to the algorithm mentioned above.It made up for the shortcomings that the FCM algorithm can not get the global optimal solution.Thus,on the one hand,it considers the global impact and achieves the local optimal solution,and on the other hand,further greatly increases the computing speed.Experimental results indicate that the novel algorithm can reach a better segmentation quality and the processing time of each image is reduced.They enhance efficiency and satisfy the requirements of a highly effective,real-time AUV.
基金Project of Science and Technology Committee of Shanghai Municipality (No.2528(3))
文摘Camera calibration is the key technique in a C-arm based orthopaedic surgical navigation system. The extraction of marker location information is a necessary step in the calibration process. Ideal marker images should possess uniform background and contain marker shadow only, but in fact marker images always possess nonuniform background and are contaminated by noise and unwanted anatomic information, making the extraction very difficult. A target-orientated marker shadow extraction method was proposed. With this method a proper threshold for marker image binarization can be determined.
文摘A novel learning method for multi-valued associative memory network is introduced, which is based on Hebb rule, but utilizes more information. According to the current probe vector, the connection weights matrix could be chosen dynamically. Double-valued and multi-valued associative memory are all realized in our simulation experiment. The experimental results show that the method could enhance the associative success rate.
基金supported by the National Natural Science Foundation of China (Nos. 40373003, 40502029, 40525009, 40638041)State Key Laboratory of Geological Processes and Mineral Resources, China University of Geosciences (No. GPMR2007-23)
文摘The properties of feldspar and quartze are studied in this article from a fractal point of view using gray-scale micro-images of granite samples collected at the Fangshan (房山) granite body in Hebei (河北) Province, China, which can be regarded as an ideal granite in the sense of Vistelius. We found that there exist power-law relationships between the eigenvalues of the gray-scale matrices and their ranks for the feldspar and quartz. The fractal model used here is a λ-R model similar to the N-λ model proposed by Qiuming Cheng in 2005. Meanwhile, we found that average variances for the gray-scale matrices of feldspar are larger than those of quartz on the same sections, and this may be useful for auto-identification of feldspar and quartz as well as other minerals.
文摘Improved gray-scale (IGS) quantization is a known method for re-quantizing digital gray-scale images for data compression while producing halftones by adding a level of randomness to improve visual quality of the resultant images. In this paper, first, analyzing the IGS quantizing operations reveals the capability of conserving a DC signal level of a source image through the quantization. Then, a complete procedure for producing a multi-level halftone image by IGS quantization that can achieve the DC conservation is presented. Also, the procedure uses the scanning of source pixels in an order such that geometric patterns can be prevented from occurring in the resulting halftone image. Next, the performance of the multi-level IGS halftoning is evaluated by experiments conducted on 8-bit gray-scale test images in comparison with the halftoning by error diffusion. The experimental result demonstrates that a signal level to be quantized in the IGS halftoning varies more randomly than that in the error diffusion halftoning, but not entirely randomly. Also, visual quality of the resulting halftone images was measured by subjective evaluations of viewers. The result indicates that for 3 or more-bit, in other words, 8 or more-level halftones, the IGS halftoning achieves image quality comparable to that by the error diffusion.
文摘Everyone knows that thousand of words are represented by a single image. As a result, image search has become a very popular mechanism for the Web-searchers. Image search means, the search results are produced by the search engine should be a set of images along with their Web-page Unified Resource Locator (URL). Now Web-searcher can perform two types of image search, they are “Text to Image” and “Image to Image” search. In “Text to Image” search, search query should be a text. Based on the input text data, system will generate a set of images along with their Web-page URL as an output. On the other hand, in “Image to Image” search, search query should be an image and based on this image, system will generate a set of images along with their Web-page URL as an output. According to the current scenarios, “Text to Image” search mechanism always not returns perfect result. It matches the text data and then displays the corresponding images as an output, which is not always perfect. To resolve this problem, Web researchers have introduced the “Image to Image” search mechanism. In this paper, we have also proposed an alternate approach of “Image to Image” search mechanism using Histogram.
基金Project supported by the Natiooal Natural Science Foundation of China (No. 61303264) and the National Basic Research Program (973) of China (Nos. 2012CB315906 and 0800065111001)
文摘The explosive growth ofmalware variants poses a major threat to information security. Traditional anti-virus systems based on signatures fail to classify unknown malware into their corresponding families and to detect new kinds of malware pro- grams. Therefore, we propose a machine learning based malware analysis system, which is composed of three modules: data processing, decision making, and new malware detection. The data processing module deals with gray-scale images, Opcode n-gram, and import fimctions, which are employed to extract the features of the malware. The decision-making module uses the features to classify the malware and to identify suspicious malware. Finally, the detection module uses the shared nearest neighbor (SNN) clustering algorithm to discover new malware families. Our approach is evaluated on more than 20 000 malware instances, which were collected by Kingsoft, ESET NOD32, and Anubis. The results show that our system can effectively classify the un- known malware with a best accuracy of 98.9%, and successfully detects 86.7% of the new malware.