This study proposes two metrics using the nearest neighbors method to improve the accuracy of time-series forecasting. These two metrics can be treated as a hybrid forecasting approach to combine linear and non-linear...This study proposes two metrics using the nearest neighbors method to improve the accuracy of time-series forecasting. These two metrics can be treated as a hybrid forecasting approach to combine linear and non-linear forecasting techniques. One metric redefines the distance in k-nearest neighbors based on the coefficients of autoregression (AR) in time series. Meanwhile, an improvement to Kulesh's adaptive metrics in the nearest neighbors is also presented. To evaluate the performance of the two proposed metrics, three types of time-series data, namely deterministic synthetic data, chaotic time-series data and real time-series data, are predicted. Experimental results show the superiority of the proposed AR-enhanced k-nearest neighbors methods to the traditional k-nearest neighbors metric and Kulesh's adaptive metrics.展开更多
G-protein coupled receptors (GPCRs) are a class of seven-helix transmembrane proteins that have been used in bioinformatics as the targets to facilitate drug discovery for human diseases. Although thousands of GPCR ...G-protein coupled receptors (GPCRs) are a class of seven-helix transmembrane proteins that have been used in bioinformatics as the targets to facilitate drug discovery for human diseases. Although thousands of GPCR sequences have been collected, the ligand specificity of many GPCRs is still unknown and only one crystal structure of the rhodopsin-like family has been solved. Therefore, identifying GPCR types only from sequence data has become an important research issue. In this study, a novel technique for identifying GPCR types based on the weighted Levenshtein distance between two receptor sequences and the nearest neighbor method (NNM) is introduced, which can deal with receptor sequences with different lengths directly. In our experiments for classifying four classes (acetylcholine, adrenoceptor, dopamine, and serotonin) of the rhodopsin-like family of GPCRs, the error rates from the leave-one-out procedure and the leave-half-out procedure were 0.62% and 1.24%, respectively. These results are prior to those of the covariant discriminant algorithm, the support vector machine method, and the NNM with Euclidean distance.展开更多
The research was carried out on the territory of the Karelian Isthmus of the Leningrad Region using Sentinel-2B images and data from a network of ground sample plots. The ground sample plots are located in the studied...The research was carried out on the territory of the Karelian Isthmus of the Leningrad Region using Sentinel-2B images and data from a network of ground sample plots. The ground sample plots are located in the studied territory mainly in a regular manner, laid and surveyed according to the ICP-Forests methodology with some additions. The total area of the sample plots is a small part of the entire study area. One of the objectives of the study was to determine the possibility of using the k-NN (nearest neighbor method) to assess the state of forests throughout the whole studied territory by joint statistical processing of data from ground sample plots and Sentinel-2B imagery. The data of the ground-based sample plots were divided into 2 equal parts, one for the application of the k-NN method, the second for checking the results of the method application. The systematic error in determining the mean damage class of the tree stands on sample plots by the k-NN method turned out to be zero, the random error is equal to one point. These results offer a possibility to determine the state of the forest in the entire study area. The second objective of the study was to examine the possibility of using the short-wave vegetation index (SWVI) to assess the state of forests. As a result, a close statistically reliable dependence of the average score of the state of plantations and the value of the SWVI index was established, which makes it possible to use the established relationship to determine the state of forests throughout the studied territory. The joint use and statistical processing of remotely sensed data and ground-based test areas by the two studied methods make it possible to assess the state of forests throughout the large studied area within the image. The results obtained can be used to monitor the state of forests in large areas and design appropriate forestry protective measures.展开更多
Standard plenoptic camera can be used to capture multi-dimensional radiation information of high temperature luminous flame to reconstruct the temperature distribution. In this study, a novel method for reconstructing...Standard plenoptic camera can be used to capture multi-dimensional radiation information of high temperature luminous flame to reconstruct the temperature distribution. In this study, a novel method for reconstructing three-dimensional temperature field is proposed. This method is based on the optical tomography combined with standard plenoptic camera. The flame projection information from different planes is contained in one radiation image. In this model, we introduced the effective concept of the nearest neighbor method in the frequency domain to strip the interference of redundant information in the projection and to realize three-dimensional deconvolution. The flame emission intensity received by the pixels on the charge-coupled device sensor can be obtained according to the optical tomographic model. The temperature distributions of the axisymmetric and nonaxisymmetric flames can be reconstructed by solving the mathematical model with the nearest neighbor method. The numerical results show that three-dimensional temperature fields of high temperature luminous flames can be retrieved, proving the validity of the proposed method.展开更多
Mode tracking is required in the structural optimization when the frequencies of certain specified modes must be maintained within a suitable range.A simple tracking method employing the mode number is invalid or misl...Mode tracking is required in the structural optimization when the frequencies of certain specified modes must be maintained within a suitable range.A simple tracking method employing the mode number is invalid or misleading when local modes appear or disappear during mesh updating.In this work,a mode tracking scheme combining the nearest neighbor method(NNM)with the modal assurance criterion(MAC)is proposed.Several NNM algorithms are compared,and the k-dimensional tree(kd-tree)NNM is used to transform eigenvectors(mode shapes)from different scales to identical one.A threshold determination method is implemented for the MAC to assess the similarities in all the calculated modes.On the basis of the mode tracking scheme,specified modes can be tracked between different finite element method(FEM)models which have different meshes and optimized shapes.The effectiveness is verified through an example of shape optimization using an electric motor structure.展开更多
基金the National Natural Science Foundation of China(No.61203337)the Natural Science Foundation of Shanghai(No.12ZR1440200)
文摘This study proposes two metrics using the nearest neighbors method to improve the accuracy of time-series forecasting. These two metrics can be treated as a hybrid forecasting approach to combine linear and non-linear forecasting techniques. One metric redefines the distance in k-nearest neighbors based on the coefficients of autoregression (AR) in time series. Meanwhile, an improvement to Kulesh's adaptive metrics in the nearest neighbors is also presented. To evaluate the performance of the two proposed metrics, three types of time-series data, namely deterministic synthetic data, chaotic time-series data and real time-series data, are predicted. Experimental results show the superiority of the proposed AR-enhanced k-nearest neighbors methods to the traditional k-nearest neighbors metric and Kulesh's adaptive metrics.
基金supported by the Natural Science Foundation of Jiangsu Province(No.BK2004142)partly by the National Natural Science Foundation of China(No.60275007).
文摘G-protein coupled receptors (GPCRs) are a class of seven-helix transmembrane proteins that have been used in bioinformatics as the targets to facilitate drug discovery for human diseases. Although thousands of GPCR sequences have been collected, the ligand specificity of many GPCRs is still unknown and only one crystal structure of the rhodopsin-like family has been solved. Therefore, identifying GPCR types only from sequence data has become an important research issue. In this study, a novel technique for identifying GPCR types based on the weighted Levenshtein distance between two receptor sequences and the nearest neighbor method (NNM) is introduced, which can deal with receptor sequences with different lengths directly. In our experiments for classifying four classes (acetylcholine, adrenoceptor, dopamine, and serotonin) of the rhodopsin-like family of GPCRs, the error rates from the leave-one-out procedure and the leave-half-out procedure were 0.62% and 1.24%, respectively. These results are prior to those of the covariant discriminant algorithm, the support vector machine method, and the NNM with Euclidean distance.
文摘The research was carried out on the territory of the Karelian Isthmus of the Leningrad Region using Sentinel-2B images and data from a network of ground sample plots. The ground sample plots are located in the studied territory mainly in a regular manner, laid and surveyed according to the ICP-Forests methodology with some additions. The total area of the sample plots is a small part of the entire study area. One of the objectives of the study was to determine the possibility of using the k-NN (nearest neighbor method) to assess the state of forests throughout the whole studied territory by joint statistical processing of data from ground sample plots and Sentinel-2B imagery. The data of the ground-based sample plots were divided into 2 equal parts, one for the application of the k-NN method, the second for checking the results of the method application. The systematic error in determining the mean damage class of the tree stands on sample plots by the k-NN method turned out to be zero, the random error is equal to one point. These results offer a possibility to determine the state of the forest in the entire study area. The second objective of the study was to examine the possibility of using the short-wave vegetation index (SWVI) to assess the state of forests. As a result, a close statistically reliable dependence of the average score of the state of plantations and the value of the SWVI index was established, which makes it possible to use the established relationship to determine the state of forests throughout the studied territory. The joint use and statistical processing of remotely sensed data and ground-based test areas by the two studied methods make it possible to assess the state of forests throughout the large studied area within the image. The results obtained can be used to monitor the state of forests in large areas and design appropriate forestry protective measures.
基金supported by the National Natural Science Foundation of China (Grant No. 51976044)the National Science and Technology Major Project (Grant No. 2017-V-0016-0069)the Foundation of Heilongjiang Touyan Innovation Team Program。
文摘Standard plenoptic camera can be used to capture multi-dimensional radiation information of high temperature luminous flame to reconstruct the temperature distribution. In this study, a novel method for reconstructing three-dimensional temperature field is proposed. This method is based on the optical tomography combined with standard plenoptic camera. The flame projection information from different planes is contained in one radiation image. In this model, we introduced the effective concept of the nearest neighbor method in the frequency domain to strip the interference of redundant information in the projection and to realize three-dimensional deconvolution. The flame emission intensity received by the pixels on the charge-coupled device sensor can be obtained according to the optical tomographic model. The temperature distributions of the axisymmetric and nonaxisymmetric flames can be reconstructed by solving the mathematical model with the nearest neighbor method. The numerical results show that three-dimensional temperature fields of high temperature luminous flames can be retrieved, proving the validity of the proposed method.
基金the National Natural Science Foundation of China(No.51775336)the Shanghai Pujiang Program(No.17PJD019)
文摘Mode tracking is required in the structural optimization when the frequencies of certain specified modes must be maintained within a suitable range.A simple tracking method employing the mode number is invalid or misleading when local modes appear or disappear during mesh updating.In this work,a mode tracking scheme combining the nearest neighbor method(NNM)with the modal assurance criterion(MAC)is proposed.Several NNM algorithms are compared,and the k-dimensional tree(kd-tree)NNM is used to transform eigenvectors(mode shapes)from different scales to identical one.A threshold determination method is implemented for the MAC to assess the similarities in all the calculated modes.On the basis of the mode tracking scheme,specified modes can be tracked between different finite element method(FEM)models which have different meshes and optimized shapes.The effectiveness is verified through an example of shape optimization using an electric motor structure.