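Abstract: A method based on protein length information and deep convolutional neural network classification modeling (Length Information and Deep Convolutional Neural Networks, LIM-DCNN) is proposed for predicting protein secondary structure. The resulting 6-segment model achieves Q3 accuracies of 83.67%, 78.99%, 78.53%, 71.52%, and 85.94% on CASP9, CASP10, CASP11, CASP12, and CB513, respectively, which demonstrates the effectiveness of classification modeling by protein length; the CB513 result is also clearly better than those of many other classical prediction algorithms.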
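The Q3 figures quoted above are per-residue accuracies over three secondary-structure states. A minimal sketch of how such a score is commonly computed follows; the function name `q3_accuracy`, the H/E/C state labels, and the two example strings are illustrative assumptions, not taken from the paper.

```python
def q3_accuracy(true_ss: str, pred_ss: str) -> float:
    """Percentage of residues whose predicted 3-state label matches the true label.

    Assumes both strings use one character per residue,
    e.g. H (helix), E (strand), C (coil).
    """
    if len(true_ss) != len(pred_ss):
        raise ValueError("true and predicted sequences must have equal length")
    if not true_ss:
        raise ValueError("sequences must be non-empty")
    # Count the positions where the prediction agrees with the reference.
    correct = sum(t == p for t, p in zip(true_ss, pred_ss))
    return 100.0 * correct / len(true_ss)


# Hypothetical 10-residue example: 8 of 10 positions agree.
example_true = "HHHHCCEEEC"
example_pred = "HHHCCCEEEE"
example_q3 = q3_accuracy(example_true, example_pred)
print(f"Q3 = {example_q3:.2f}%")
```

Dataset-level scores such as those reported for CASP9 to CASP12 and CB513 are usually obtained by pooling residues over all test proteins; whether the paper pools residues or averages per protein is not stated in the abstract.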
Funding: The National Natural Science Foundation of China (No. 61105048, 60972165); the Doctoral Fund of Ministry of Education of China (No. 20110092120034); the Natural Science Foundation of Jiangsu Province (No. BK2010240); the Technology Foundation for Selected Overseas Chinese Scholar, Ministry of Human Resources and Social Security of China (No. 6722000008); the Open Fund of Jiangsu Province Key Laboratory for Remote Measuring and Control (No. YCCK201005).
Abstract: An improved Gaussian mixture model (GMM)-based clustering method is proposed for the difficult case where the true distribution of the data departs from the assumed GMM. First, an improved model selection criterion, the completed likelihood minimum message length criterion, is derived. It measures both the goodness-of-fit of the candidate GMM to the data and the goodness-of-partition of the data. Second, using the proposed criterion as the clustering objective function, an improved expectation-maximization (EM) algorithm is developed, which avoids the poor local optima that the standard EM algorithm can reach when estimating the model parameters. The experimental results demonstrate that the proposed method rectifies the over-fitting tendency of representative GMM-based clustering approaches and robustly provides more accurate clustering results.
Funding: Key Technologies R&D Program of Liaoning Province of China (No. 2003220026); Key Technologies R&D Program of Dandong, China (No. 06133).
Abstract: To study the test stability of the Advanced Fiber Information System (AFIS), card sliver produced in two experiments (12 plans in each experiment) was tested by AFIS. A statistical analysis of the test results yielded, for each test parameter, the number of tests needed to obtain a reliable result (hereinafter referred to as Reliable Test Times, RTT) and the coefficient of variation (CV%) of the 30 test results of each experiment plan. It is concluded that some parameters, such as length, seed coat nep (SCN) size, nep size, and immature fiber content (IFC), are very reliable after ten or more tests, whereas other parameters, such as SCN content, trash content, and visible foreign matter (VFM) content, are not reliable until they have been tested more than 100 times.
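The coefficient of variation used above is the standard deviation expressed as a percentage of the mean. A minimal sketch follows; the function name `cv_percent` and the sample readings are hypothetical, and the choice of the sample (n - 1) standard deviation is an assumption, since the abstract does not say which estimator was used.

```python
import statistics


def cv_percent(values: list[float]) -> float:
    """Coefficient of variation in percent: 100 * sample stdev / mean."""
    if len(values) < 2:
        raise ValueError("at least two measurements are required")
    mean = statistics.mean(values)
    if mean == 0:
        raise ValueError("CV% is undefined for a zero mean")
    # statistics.stdev uses the n - 1 (sample) denominator.
    return 100.0 * statistics.stdev(values) / mean


# Hypothetical repeated readings of one parameter (not data from the study).
readings = [10.0, 12.0, 11.0, 9.0, 8.0]
print(f"CV% = {cv_percent(readings):.2f}")
```

A parameter with a low CV% across repeated tests needs fewer repetitions to give a stable mean, which matches the contrast the abstract draws between size-type parameters and content-type parameters.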
Abstract: We believe that well-known principles and relationships between physical entities are the key to a deep knowledge of reality. However, they must be applied from innovative points of view, which can be provided by, for example, experts in information theory (as we are). Our previous efforts in this direction led to a fascinating result: the theoretical total mass of a spacetime in which Planck's length is an observer-independent scale of length is equal (both in expression and value) to the mass of our portion of the Universe as measured by the National Aeronautics and Space Administration Wilkinson Microwave Anisotropy Probe (NASA WMAP) spacecraft. In the following paragraphs we show how the granularity (i.e., discontinuity) of physical entities in our portion of observable spacetime descends directly from the principles of information theory, and how a physical theory of the discontinuity of reality built on this basis can lead to an elegant description of both microcosm and macrocosm.
Funding: Supported by the National Science Foundation of China (grant Nos. 41271024, 41444430204, and J1210065); the Fundamental Research Funds for the Central Universities (Nos. lzujbky-2016-266 and lzujbky2016-270).
Abstract: Glacier length is a key morphological element that has many glaciological applications; however, it is often difficult to determine, especially for glaciers that cover larger spatial areas or those that exhibit frequent temporal change. In this paper, we describe a new ArcGIS-based method that derives glacier flow lines for determining glacier length from a digital elevation model and glacier outlines. This method involves (1) extraction of the highest and lowest points on a glacier, (2) calculation of 10-m contour lines on the glacier from 10 m to 100 m height, and (3) connection of the midpoints of each contour line with the highest and the lowest points in order to create a flow line, which is subsequently smoothed. To assess the reliability of this method, we tested the algorithm's results against flow lines derived from field measurements, data from the Chinese Glacier Inventory, and manual interpretation. These data showed that the new automated method is effective in deriving glacier flow lines when contour lines are relatively large; in particular, when they are between 70 m and 100 m. Nonetheless, a key limitation of the algorithm is the requirement to automatically delete repeated and closed curves in the pre-treatment processes. In addition to calculating glacier flow lines for the derivation of glacier length, this method can also be used to effectively determine glacier terminus change.
Funding: Funded by the '908' Marine Survey Project of Shandong Province (SD-908-01-01-05.06).
Abstract: The amplified fragment length polymorphic DNA (AFLP) technique was adopted to estimate the population genetic polymorphism among 30 sporophytes of Laminaria japonica collected from a cultivating farm in Rongcheng, China. Three methods were used for genomic DNA extraction from Laminaria japonica sporophytes, and only the products obtained using the improved genomic DNA extraction kit method proved qualified for AFLP analysis. The parameters of the method were optimized: samples of forty milligrams and a cell lysis time of 120 min were suggested to replace the parameters recommended by the manufacturer. Thirty individuals of Laminaria japonica from the same cultivating site were investigated using one pair of selective primers. A total of 21 loci were obtained, and 17 of them were polymorphic. The mean percentage of polymorphic loci of this population was 80.95%. Nei's gene diversity (H) within this population was 0.3028, and the average Shannon's Information index (I) was 0.4498. A genetic distance matrix among different individuals was constructed as well. Through this study, an applicable AFLP genetic analysis working system for Laminaria japonica sporophytes was established. The results of this research also revealed a high level of genetic diversity within the studied population.
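The reported percentage of polymorphic loci follows directly from the locus counts in the abstract (17 polymorphic out of 21 scored). A small arithmetic check is sketched below; the function name `percent_polymorphic` is illustrative and not from the paper.

```python
def percent_polymorphic(polymorphic_loci: int, total_loci: int) -> float:
    """Percentage of scored loci that are polymorphic."""
    if total_loci <= 0:
        raise ValueError("total_loci must be positive")
    if not 0 <= polymorphic_loci <= total_loci:
        raise ValueError("polymorphic_loci must lie between 0 and total_loci")
    return 100.0 * polymorphic_loci / total_loci


# Counts taken from the abstract: 17 polymorphic loci out of 21.
ppl = percent_polymorphic(17, 21)
print(f"{ppl:.2f}%")  # prints "80.95%"
```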
Funding: Supported in part by the National Science Foundation of China under Grants No. 61303105 and 61402304; the Humanity & Social Science general project of Ministry of Education under Grants No. 14YJAZH046; the Beijing Natural Science Foundation under Grants No. 4154065; the Beijing Educational Committee Science and Technology Development Planned under Grants No. KM201410028017; Beijing Key Disciplines of Computer Application Technology.
Abstract: ESA is an unsupervised approach to word segmentation previously proposed by Wang; it is an iterative process consisting of three phases: Evaluation, Selection, and Adjustment. In this article, we propose ExESA, an extension of ESA. In ExESA, the original approach is extended to a 2-pass process, and the ratio of different word lengths is introduced as a third type of information, combined with cohesion and separation. A maximum strategy is adopted to determine the best segmentation of a character sequence in the Selection phase. In addition, in Adjustment, ExESA re-evaluates separation information and individual information to overcome the overestimation of frequencies, and a smoothing algorithm is applied to alleviate sparseness. The experimental results show that ExESA further improves performance and saves time by properly utilizing more information from un-annotated corpora. Moreover, the parameters of ExESA can be predicted by a set of empirical formulae or combined with the minimum description length principle.
Funding: National Basic Program of China (973 Program), No. 2012CB955800; National Natural Science Foundation of China, No. 41171438, No. 41401504.
Abstract: Based on the daily observation data of 824 meteorological stations during 1951-2010 released by the National Meteorological Information Center, this paper evaluated the changes in the heat and moisture conditions of crop growth. Ten-year average values were used to analyze the spatio-temporal variation in the agricultural hydrothermal conditions within a 1 km² grid. Next, the inter-annual trend was simulated by regression analysis of the agricultural hydrothermal conditions. The results showed that the contour lines for temperature and accumulated temperature (daily mean temperature ≥0°C) increased significantly in most parts of China, and that the temperature contour lines had all moved northwards over the past 60 years. At the same time, the annual precipitation showed a decreasing trend, though more than half of the meteorological stations did not pass the significance test. However, the mean temperatures in the hottest month and the coldest month exhibited a decreasing trend from 1951 to 2010. In addition, the 0°C contour line gradually moved from the Qinling Mountains and Huaihe River Basin to the Yellow River Basin. All these changes would have a significant impact on the distribution of crops and farming systems. Although the mechanisms by which interacting temperature and precipitation changes influence crops were complex and hard to distinguish, the fact remained that these changes would directly cause corresponding changes in crop characteristics.
Funding: Supported by the National Natural Science Foundation of China under Grant Nos. 70771118 and 70371030; the Scientific Research Foundation for the Returned Overseas Chinese Scholars, State Education Ministry under Grant No. 2006.331.
Abstract: A differential game (DG) model for a developing and a developed country is considered. Each player decides how much resource to use to restrict the opponent's development so as to maximize his weighted sum of current consumption and final output. Current consumption is assumed to be preferred to final output for both players. The developing country is assumed to have a higher economic growth rate and a higher preference for final output, whereas the developed country is assumed to have a higher initial income and a higher efficiency in restricting his opponent. This problem is investigated under three kinds of information structures, i.e., a zero-sum, a nonzero-sum, and a Stackelberg game. Open-loop equilibrium solutions are obtained for all three cases. Economic implications of the results are provided.