Almost all the cellular processes in a living system are controlled by proteins:They regulate gene expression,catalyze chemical reactions,transport small molecules across membranes,and transmit signal across membranes...Almost all the cellular processes in a living system are controlled by proteins:They regulate gene expression,catalyze chemical reactions,transport small molecules across membranes,and transmit signal across membranes.Even,a viral infection is often initiated through virus-host protein interactions.Protein-protein interactions(PPIs)are the physical contacts between two or more proteins and they represent complex biological functions.Nowadays,PPIs have been used to construct PPI networks to study complex pathways for revealing the functions of unknown proteins.Scientists have used PPIs to find the molecular basis of certain diseases and also some potential drug targets.In this review,we will discuss how PPI networks are essential to understand the molecular basis of virus-host relationships and several databases which are dedicated to virus-host interaction studies.Here,we present a short but comprehensive review on PPIs,including the experimental and computational methods of finding PPIs,the databases dedicated to virus-host PPIs,and the associated various applications in protein interaction networks of some lethal viruses with their hosts.展开更多
Considering that there is no single full reference image quality assessment method that could give the best performance in all situations, some multi-method fusion metrics were proposed. Machine learning techniques ar...Considering that there is no single full reference image quality assessment method that could give the best performance in all situations, some multi-method fusion metrics were proposed. Machine learning techniques are often involved in such multi-method fusion metrics so that its output would be more consistent with human visual perceptions. On the other hand, the robustness and generalization ability of these multi-method fusion metrics are questioned because of the scarce of images with mean opinion scores. In order to comprehensively validate whether or not the generalization ability of such multi-method fusion IQA metrics are satisfying, we construct a new image database which contains up to 60 reference images. The newly built image database is then used to test the generalization ability of different multi-method fusion IQA metrics. Cross database validation experiment indicates that in our new image database, the performances of all the multi-method fusion IQA metrics have no statistical significant different with some single-method IQA metrics such as FSIM and MAD. In the end, a thorough analysis is given to explain why the performance of multi-method fusion IQA framework drop significantly in cross database validation.展开更多
The database research method is a method that analyses, generalizes and deduces from the data of subject investigated with database techniques, quantitative statistics and mathematical models. As the big data age come...The database research method is a method that analyses, generalizes and deduces from the data of subject investigated with database techniques, quantitative statistics and mathematical models. As the big data age comes with the data explosion in modem society, the International Chinese Language Teaching (ICLT) shows signs of sizable data accumulation, remarkable economic property, strong modeling requirements and notable cross-research trends, which thus make this method necessary as a new and independent research method in the researches on this area. Theory bases, applicative areas, available software and data resources, research program designs, as well as their advantages and disadvantages will be figured out in this paper. In the near future, it will bring about a revolution to the international Chinese language teaching.展开更多
The Mg-Al-Zn-Y-Ce system is one of the key systems for designing high-strength Mg alloys. The purpose of the present article is to develop a thermodynamic database for the Mg-Al-Zn-Y-Ce multicomponent system to design...The Mg-Al-Zn-Y-Ce system is one of the key systems for designing high-strength Mg alloys. The purpose of the present article is to develop a thermodynamic database for the Mg-Al-Zn-Y-Ce multicomponent system to design Mg alloys using the calculation of phase diagrams (CALPHAD) method, where the Gibbs energies of solution phases such as liquid, fcc, bcc, and hcp phases were described by the subregular solution model, whereas those of all the compounds were described by the sublattice model. The thermodynamic parameters describing Gibbs energies of the different phases in this database were evaluated by fitting the experimental data for phase equilibria and thermodynamic properties. On the basis of this database, a lot of information concerning stable and metastable phase equilibria of isothermal and vertical sections, molar fractions of constituent phases, the liquidus projection, etc., can be predicted. This database is expected to play an important role in the design of Mg alloys.展开更多
We developed a GPU based single-pulse search pipeline(GSP)with a candidate-archiving database.Largely based upon the infrastructure of the open source PulsaR Exploration and Search Toolkit(PRESTO),GSP implements GPU a...We developed a GPU based single-pulse search pipeline(GSP)with a candidate-archiving database.Largely based upon the infrastructure of the open source PulsaR Exploration and Search Toolkit(PRESTO),GSP implements GPU acceleration of the de-dispersion and integrates a candidate-archiving database.We applied GSP to the data streams from the Commensal Radio Astronomy FAST Survey(CRAFTS),which resulted in quasi-real-time processing.The integrated candidate database facilitates synergistic usage of multiple machine-learning tools and thus improves efficient identification of radio pulsars such as rotating radio transients(RRATs)and fast radio bursts(FRBs).We first tested GSP on pilot CRAFTS observations with the FAST Ultra-Wide Band(UWB)receiver.GSP detected all pulsars known from the the Parkes multibeam pulsar survey in the corresponding sky area covered by the FAST-UWB.GSP also discovered 13 new pulsars.We measured the computational efficiency of GSP to be~120 times faster than the original PRESTO and~60 times faster than an MPI-parallelized version of PRESTO.展开更多
The searching method of spatial information on traditional geo-archives catalog database(TGCD) is based on the text,and the result of retrieval can be only from the text of fields of relational database.The informatio...The searching method of spatial information on traditional geo-archives catalog database(TGCD) is based on the text,and the result of retrieval can be only from the text of fields of relational database.The information queried must be input into the relational database as a text form in advance,otherwise,the visitors would not get any result from it.So。展开更多
With the development of electric power industry,the requirement of information sharing and application integration between each application system is salience.To realize the real "sharing information,data mainten...With the development of electric power industry,the requirement of information sharing and application integration between each application system is salience.To realize the real "sharing information,data maintenance uniform",and effectively eliminate "island of information",a standard,open information model of power system should be followed urgently by different systems,and a common data interface should be provided.The Common Information Model(CIM) proposed by standard of IEC-61970 solve the problem effectively.The characteristics of the CIM Model and relational database of power system are analyzed,a mapping method between CIM model based on standard of IEC61970 and relational database is proposed,and corresponding problems between object-oriented model and the relational model are solved flexiblely.展开更多
A large database is desired for machine learning(ML) technology to make accurate predictions of materials physicochemical properties based on their molecular structure.When a large database is not available,the develo...A large database is desired for machine learning(ML) technology to make accurate predictions of materials physicochemical properties based on their molecular structure.When a large database is not available,the development of proper featurization method based on physicochemical nature of target proprieties can improve the predictive power of ML models with a smaller database.In this work,we show that two new featurization methods,volume occupation spatial matrix and heat contribution spatial matrix,can improve the accuracy in predicting energetic materials' crystal density(ρ_(crystal)) and solid phase enthalpy of formation(H_(f,solid)) using a database containing 451 energetic molecules.Their mean absolute errors are reduced from 0.048 g/cm~3 and 24.67 kcal/mol to 0.035 g/cm~3 and 9.66 kcal/mol,respectively.By leave-one-out-cross-validation,the newly developed ML models can be used to determine the performance of most kinds of energetic materials except cubanes.Our ML models are applied to predict ρ_(crystal) and H_(f,solid) of CHON-based molecules of the 150 million sized PubChem database,and screened out 56 candidates with competitive detonation performance and reasonable chemical structures.With further improvement in future,spatial matrices have the potential of becoming multifunctional ML simulation tools that could provide even better predictions in wider fields of materials science.展开更多
The spherical indentation test has been successfully applied to inversely derive the tensile properties of small regions in a non-destructive way.Current inverse methods mainly rely on extensive iterative calculations...The spherical indentation test has been successfully applied to inversely derive the tensile properties of small regions in a non-destructive way.Current inverse methods mainly rely on extensive iterative calculations,which yield a considerable computational costs.In this paper,a database method is proposed to determine tensile flow properties from a single indentation force-depth curves to avoid iterative simulations.Firstly,a database that contain numerous indentation force-depth curves is established by inputting varied Ludwic material parameters into the indentation finite elements model.Secondly,for a given experimental indentation curve,a mean square error(MSE)is designated to evaluate the deviation between the experimental curve and each curve in the database.Finally,the true stresses at a series of plastic strain can be acquired by analyzing these deviations.To validate this new method,three different steels,i.e.A508,2.25Cr1 Mo and 316L are selected.Both simulated indentation curves and experimental indentation curves are used as inputs of the database to inversely acquire the flow properties.The result indicates that the pro-posed approach provides impressive accuracy when simulated indentation curves are used,but is less accurate when experimental curves are used.This new method can derive tensile properties in a much higher efficiency compared with traditional inverse method and are therefore more adaptive to engineering application.展开更多
The Xinjiang Astronomical Observatory Data Center faces issues related to delay-affected services. As a result, these services cannot be implemented in a timely manner due to the overloading of transmission links. In ...The Xinjiang Astronomical Observatory Data Center faces issues related to delay-affected services. As a result, these services cannot be implemented in a timely manner due to the overloading of transmission links. In this paper, the software-defined network technology is applied to the Xinjiang Astronomical Observatory Data Center Network(XAODCN). Specifically, a novel reconfiguration method is proposed to realise the software-defined Xinjiang Astronomical Observatory Data Center Network(SDXAO-DCN), and a network model is constructed. To overcome the congestion problem, a traffic load-balancing algorithm is designed for fast transmission of the service traffic by combining three factors: network structure, congestion level and transmission service. The proposed algorithm is compared with current commonly load-balancing algorithms which are used in data center to verify its efficiency. Simulation experiments show that the algorithm improved transmission performance and transmission quality for the SDXAO-DCN.展开更多
This paper deals with the efficiency of the search, with a method of organization and storage of the information that allows better results than the research trees or binary trees. No one ever dared to present better ...This paper deals with the efficiency of the search, with a method of organization and storage of the information that allows better results than the research trees or binary trees. No one ever dared to present better results than 0(log n) complexity, and when they wish to improve, they use balanced trees, but they continue to use principles that do not impact the pre-semantic information treatment. The Heru search method has as main characteristic the total or partial substitution of the use of the binary trees, enabling the elimination of the approximate results and informing the user the desired information instead of occurrences by sampling outside the desired information. The breakdown of the 0(log n) paradigm and the refinement of the searches are achieved with the use of a set of unpublished mathematical formulas and concepts called Infinite Series with Multiple Ratios.展开更多
The spatial prediction of the water table can be used for many applications related to civil works (foundations, excavations) and other urban and environmental management activities. Deterministic and geostatistical i...The spatial prediction of the water table can be used for many applications related to civil works (foundations, excavations) and other urban and environmental management activities. Deterministic and geostatistical interpolation methods were used to predict the spatial distribution of water table levels (unconfined aquifers) of important geological formations of the Joao Pessoa City (capital of Paraiba State, Brazil) with dense urban occupation and high demand for new civil works. The deterministic (topo to raster) and geostatistical (ordinary kriging) interpolation methods were evaluated using a Geographic Information System (GIS)-based investigation. The water table levels were obtained from 276 boring logs of Standard Penetration Test (SPT) in situ investigation distributed over the geological formations studied (an area of 59.8 km<sup>2</sup>, covering 40 districts of the Joao Pessoa City). The Nspt values and textural characterization data are stored for levels of 1 m depth. Some boreholes located in the area investigated were not included in the interpolation processes in order to be compared with estimated values (validation of the results). Maps of the water table depths were also produced to further analyze the quality of the water table surfaces interpolated by both methods. The phreatic surface interpolations provided satisfactory results for both methods (RMSE = 1.8 m). The topo to raster method showed a slight general tendency to be less affected by local values in relation to the kriging method and also has the advantage of integrating the drainage flow system, which is a relevant aspect for spatial models of the water table levels of unconfined aquifers. The ordinary kriging (geostatistical method) provides a prediction surface and some measure of the certainty or accuracy of the predictions.展开更多
基金National Natural Science Foundation of China,No.31971180 and No.11474013.
文摘Almost all the cellular processes in a living system are controlled by proteins:They regulate gene expression,catalyze chemical reactions,transport small molecules across membranes,and transmit signal across membranes.Even,a viral infection is often initiated through virus-host protein interactions.Protein-protein interactions(PPIs)are the physical contacts between two or more proteins and they represent complex biological functions.Nowadays,PPIs have been used to construct PPI networks to study complex pathways for revealing the functions of unknown proteins.Scientists have used PPIs to find the molecular basis of certain diseases and also some potential drug targets.In this review,we will discuss how PPI networks are essential to understand the molecular basis of virus-host relationships and several databases which are dedicated to virus-host interaction studies.Here,we present a short but comprehensive review on PPIs,including the experimental and computational methods of finding PPIs,the databases dedicated to virus-host PPIs,and the associated various applications in protein interaction networks of some lethal viruses with their hosts.
基金supported by “the Fundamental Research Funds for the Central Universities” No.2018CUCTJ081
文摘Considering that there is no single full reference image quality assessment method that could give the best performance in all situations, some multi-method fusion metrics were proposed. Machine learning techniques are often involved in such multi-method fusion metrics so that its output would be more consistent with human visual perceptions. On the other hand, the robustness and generalization ability of these multi-method fusion metrics are questioned because of the scarce of images with mean opinion scores. In order to comprehensively validate whether or not the generalization ability of such multi-method fusion IQA metrics are satisfying, we construct a new image database which contains up to 60 reference images. The newly built image database is then used to test the generalization ability of different multi-method fusion IQA metrics. Cross database validation experiment indicates that in our new image database, the performances of all the multi-method fusion IQA metrics have no statistical significant different with some single-method IQA metrics such as FSIM and MAD. In the end, a thorough analysis is given to explain why the performance of multi-method fusion IQA framework drop significantly in cross database validation.
文摘The database research method is a method that analyses, generalizes and deduces from the data of subject investigated with database techniques, quantitative statistics and mathematical models. As the big data age comes with the data explosion in modem society, the International Chinese Language Teaching (ICLT) shows signs of sizable data accumulation, remarkable economic property, strong modeling requirements and notable cross-research trends, which thus make this method necessary as a new and independent research method in the researches on this area. Theory bases, applicative areas, available software and data resources, research program designs, as well as their advantages and disadvantages will be figured out in this paper. In the near future, it will bring about a revolution to the international Chinese language teaching.
基金This study was financially supported by the National Natural Science Foundation of China and Chongqing Science and Technology Commission.
文摘The Mg-Al-Zn-Y-Ce system is one of the key systems for designing high-strength Mg alloys. The purpose of the present article is to develop a thermodynamic database for the Mg-Al-Zn-Y-Ce multicomponent system to design Mg alloys using the calculation of phase diagrams (CALPHAD) method, where the Gibbs energies of solution phases such as liquid, fcc, bcc, and hcp phases were described by the subregular solution model, whereas those of all the compounds were described by the sublattice model. The thermodynamic parameters describing Gibbs energies of the different phases in this database were evaluated by fitting the experimental data for phase equilibria and thermodynamic properties. On the basis of this database, a lot of information concerning stable and metastable phase equilibria of isothermal and vertical sections, molar fractions of constituent phases, the liquidus projection, etc., can be predicted. This database is expected to play an important role in the design of Mg alloys.
基金supported by the National Natural Science Foundation of China(NSFCGrant Nos.11988101,11725313,11690024,12041303,U1731238,U2031117,U1831131 and U1831207)+2 种基金supported by the Science and Technology Foundation of Guizhou Province(No.LKS[2010]38)support by the Youth Innovation Promotion Association CAS(id.2021055)cultivation project for FAST scientific payoff and research achievement of CAMS-CAS。
文摘We developed a GPU based single-pulse search pipeline(GSP)with a candidate-archiving database.Largely based upon the infrastructure of the open source PulsaR Exploration and Search Toolkit(PRESTO),GSP implements GPU acceleration of the de-dispersion and integrates a candidate-archiving database.We applied GSP to the data streams from the Commensal Radio Astronomy FAST Survey(CRAFTS),which resulted in quasi-real-time processing.The integrated candidate database facilitates synergistic usage of multiple machine-learning tools and thus improves efficient identification of radio pulsars such as rotating radio transients(RRATs)and fast radio bursts(FRBs).We first tested GSP on pilot CRAFTS observations with the FAST Ultra-Wide Band(UWB)receiver.GSP detected all pulsars known from the the Parkes multibeam pulsar survey in the corresponding sky area covered by the FAST-UWB.GSP also discovered 13 new pulsars.We measured the computational efficiency of GSP to be~120 times faster than the original PRESTO and~60 times faster than an MPI-parallelized version of PRESTO.
文摘The searching method of spatial information on traditional geo-archives catalog database(TGCD) is based on the text,and the result of retrieval can be only from the text of fields of relational database.The information queried must be input into the relational database as a text form in advance,otherwise,the visitors would not get any result from it.So。
文摘With the development of electric power industry,the requirement of information sharing and application integration between each application system is salience.To realize the real "sharing information,data maintenance uniform",and effectively eliminate "island of information",a standard,open information model of power system should be followed urgently by different systems,and a common data interface should be provided.The Common Information Model(CIM) proposed by standard of IEC-61970 solve the problem effectively.The characteristics of the CIM Model and relational database of power system are analyzed,a mapping method between CIM model based on standard of IEC61970 and relational database is proposed,and corresponding problems between object-oriented model and the relational model are solved flexiblely.
基金support from the Ministry of Education(MOE) Singapore Tier 1 (RG8/20)。
文摘A large database is desired for machine learning(ML) technology to make accurate predictions of materials physicochemical properties based on their molecular structure.When a large database is not available,the development of proper featurization method based on physicochemical nature of target proprieties can improve the predictive power of ML models with a smaller database.In this work,we show that two new featurization methods,volume occupation spatial matrix and heat contribution spatial matrix,can improve the accuracy in predicting energetic materials' crystal density(ρ_(crystal)) and solid phase enthalpy of formation(H_(f,solid)) using a database containing 451 energetic molecules.Their mean absolute errors are reduced from 0.048 g/cm~3 and 24.67 kcal/mol to 0.035 g/cm~3 and 9.66 kcal/mol,respectively.By leave-one-out-cross-validation,the newly developed ML models can be used to determine the performance of most kinds of energetic materials except cubanes.Our ML models are applied to predict ρ_(crystal) and H_(f,solid) of CHON-based molecules of the 150 million sized PubChem database,and screened out 56 candidates with competitive detonation performance and reasonable chemical structures.With further improvement in future,spatial matrices have the potential of becoming multifunctional ML simulation tools that could provide even better predictions in wider fields of materials science.
基金Supported by China Postdoctoral Science Foundation(Grant No.2019M661406).
文摘The spherical indentation test has been successfully applied to inversely derive the tensile properties of small regions in a non-destructive way.Current inverse methods mainly rely on extensive iterative calculations,which yield a considerable computational costs.In this paper,a database method is proposed to determine tensile flow properties from a single indentation force-depth curves to avoid iterative simulations.Firstly,a database that contain numerous indentation force-depth curves is established by inputting varied Ludwic material parameters into the indentation finite elements model.Secondly,for a given experimental indentation curve,a mean square error(MSE)is designated to evaluate the deviation between the experimental curve and each curve in the database.Finally,the true stresses at a series of plastic strain can be acquired by analyzing these deviations.To validate this new method,three different steels,i.e.A508,2.25Cr1 Mo and 316L are selected.Both simulated indentation curves and experimental indentation curves are used as inputs of the database to inversely acquire the flow properties.The result indicates that the pro-posed approach provides impressive accuracy when simulated indentation curves are used,but is less accurate when experimental curves are used.This new method can derive tensile properties in a much higher efficiency compared with traditional inverse method and are therefore more adaptive to engineering application.
基金supported by National Key R&D Program of China No.2021YFC2203502the National Natural Science Foundation of China (NSFC)(11803080,12173077,11873082,12003062)+2 种基金the Tianshan Innovation Team Plan of Xinjiang Uygur Autonomous Region (2022D14020)the Youth Innovation Promotion Association CASNational Key R&D Program of China No.2018 YFA0404704。
文摘The Xinjiang Astronomical Observatory Data Center faces issues related to delay-affected services. As a result, these services cannot be implemented in a timely manner due to the overloading of transmission links. In this paper, the software-defined network technology is applied to the Xinjiang Astronomical Observatory Data Center Network(XAODCN). Specifically, a novel reconfiguration method is proposed to realise the software-defined Xinjiang Astronomical Observatory Data Center Network(SDXAO-DCN), and a network model is constructed. To overcome the congestion problem, a traffic load-balancing algorithm is designed for fast transmission of the service traffic by combining three factors: network structure, congestion level and transmission service. The proposed algorithm is compared with current commonly load-balancing algorithms which are used in data center to verify its efficiency. Simulation experiments show that the algorithm improved transmission performance and transmission quality for the SDXAO-DCN.
文摘This paper deals with the efficiency of the search, with a method of organization and storage of the information that allows better results than the research trees or binary trees. No one ever dared to present better results than 0(log n) complexity, and when they wish to improve, they use balanced trees, but they continue to use principles that do not impact the pre-semantic information treatment. The Heru search method has as main characteristic the total or partial substitution of the use of the binary trees, enabling the elimination of the approximate results and informing the user the desired information instead of occurrences by sampling outside the desired information. The breakdown of the 0(log n) paradigm and the refinement of the searches are achieved with the use of a set of unpublished mathematical formulas and concepts called Infinite Series with Multiple Ratios.
文摘The spatial prediction of the water table can be used for many applications related to civil works (foundations, excavations) and other urban and environmental management activities. Deterministic and geostatistical interpolation methods were used to predict the spatial distribution of water table levels (unconfined aquifers) of important geological formations of the Joao Pessoa City (capital of Paraiba State, Brazil) with dense urban occupation and high demand for new civil works. The deterministic (topo to raster) and geostatistical (ordinary kriging) interpolation methods were evaluated using a Geographic Information System (GIS)-based investigation. The water table levels were obtained from 276 boring logs of Standard Penetration Test (SPT) in situ investigation distributed over the geological formations studied (an area of 59.8 km<sup>2</sup>, covering 40 districts of the Joao Pessoa City). The Nspt values and textural characterization data are stored for levels of 1 m depth. Some boreholes located in the area investigated were not included in the interpolation processes in order to be compared with estimated values (validation of the results). Maps of the water table depths were also produced to further analyze the quality of the water table surfaces interpolated by both methods. The phreatic surface interpolations provided satisfactory results for both methods (RMSE = 1.8 m). The topo to raster method showed a slight general tendency to be less affected by local values in relation to the kriging method and also has the advantage of integrating the drainage flow system, which is a relevant aspect for spatial models of the water table levels of unconfined aquifers. The ordinary kriging (geostatistical method) provides a prediction surface and some measure of the certainty or accuracy of the predictions.