Funding: Supported by the Incubation Foundation for Special Disciplines of the National Science Foundation of China (NSFC) (grant number: J0630966), the Chinese Research Network on Special Environment and Disaster (CRENSED) of the Ministry of Science and Technology of the People’s Republic of China (grant number: 1Z2005DKA10600), and the Knowledge Innovation Important Program of the Chinese Academy of Sciences (grant number: NF105-SDB-1-21).
Abstract: The Data Platform of Resource and Environment, whose data come mainly from field observation stations, spatial observations, and internet service institutions, is the basis of data analysis and model simulation in geoscience research in China. Within this integrated data platform, the data platform of field observation stations is principally responsible for data collection, management, assimilation, and sharing services. Taking into consideration the distributed nature of the data sources and the service objects, the authors formulated the framework of the field observation stations' data platform based on grid technology and designed its operating processes. The authors further defined and analyzed the key functions and implementation techniques of each module. On a Linux operating system, validation tests of the data platform's data replication, data synchronization, and unified data service functions were conducted in an environment that simulates field stations.
Abstract: The results of the rolling force and torque model based on the numerical solution of Orowan's differential equation do not fit industrial measurements very well. In particular, a quite large deviation was found in the torque model. After analyzing the shortcomings of the existing method, an improved rolling force and torque model algorithm aided by the Process Integrated Data Application System platform is proposed. Accordingly, the calculation accuracy of the rolling torque model is improved. The improved models, which are also based on Orowan's differential equation, are verified against 1,711,136 records from a data platform. Two coefficients, namely the friction factor and the forward slip, are recognized as the crucial factors to be determined from industrial measurements in order to improve accuracy. The proposed method is therefore a hybrid method that can be used to understand the rolling process in depth and to improve the model's accuracy by combining traditional plastic mechanics with data-driven global optimization algorithms. This paper proposes a new approach to studying theoretical rolling deformation models powered by an industrial data platform.
Abstract: Objective: To introduce the relevant big data platforms of FDA regulatory science and to provide a reference for the construction of a big data platform for China’s regulatory science under the “14th Five-Year Plan” to deepen the reform of the medical and health system. Methods: A comparative analysis of China’s big data for regulatory science was made after studying the development process, operation mode, practical significance, and characteristics of the big data platform for FDA regulatory science, with the aim of helping China establish a sound big database. Results and Conclusion: The construction of a big data platform for China’s regulatory science is less comprehensive than that in the United States. It is necessary to build data platforms in line with China’s national conditions through efforts in law, talent, standards, and other aspects.
Funding: National Scientific & Technology Basic Work Program of China (2007FY110300, 2011FY110400); National Natural Science Foundation of China (40801180).
Abstract: Northeast Asia is a key area for Earth system studies, global change frontier science research, and regional sustainable development research. It has a complex ecological environment, a variety of climatic zones, and typical human-Earth relationships. This paper outlines a data resources integration system that fulfills the data accumulation and management requirements of the Northeast Asia Resources and Environment Scientific Expedition. The data resources integration system has three subsystems: (i) a data resources collection and management standards and specifications system, (ii) a data classification system, and (iii) a data management and publication software platform. The standards and specifications system has 23 specifications, divided into three types: (i) data collection and processing specifications, (ii) data analysis and archiving specifications, and (iii) data management and sharing specifications. The data resources classification system has four classes, 25 subclasses, and 128 data elements. The data management and publication software platform has five functional modules: (i) data catalogue search, (ii) metadata management, (iii) data publication and virtualization, (iv) data view, and (v) data download. Based on the designed data integration system, a prototype system supported by computer and Web GIS technologies has been developed. So far, 144 datasets have been integrated into this data system. As more data are accumulated and integrated, it will play an important role in future scientific expedition data application and analysis.
Funding: Supported in part by the National Key Basic Research and Development (973) Program of China (Nos. 2013CB228206 and 2012CB315801) and the National Natural Science Foundation of China (Nos. 61233016 and 61140320), and supported by the Intel Research Council under the title "Security Vulnerability Analysis Based on Cloud Platform with Intel IA Architecture".
Abstract: China Unicom, the largest WCDMA 3G operator in China, faces the demands of the historic Mobile Internet Explosion, that is, the surge of Mobile Internet Traffic from mobile terminals. According to the internal statistics of China Unicom, mobile user traffic has increased rapidly, with a Compound Annual Growth Rate (CAGR) of 135%. Currently, China Unicom stores more than 2 trillion records per month, the data volume is over 525 TB, and the highest data volume has reached a peak of 5 PB. Since October 2009, China Unicom has been developing an in-house big data storage and analysis platform based on the open source Hadoop Distributed File System (HDFS), as it has a long-term strategy to make full use of this Big Data. All Mobile Internet Traffic is well served by this big data platform. Currently, the writing speed has reached 1,390,000 records per second, and the record retrieval time in a table that contains trillions of records is less than 100 ms. To take advantage of this opportunity to be a Big Data Operator, China Unicom has developed new functions and multiple innovations to solve the space and time constraints encountered in data processing. In this paper, we introduce our big data platform in detail. Based on this big data platform, China Unicom is building an industry ecosystem around Mobile Internet Big Data, and considers that a telecom-operator-centric ecosystem can be formed that is critical to prosperity in the modern communications business.
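The growth and throughput figures quoted in the China Unicom abstract can be put into perspective with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that a CAGR of 135% means the volume is multiplied by 2.35 each year and that the quoted write speed is sustained for a whole day, which the abstract does not claim.

```python
def project_volume(initial: float, cagr: float, years: int) -> float:
    """Volume after `years` of compound growth; cagr=1.35 means 135% per year."""
    return initial * (1.0 + cagr) ** years


def records_per_day(records_per_second: int) -> int:
    """Records written in 24 hours at a constant (assumed sustained) rate."""
    seconds_per_day = 24 * 60 * 60
    return records_per_second * seconds_per_day


if __name__ == "__main__":
    # Relative traffic after two years at a 135% CAGR, starting from 1.0.
    print(project_volume(1.0, 1.35, 2))
    # Daily record count if 1,390,000 records/s were sustained all day.
    print(records_per_day(1_390_000))
```

At that rate a full day would amount to roughly 120 billion records, so the quoted write speed is comfortably above what storing 2 trillion records per month requires on average.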
Abstract: The continuing expansion of connected and electro-mobility products and services has led to their ability to rapidly generate very large amounts of data, creating a demand for effective data management solutions. This demand is further catalysed by society's need to make informed policies and decisions that can properly support the emerging growth of these products and services. While data systems and platforms exist, they are often proprietary and compatible only with the products they are designed for. Given that the products and services generate energy and spatial-temporal data that can often be correlated, a lack of interoperability between these systems would impede decision making, as data from each system must be considered independently. By studying currently available data platforms and frameworks, this paper weighs the problems that these products address and identifies the gaps that must be filled for a more cohesive platform to exist. This is performed through a top-down approach, whereby broader vehicle-to-everything approaches are studied first, before moving to the components that could comprise a data platform to integrate and ingest these various data feeds. Finally, potential design considerations for a data platform are presented, along with examples of application bene.
Abstract: This paper studies interactive digital generalization, in which map generalization is divided into an intellective reasoning procedure and an operational procedure, carried out by the human and the computer, respectively. An interactive map generalization environment for large-scale topographic maps is then designed and realized. This research focuses on: ① the significance of researching an interactive map generalization environment, ② the features of large-scale topographic maps and interactive map generalization, ③ the construction of a map-generalization-oriented database platform.
Abstract: To solve problems in the quality control and improvement of coiled tubing steel strip production, such as scattered and inefficiently used production data, difficult analysis of performance fluctuation factors, complex multivariate statistical analysis, and inaccurate and difficult mechanical property prediction, an industrial data analysis platform for coiled tubing steel strip production has been preliminarily developed. As the premise and foundation of analysis, industrial data collection, storage, and utilization are realized using multiple big data technologies. With Django as the agile development framework, data visualization and comprehensive analyses are achieved. The platform's functions include overview survey, stability analysis, comprehensive analysis (such as exploratory data analysis, correlation analysis, and multivariate statistics), precise steel strength prediction, and skin-passing process recommendation. The platform supports production overview and prompt response, laying a foundation for an in-depth understanding of product characteristics and for improving the stability of product performance.
Abstract: A remote data monitoring system that adopts virtual instruments usually applies data sharing, acquisition, and remote transmission technology via the Internet. It can perform concurrent data acquisition and processing for multiple users and multiple tasks, and can also build a personalized virtual testing environment that serves more people with fewer instruments. In this paper, we elaborate on the design and implementation of an information sharing platform through a typical example of how to build a multi-user concurrent virtual testing environment based on the virtual instrument software LabVIEW.
Funding: Supported by NSF B55101680, NTIF B2090571, B2110140, SCUT x2rjD2116860, Y1080170, Y1090160, Y1100030, Y1100050, Y1110020 and S1010561121, G101056137.
Abstract: "Data Structure and Algorithm", an important major subject in computer science, presents many problems in teaching. This paper introduces and analyzes the current situation and problems of this course. A "programming factory" method is then put forward, which is in effect a practice-oriented platform for the teaching-study process. Good results have been obtained with this creative method.
Funding: National Key Research and Development Program of China (2021ZD0113704).
Abstract: In this paper, a variety of classical convolutional neural networks are trained on two different datasets using transfer learning. We demonstrate that, in addition to the optimization achieved through the model structure, the training dataset has a significant impact on the training results. However, the lack of open-source agricultural data, combined with the absence of a comprehensive open-source data sharing platform, remains a substantial obstacle. This issue is closely related to the difficulty and high cost of obtaining high-quality agricultural data, the low level of education of most employees, underdeveloped distributed training systems, and inadequate data security. To address these challenges, this paper proposes the novel idea of constructing an agricultural data sharing platform based on a federated learning (FL) framework, aiming to overcome the shortage of high-quality data for training in the agricultural field.
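The federated learning idea in the abstract above can be illustrated with a minimal aggregation sketch. The abstract does not say which aggregation rule the proposed platform would use; the code assumes the widely used federated averaging (FedAvg) rule, in which each participant trains locally and a server averages the resulting weights in proportion to each participant's sample count. The function name and the flat list-of-floats weight format are hypothetical simplifications.

```python
from typing import List, Sequence, Tuple


def federated_average(updates: Sequence[Tuple[List[float], int]]) -> List[float]:
    """Sample-weighted average of client weight vectors (FedAvg-style).

    Each update is (weights, num_samples). Raw data never leaves the client;
    only the locally trained weights are shared with the aggregator.
    """
    if not updates:
        raise ValueError("at least one client update is required")
    total_samples = sum(n for _, n in updates)
    if total_samples <= 0:
        raise ValueError("total sample count must be positive")

    dim = len(updates[0][0])
    averaged = [0.0] * dim
    for weights, num_samples in updates:
        if len(weights) != dim:
            raise ValueError("all clients must share the same model shape")
        for i, w in enumerate(weights):
            averaged[i] += w * num_samples / total_samples
    return averaged


if __name__ == "__main__":
    # Two hypothetical farms: one with 100 labelled images, one with 300.
    farm_a = ([1.0, 2.0], 100)
    farm_b = ([3.0, 4.0], 300)
    print(federated_average([farm_a, farm_b]))
```

Weighting by sample count lets a data-rich participant influence the shared model more than a small one while still keeping every participant's images on site.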
Abstract: Various code development platforms, such as the ATHENA Framework [1] of the ATLAS [2] experiment, encounter lengthy compilation and linking times. To remedy this situation, the IRIS Development Platform was built as a software development framework acting as compiler, cross-project linker, and data fetcher, which allows hot-swaps in order to compare various versions of the software under test. The flexibility fostered by IRIS allowed modular exchange of software libraries among developers, making it a powerful development tool. The IRIS platform used ROOT ntuples [3] as input data; however, a new data model is sought, in line with the facilities offered by IRIS. The schematic of a possible new data structure, a user-implemented object-oriented database, is presented.
Abstract: Nowadays, there is an abundance of Internet of Things middleware solutions that enable sensors and actuators to connect to the Internet. To gain widespread adoption, these solutions, referred to as platforms, have to meet the expectations of different players in the IoT ecosystem, including devices [1]. Low-cost devices, from handhelds to coffee machines, can easily connect wirelessly to the Internet, forming what is known as the Internet of Things (IoT). This research describes the methodology and the development process of creating an IoT platform. This paper also presents the architecture and implementation of the IoT platform. The goal of this research is to develop an analytics engine that can gather sensor data from different devices, extract meaningful information from IoT data, and act on it using machine learning algorithms. The proposed system introduces a messaging system to improve overall system performance and to provide easy scalability.
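The role of a messaging system between devices and an analytics engine, as described in the IoT abstract above, can be sketched with an in-process queue. This is only an illustration: the abstract does not name the messaging system the authors used, so Python's standard `queue.Queue` stands in for a real message broker, and the function name and the `(device_id, value)` message format are hypothetical.

```python
import queue
import threading
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple

Reading = Tuple[str, float]  # (device_id, sensor value); hypothetical format


def run_pipeline(readings: Iterable[Reading]) -> Dict[str, float]:
    """Publish readings to a queue and return the mean value per device."""
    messages: "queue.Queue" = queue.Queue()
    values: Dict[str, List[float]] = defaultdict(list)

    def consume() -> None:
        # The consumer is decoupled from producers: it only sees messages.
        while True:
            item = messages.get()
            if item is None:  # sentinel: no more messages
                break
            device_id, value = item
            values[device_id].append(value)

    consumer = threading.Thread(target=consume)
    consumer.start()
    for reading in readings:  # producers (devices) publish without waiting
        messages.put(reading)
    messages.put(None)
    consumer.join()
    return {device: sum(v) / len(v) for device, v in values.items()}


if __name__ == "__main__":
    sample = [("thermo-1", 21.0), ("thermo-1", 23.0), ("hygro-7", 40.0)]
    print(run_pipeline(sample))
```

Because producers only enqueue and never wait for analysis to finish, more consumers could be added behind the same queue, which is the kind of scalability benefit the abstract attributes to a messaging layer.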
Abstract: In view of problems such as frequent fluctuation of the garlic price, the lack of efficient forecasting methods, and the difficulty of achieving steady development of the garlic industry, and drawing on the current situation of the garlic industry and the collected data, this paper takes the Big Data platform of the garlic industry chain as its core and uses correlation analysis, a stationarity test, a co-integration test, and the Granger causality test to analyze the correlation, dynamics, and causality between the garlic price and the young garlic shoot price. According to the current situation of the garlic industry, a garlic industry service based on Big Data is put forward. It is concluded that there is a positive correlation between the garlic price and the young garlic shoot price, that there is a long-term stable dynamic equilibrium relationship between the young garlic shoot price and garlic price fluctuation, and that the young garlic shoot price can affect the garlic price. Finally, it is proposed to strengthen the infrastructure construction of garlic Big Data, increase the technological innovation and application of garlic Big Data technology, and strengthen the safety and security capabilities of the whole industry in order to promote the development of the garlic industry.
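The first step of the analysis described in the garlic price abstract, correlation analysis, can be sketched as follows. This is a minimal illustration and not the authors' code: it computes only the Pearson correlation coefficient, the price series are made-up placeholder numbers, and the stationarity, co-integration, and Granger causality tests mentioned in the abstract would normally be run with a dedicated statistics package rather than written by hand.

```python
import math
from typing import Sequence


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation coefficient between two equal-length series."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("series must have equal length of at least 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    covariance = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    if spread_x == 0 or spread_y == 0:
        raise ValueError("correlation is undefined for a constant series")
    return covariance / (spread_x * spread_y)


if __name__ == "__main__":
    # Hypothetical weekly prices, for illustration only (not from the paper).
    garlic_price = [4.2, 4.5, 4.9, 5.4, 5.1, 4.8]
    shoot_price = [3.0, 3.3, 3.5, 4.1, 3.9, 3.6]
    print(round(pearson(garlic_price, shoot_price), 3))
```

A coefficient close to +1 would be consistent with the positive correlation the abstract reports, although correlation alone says nothing about which price leads the other; that question is what the Granger causality test addresses.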