An ocean state monitor and analysis radar (OSMAR), developed by Wuhan University in China, has been installed at six stations along the coast of the East China Sea (ECS) to measure velocities (currents, waves, and winds) at the sea surface. Radar-observed surface current is taken as an example to illustrate the operational high-frequency (HF) radar observing and data service platform (OP), presenting an operational flow from data observation, transmission, processing, and visualization to end-user service. The platform's three layers (systems), namely the radar observing system (ROS), the data service system (DSS), and the visualization service system (VSS), are introduced together with the data flow within the platform. Surface velocities observed at the stations are synthesized at the radar data receiving and preprocessing center of the ROS and transmitted to the DSS, where data processing and quality control (QC) are conducted. Users can browse the processed data on the DSS portal and access the data files. The VSS aims to present the data products more clearly by displaying the information on a visual globe. Using the OP, the surface currents in the East China Sea are monitored, and their hourly and seasonal variability is investigated.
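The abstract above says that surface velocities observed at individual stations are synthesized at the preprocessing center. It does not describe the algorithm, so the following is only a minimal sketch of one standard way to combine measurements, assuming each station reports a radial speed along its look direction. The function name, the bearing convention (degrees clockwise from north), and the parallel-geometry threshold are illustrative assumptions, not details from the paper.

```python
import math


def combine_radials(r1, bearing1_deg, r2, bearing2_deg):
    """Return the (east, north) current vector implied by two radial speeds.

    Each radial speed is the component of the current along one station's
    look direction. Bearings are in degrees clockwise from north (assumed
    convention for this sketch).
    """
    # Unit vectors of the two look directions as (east, north) components.
    e1 = math.sin(math.radians(bearing1_deg))
    n1 = math.cos(math.radians(bearing1_deg))
    e2 = math.sin(math.radians(bearing2_deg))
    n2 = math.cos(math.radians(bearing2_deg))

    # Solve e1*u + n1*v = r1 and e2*u + n2*v = r2 with Cramer's rule.
    det = e1 * n2 - e2 * n1
    if abs(det) < 1e-6:  # assumed tolerance for near-parallel look directions
        raise ValueError("look directions are nearly parallel")
    u = (r1 * n2 - r2 * n1) / det
    v = (e1 * r2 - e2 * r1) / det
    return u, v


if __name__ == "__main__":
    # One station looks due east, the other due north.
    print(combine_radials(0.3, 90.0, 0.4, 0.0))
```

With more than two stations covering a grid cell, the same system would typically be solved in a least-squares sense rather than exactly.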
To extract structured data from a web page with customized requirements, a user labels some DOM elements on the page with attribute names. The common features of the labeled elements are used both to guide the user through the labeling process, minimizing user effort, and to retrieve attribute values. To turn the attribute values into a structured result, the attribute pattern needs to be induced. For this purpose, a space-optimized suffix tree called an attribute tree is built to transform the document object model (DOM) tree into a simpler form while preserving useful properties such as attribute sequence order. The pattern is induced bottom-up on the attribute tree and is then used to build the structured result. Experiments show that the approach performs well in terms of precision, recall, and structural correctness.
For China’s telecom industry, 2009 is destined to be an extraordinary year because of the arrival of the long-awaited mobile 3G era, which will have a significant impact on current work and lifestyles. 2009 will also be a year full of opportunities and challenges, because the coming 3G era will bring vast business opportunities and impose more challenges on Chinese telecom operators. The reshuffling of the Chinese telecom market has come to an end. The new China Unicom, China Mobile, and China Telecom all focus their strategies on broadband mobile data services in order to achieve a smooth transition from voice services to data services. Technologically, the various 3G technologies and their evolution are of great concern to telecom operators, while in terms of services, the key for 3G systems is their data services. As a result, high-speed broadband data services are entering an era of rapid development.
This paper introduces the application of data mining technology in data services. Data mining is a new technology, but it has been applied for a long time and its effects are evident. Data mining technology can be used in enterprise customer service systems, where it helps enterprises find potential customers while retaining their most valuable ones. Based on the characteristics and processes of intelligence analysis and services, we point out the necessity of establishing an intelligence analysis and service system built on data mining technology, and of applying data mining methods and knowledge management ideas to that system.
In the era of big data, national strategies and the rapid development of computer and storage technologies bring opportunities and challenges to libraries' data services. Based on a survey of the literature on scientific data services in university libraries in the United States, the development of scientific data services is analyzed in three respects: service types, service modes, and service contents. The author also sets out opportunities and challenges in five respects (policy support, stronger publicity, self-learning, self-positioning, and reliance on embedded subject librarians) in order to promote the development of library scientific data services.
As technology and the internet develop, more data are generated every day. These data are large in size, high in dimension, and complex in structure. The combination of these three features is “Big Data” [1]. Big data is revolutionizing all industries and having a colossal impact on them [2]. Many researchers have pointed out the huge impact that big data can have on our daily lives [3]. We can use the information we obtain to help us make decisions. In addition, the conclusions drawn from analyzed big data can be used to make predictions, helping us reach more accurate and beneficial decisions earlier than others. If we apply these techniques in finance, for example to stocks, we can obtain detailed information about them. Moreover, we can use the analyzed data to make predictions for particular stocks. Providing predicted data at a certain level of confidence can help people decide whether or not to buy a stock and help protect them from potential losses.
The Joint IOC/WMO Technical Commission for Oceanography and Marine Meteorology (JCOMM) has a strategy to establish a network of WMO-IOC Centres for Marine-meteorological and Oceanographic Climate Data (CMOCs) under the new Marine Climate Data System (MCDS) in 2012, in order to improve the quality and timeliness of the marine-meteorological and oceanographic data, metadata, and products available to end users. CMOC China, as a candidate centre, was approved to run on a trial basis after the 4th Meeting of JCOMM. This article sets out the development plans of CMOC China for the next few years through a brief introduction to its critical marine data, products, and service system and to its international cooperation projects.
With the growing number of sensors, applications, and customers, data providers and users are demanding a new geospatial data service model that offers low cost, high flexibility, and a comprehensive service. To meet these requirements, the 21AT TripleSat constellation terminal and data delivery and management system has been developed by a Beijing-based high-tech enterprise, Twenty First Century Aerospace Technology Co., Ltd. (21AT). The company is the first commercial Earth observation satellite operator and service provider in China. The new geospatial data service model allows the user to access multi-source satellite data directly, manage data orders, and carry out automatic large-scale data production and delivery. The solution also implements secure, hierarchical user management, statistical data analysis, and automatic information reports. In addition, a mobile application gives users easy access to system functions. The solution has already been installed and successfully applied at many customer sites in China and is now available globally to international clients interested in fast geospatial solutions. It supports customers’ operational services. Besides providing TripleSat Constellation images, the multi-source data access system also allows users to access other satellite data sources under customized agreements. This paper describes and discusses this new geospatial data service model.
To study the characteristics and market structure of data services and the role of the operator, this paper builds a commercial model by applying the theory of intermediaries and neoclassical economics. Data services have different economic characteristics from voice services. First, the production mode of data services is roundabout production; second, their driving force is economies of specialization; and finally, their management method is impersonal management. In the data service market, information asymmetry and barriers to entry indirectly determine transaction efficiency and the specialization level of service providers. Therefore, the operator should intervene in the market by offering trade services in order to promote the development of service providers. Because service providers differ in quality, the market structure of data services must be one in which a trade platform built by the operator and an intermediary platform built by the operator coexist.
Currently, ocean data portals are being developed around the world based on Geographic Information Systems (GIS) as a source of ocean data and information. However, given the relatively high temporal frequency and the intrinsic spatial nature of ocean data and information, no current GIS software is adequate to deal effectively and efficiently with spatiotemporal data. Furthermore, while existing ocean data portals are generally designed to meet the basic needs of a broad range of users, they are sometimes very complicated for general audiences, especially for those without training in GIS. In this paper, a new technical architecture for an ocean data integration and service system is put forward that consists of four layers: the operation layer; the extract, transform, and load (ETL) layer; the data warehouse layer; and the presentation layer. The integration technology for the data warehouse layer, based on XML, ontology, and a spatiotemporal data organization scheme, is then discussed. In addition, the ocean observing data service technology realized in the presentation layer is discussed in detail, including the development of the web portal and the ocean data sharing platform. The application to the Taiwan Strait shows that the technology studied in this paper can facilitate the sharing, access, and use of ocean observation data. The paper is based on an ongoing research project to develop an ocean observing information system for the Taiwan Strait that will facilitate the prevention of ocean disasters.
Massive ocean data acquired by various observing platforms and sensors pose new challenges to data management and utilization. Typically, it is difficult to find the desired data in a large number of datasets efficiently and effectively. Most existing methods for data discovery are based on keyword retrieval or direct semantic reasoning, and they are either limited in data access rate or do not take time cost into account. In this paper, we design and implement a novel system, referred to as Data Ontology and List-Based Publishing (DOLP), that alleviates the problem by introducing semantics with ontologies. Specifically, we improve ocean data services in three respects. First, we propose a unified semantic model called OEDO (Ocean Environmental Data Ontology) to represent heterogeneous ocean data by metadata so that they can be published as data services. Second, we propose an optimized quick service query list (QSQL) data structure for storing the pre-inferred, semantically related services and reducing the service querying time. Third, we propose two algorithms for optimizing the QSQL hierarchically and horizontally, respectively, which aim to extend the semantic relationships of the data services and improve the data access rate. Experimental results show that DOLP outperforms the benchmark methods. First, our QSQL-based data discovery methods obtain a higher recall rate than the keyword-based method and are faster than the traditional semantic method based on direct reasoning. Second, DOLP can handle more complex semantic relationships than the existing methods.
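The idea of storing pre-inferred, semantically related services so that queries avoid reasoning at lookup time can be illustrated with a small sketch. This is not the paper's QSQL structure or its optimization algorithms. It only assumes a toy subclass hierarchy and precomputes, for each concept, the services annotated with that concept or any of its sub-concepts. All concept names, service names, and function names are hypothetical.

```python
from collections import defaultdict


def build_query_list(subclass_of, services_by_concept):
    """Precompute concept -> services reachable through sub-concepts.

    subclass_of maps a child concept to its parent concept.
    services_by_concept maps a concept to the services annotated with it.
    """
    children = defaultdict(set)
    for child, parent in subclass_of.items():
        children[parent].add(child)

    def descendants(concept, seen):
        # Depth-first walk over the (assumed acyclic) subclass hierarchy.
        for child in children[concept]:
            if child not in seen:
                seen.add(child)
                descendants(child, seen)
        return seen

    concepts = set(subclass_of) | set(subclass_of.values()) | set(services_by_concept)
    query_list = {}
    for concept in concepts:
        related = {concept} | descendants(concept, set())
        query_list[concept] = sorted(
            {s for c in related for s in services_by_concept.get(c, [])}
        )
    return query_list


if __name__ == "__main__":
    hierarchy = {
        "SeaSurfaceTemperature": "Temperature",
        "Temperature": "PhysicalParameter",
    }
    services = {
        "SeaSurfaceTemperature": ["sst_daily"],
        "Temperature": ["temp_profiles"],
    }
    qsl = build_query_list(hierarchy, services)
    # A query is now a dictionary lookup rather than a reasoning step.
    print(qsl["PhysicalParameter"])
```

The trade-off in this kind of design is the usual one: inference cost moves from query time to build time, and the list must be rebuilt or patched when the ontology or the service annotations change.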
With the growing popularity of data-intensive services on the Internet, the traditional process-centric model for business processes faces challenges because it cannot describe data semantics and dependencies, which makes process design and implementation inflexible. This paper proposes a novel data-aware business process model that can describe both explicit control flow and implicit data flow. A data model with dependencies formulated in Linear-time Temporal Logic (LTL) is presented, and their satisfiability is validated by an automaton-based model checking algorithm. Data dependencies are fully considered in the modeling phase, which helps to improve the efficiency and reliability of programming during the development phase. Finally, a prototype system for data-aware workflow, based on jBPM, is designed using this model and has been deployed in the Beijing Kingfore heating management system to validate the flexibility, efficacy, and convenience of the approach for large-scale coding and system management in practice.
Fengyun meteorological satellites have undergone a series of significant developments over the past 50 years. Two generations, four types, and 21 Fengyun satellites have been developed and launched, with 9 currently operational in orbit. The data obtained from Fengyun satellites are employed in a multitude of applications, including weather forecasting, meteorological disaster prevention and reduction, climate change, global environmental monitoring, and space weather. These data products and services are made available to the global community, resulting in tangible social and economic benefits. In 2023, two Fengyun meteorological satellites were successfully launched. This report presents an overview of the two recently launched Fengyun satellites and the Fengyun satellites currently in orbit, including an evaluation of their remote sensing instruments since 2022. Additionally, it addresses Fengyun satellite data archiving, data services, application services, international cooperation, and supporting activities. Finally, the development prospects are outlined.
In existing web services-based workflows, data exchange across the web services is centralized: the workflow engine acts as an intermediary at each step of the application sequence. However, many grid applications, especially data-intensive scientific applications, require exchanging large amounts of data across grid services. Having a central workflow engine relay the data between the services would result in a bottleneck in these cases. This paper proposes a data exchange model for individual grid workflows and for the composition of multiple workflows. The model enables direct communication of large amounts of data between two grid services. To enable data exchange among multiple workflows, a bridge data service is used.
As of this year, the World Data Center (WDC) for Seismology, Beijing, has been developing in China for 20 years. A sustained and stable data sharing service system has already taken shape. This article gives an overview of the construction and development of the WDC for Seismology, Beijing. It outlines the history, facilities, and technical specifications of the center. It also describes the data service and the website, and gives a brief outlook.
In the wastewater treatment process (WWTP), accurate real-time monitoring of key variables is crucial for operational strategies. However, most existing methods have difficulty obtaining real-time values of some key process variables. To handle this issue, a data-driven intelligent monitoring system, using the soft sensor technique and a data distribution service, is developed to monitor the concentrations of effluent total phosphorus (TP) and ammonia nitrogen (NH4-N). In this intelligent monitoring system, a fuzzy neural network (FNN) is applied to design the soft sensor model, and a principal component analysis (PCA) method is used to select the input variables of the soft sensor model. Moreover, data transfer software is used to integrate the soft sensor into the supervisory control and data acquisition (SCADA) system. Finally, the proposed intelligent monitoring system is tested in several real plants to demonstrate the reliability and effectiveness of its monitoring performance.
Purpose: This paper concerns the definition of data quality procedures for knowledge organizations such as Higher Education Institutions. The main purpose is to present the flexible approach developed for monitoring the data quality of the European Tertiary Education Register (ETER) database, illustrating how it works and highlighting the main challenges that still have to be faced in this domain. Design/methodology/approach: The proposed data quality methodology is based on two kinds of checks, one to assess the consistency of cross-sectional data and the other to evaluate the stability of multiannual data. This methodology has an operational and empirical orientation. This means that the proposed checks do not assume any theoretical distribution for determining the threshold parameters that identify potential outliers, inconsistencies, and errors in the data. Findings: We show that the proposed cross-sectional checks and multiannual checks are helpful for identifying outliers and extreme observations and for detecting ontological inconsistencies not described in the available metadata. For this reason, they may be a useful complement to the processing of the available information. Research limitations: The coverage of the study is limited to European Higher Education Institutions. The cross-sectional and multiannual checks are not yet completely integrated. Practical implications: Considering the quality of the available data and information is important for enhancing data quality-aware empirical investigations and for highlighting problems and areas in which to invest to improve the coverage and interoperability of data in future data collection initiatives. Originality/value: The data-driven quality checks proposed in this paper may be useful as a reference for building and monitoring the data quality of new databases, or of existing databases available for other countries or systems characterized by high heterogeneity and complexity of the units of analysis, without relying on pre-specified theoretical distributions.
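The two kinds of checks described in this abstract, a cross-sectional one and a multiannual one with thresholds that do not rely on a theoretical distribution, can be sketched as follows. This is not the ETER procedure. The median/MAD rule, the relative-change rule, the default thresholds, and the function names are all assumptions chosen only to show what empirically set, distribution-free thresholds might look like.

```python
import statistics


def cross_sectional_flags(values, k=3.0):
    """Flag units whose value lies more than k MADs from the median.

    values maps a unit identifier to one indicator for a single year.
    The multiplier k is an assumed, empirically chosen threshold.
    """
    med = statistics.median(values.values())
    mad = statistics.median(abs(v - med) for v in values.values())
    if mad == 0:
        return []
    return sorted(u for u, v in values.items() if abs(v - med) > k * mad)


def multiannual_flags(series, max_rel_change=0.5):
    """Flag years whose value changed by more than max_rel_change
    relative to the previous available year.

    series maps a year to the indicator value for one unit.
    """
    flags = []
    years = sorted(series)
    for prev, curr in zip(years, years[1:]):
        base = series[prev]
        if base == 0:
            continue  # relative change is undefined; skipped in this sketch
        if abs(series[curr] - base) / abs(base) > max_rel_change:
            flags.append(curr)
    return flags


if __name__ == "__main__":
    print(cross_sectional_flags({"A": 10, "B": 11, "C": 12, "D": 50}))
    print(multiannual_flags({2011: 100, 2012: 105, 2013: 210, 2014: 200}))
```

Flagged values are candidates for review rather than confirmed errors, which matches the abstract's wording of "potential outliers, inconsistencies, and errors".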
The recent emergence of diverse services has led to explosive traffic growth in cellular data networks. Understanding the service dynamics in large cellular networks is important for network design, troubleshooting, quality of service (QoE) support, and resource allocation. In this paper, we present a study that reveals the distributions and temporal patterns of different services in a cellular data network from two perspectives, namely service request times and service duration. Our study is based on big traffic data captured over a week-long period from a tier-1 mobile operator's network in China and parsed into readable records by our Hadoop-based packet parsing platform. We propose a Zipf's ranked model to characterize the distributions of traffic volume, packets, request times, and duration of cellular services. A two-stage method (a Self-Organizing Map combined with k-means) is first used to cluster the service time series into four request patterns and three duration patterns. These seven patterns are then combined to better understand the fine-grained temporal patterns of services in the cellular network. The results of our distribution models and temporal patterns give cellular network operators a better understanding of the request and duration characteristics of services, which is of great importance in network design, service generation, and resource allocation.
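A Zipf-type ranked model states that the value of the r-th ranked service is roughly proportional to r raised to a negative exponent. The abstract does not give the fitting procedure, so the sketch below assumes the simplest one: ordinary least squares on the log-log rank-volume relation. The function name and the synthetic data are illustrative only.

```python
import math


def fit_zipf(volumes):
    """Fit volume(rank) ~ c * rank**(-s) by least squares in log-log space.

    Returns (s, c). Non-positive volumes are dropped before ranking.
    """
    ranked = sorted((v for v in volumes if v > 0), reverse=True)
    if len(ranked) < 2:
        raise ValueError("need at least two positive volumes")
    xs = [math.log(rank) for rank in range(1, len(ranked) + 1)]
    ys = [math.log(v) for v in ranked]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return -slope, math.exp(intercept)


if __name__ == "__main__":
    # Synthetic per-service traffic volumes following an exact power law.
    synthetic = [1000.0 / rank ** 1.2 for rank in range(1, 51)]
    s, c = fit_zipf(synthetic)
    print(round(s, 3), round(c, 1))
```

Log-log regression is easy to read but is known to weight the tail heavily, so a maximum-likelihood fit may be preferable when the exponent itself is the quantity of interest.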
What tasks do the technological changes taking place in the world impose on tax administrations, and at the same time, what opportunities do they create for enforcing the principle of public responsibility? How can innovations like the European Digital Identity Wallet (EUDIW) be applied in the authentication environment? What assistance can the authorities provide in ensuring the integrity of taxpayers' business data? What developments can be seen in the work of the Hungarian tax administration to use transaction-based data to contribute to a more modern public administration system and, last but not least, to fair public burden sharing? How does blockchain as a technology platform support data integrity? How does personalized and easy-to-understand communication revolutionize customer information? These questions are answered in this article.
Funding (OSMAR HF radar observing and data service platform paper): The National Natural Science Foundation of China under contract No. 41206012.
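Abstract: To improve fine-grained hospital management, promote the systematic development of smart hospitals by medical institutions, and address the year-on-year growth in data reporting required at the national, provincial, and municipal levels, this paper builds a smart medical data management platform suited to real-world scenarios. The platform uses Data Services technology to extract indicators computed in the HANA database into the platform database and is developed with the Java SSM framework. It supports automatic data reporting by each department as well as fine-grained management of the reported data, including business processing, data verification, workflow management, and statistical analysis. With SAP Data Services as the tool, the platform computes and displays indicators automatically, streamlines workflows, and supports the construction of a data-driven, high-level smart hospital, thereby enhancing the hospital's core competitiveness.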
Funding (web page structured data extraction paper): Supported by the National High Technology Research and Development Programme of China (No. 2009AA01Z141), the National Natural Science Foundation of China (No. 60573117), and the Beijing Natural Science Foundation (No. 4131001).
Funding (21AT TripleSat geospatial data service model paper): Supported by the project of Beijing Municipal Science and Technology Commission and Science and Technology Innovation Base of Cultivating and Developing Engineering [grant number Z161100005016069] and the National High Technology Research and Development Program [grant number 2013AA12A303].
Funding: This work is supported by the National Science Foundation of China (No. 70472073).
Abstract: To study the characteristics and market structure of data service and the role of the operator, this paper builds a commercial model by applying the theory of intermediaries and neoclassical economics. Data service has different economic characteristics from voice service. First, the production mode of data service is roundabout production; second, the driving force of data service is economies of specialization; and finally, the management method of data service is impersonal management. In the data service market, information asymmetry and barriers to entry indirectly determine transaction efficiency and the specialization level of service providers. Therefore, the operator should intervene in the market by offering trade services in order to promote the development of service providers. Because service providers differ in quality, the market structure of data service must be one in which a trade platform built by the operator and an intermediary platform built by the operator coexist.
Funding: Supported by the National High Technology Research and Development Program of China (863 Program) (Nos. 2009AA12Z225, 2009AA12Z208) and the National Natural Science Foundation of China (No. 61074132).
Abstract: Ocean data portals based on Geographic Information Systems (GIS) are currently being developed around the world as a source of ocean data and information. However, given the relatively high temporal frequency and the intrinsically spatial nature of ocean data and information, no current GIS software deals effectively and efficiently with spatiotemporal data. Furthermore, while existing ocean data portals are generally designed to meet the basic needs of a broad range of users, they are sometimes very complicated for general audiences, especially for those without training in GIS. In this paper, a new technical architecture for an ocean data integration and service system is put forward that consists of four layers: the operation layer; the extract, transform, and load (ETL) layer; the data warehouse layer; and the presentation layer. The integration technology for the data warehouse layer, based on XML, ontology, and a spatiotemporal data organization scheme, is then discussed. In addition, the ocean observing data service technology realized in the presentation layer is discussed in detail, including the development of the web portal and the ocean data sharing platform. An application to the Taiwan Strait shows that the technology studied in this paper can facilitate the sharing, access, and use of ocean observation data. The paper is based on an ongoing research project to develop an ocean observing information system for the Taiwan Strait that will facilitate the prevention of ocean disasters.
Funding: Supported by the National Key Research and Development Program of China under Grant No. 2018YFB0203801 and the National Natural Science Foundation of China under Grant Nos. 61702529 and 61802424.
Abstract: Massive ocean data acquired by various observing platforms and sensors pose new challenges to data management and utilization. Typically, it is difficult to find the desired data efficiently and effectively among a large number of datasets. Most existing methods for data discovery are based on keyword retrieval or direct semantic reasoning, and they are either limited in data access rate or do not take the time cost into account. In this paper, we design and implement a novel system that alleviates the problem by introducing semantics with ontologies, referred to as Data Ontology and List-Based Publishing (DOLP). Specifically, we improve ocean data services in the following three aspects. First, we propose a unified semantic model called OEDO (Ocean Environmental Data Ontology) to represent heterogeneous ocean data by metadata and to publish them as data services. Second, we propose an optimized quick service query list (QSQL) data structure for storing the pre-inferred semantically related services and reducing the service querying time. Third, we propose two algorithms for optimizing QSQL hierarchically and horizontally, respectively, which aim to extend the semantic relationships of the data services and improve the data access rate. Experimental results show that DOLP outperforms the benchmark methods. First, our QSQL-based data discovery methods obtain a higher recall rate than the keyword-based method, and are faster than the traditional semantic method based on direct reasoning. Second, DOLP can handle more complex semantic relationships than the existing methods.
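The idea of storing pre-inferred, semantically related services so that a query becomes a lookup rather than a reasoning step can be illustrated with a minimal sketch. It is not the QSQL structure or the optimization algorithms of the paper: the function name `build_query_list`, the plain subclass hierarchy, and the concept and service names are all assumptions made for illustration.

```python
from collections import defaultdict


def build_query_list(subclass_of, services_by_concept):
    """Pre-infer, for every concept, the services annotated with that
    concept or with any of its descendant concepts (hypothetical helper).

    subclass_of: dict mapping a child concept to its parent concept.
    services_by_concept: dict mapping a concept to a list of service names.
    """
    children = defaultdict(set)
    for child, parent in subclass_of.items():
        children[parent].add(child)

    def descendants(concept):
        # Iterative depth-first walk down the subclass hierarchy.
        seen, stack = set(), [concept]
        while stack:
            node = stack.pop()
            for child in children[node]:
                if child not in seen:
                    seen.add(child)
                    stack.append(child)
        return seen

    concepts = set(subclass_of) | set(subclass_of.values()) | set(services_by_concept)
    query_list = {}
    for concept in concepts:
        related = set(services_by_concept.get(concept, ()))
        for sub in descendants(concept):
            related |= set(services_by_concept.get(sub, ()))
        query_list[concept] = sorted(related)
    return query_list


# Illustrative (invented) ontology fragment and service annotations.
hierarchy = {
    "SeaSurfaceTemperature": "Temperature",
    "Temperature": "OceanVariable",
    "Salinity": "OceanVariable",
}
services = {
    "SeaSurfaceTemperature": ["sst_service"],
    "Temperature": ["temp_profile_service"],
    "Salinity": ["sal_service"],
}
query_list = build_query_list(hierarchy, services)
```

At query time, `query_list["Temperature"]` is a dictionary lookup; the inference cost is paid once when the list is built, which is the trade-off the abstract describes.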
Funding: Supported by the National Natural Science Foundation of China (Nos. 61502043, 61132001), the Beijing Natural Science Foundation (No. 4162042), and the Beijing Talents Fund (No. 2015000020124G082).
Abstract: With the growing popularity of data-intensive services on the Internet, the traditional process-centric model for business processes faces challenges because it cannot describe data semantics and dependencies, which makes the design and implementation of processes inflexible. This paper proposes a novel data-aware business process model that is able to describe both explicit control flow and implicit data flow. A data model with dependencies formulated in Linear-time Temporal Logic (LTL) is presented, and their satisfiability is validated by an automaton-based model checking algorithm. Data dependencies are fully considered in the modeling phase, which helps to improve the efficiency and reliability of programming during the development phase. Finally, a prototype system for data-aware workflow, based on jBPM, is designed using this model and has been deployed to the Beijing Kingfore heating management system to validate the flexibility, efficacy and convenience of our approach for massive coding and large-scale system management in practice.
Funding: Supported by the National Natural Science Foundation of China (42274217).
Abstract: Fengyun meteorological satellites have undergone a series of significant developments over the past 50 years. Two generations and four types of Fengyun satellites, 21 in total, have been developed and launched, with 9 currently operational in orbit. The data obtained from Fengyun satellites are employed in a multitude of applications, including weather forecasting, meteorological disaster prevention and reduction, climate change, global environmental monitoring, and space weather. These data products and services are made available to the global community, resulting in tangible social and economic benefits. In 2023, two Fengyun meteorological satellites were successfully launched. This report presents an overview of the two recently launched Fengyun satellites and the Fengyun satellites currently in orbit, including an evaluation of their remote sensing instruments since 2022. Additionally, it addresses Fengyun satellite data archiving, data services, application services, international cooperation, and supporting activities. Finally, the development prospects are outlined.
Funding: Supported by the National Natural Science Foundation of China (60373072).
Abstract: In existing web services-based workflows, data exchange across the web services is centralized: the workflow engine intermediates at each step of the application sequence. However, many grid applications, especially data-intensive scientific applications, require exchanging large amounts of data across the grid services. Having a central workflow engine relay the data between the services would result in a bottleneck in these cases. This paper proposes a data exchange model for individual grid workflows and for the composition of multiple workflows, respectively. The model enables direct communication of large amounts of data between two grid services. To enable data exchange among multiple workflows, a bridge data service is used.
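The contrast between relaying data through a central engine and letting services exchange data directly can be illustrated with a toy, in-process sketch. It does not reproduce the paper's model or its bridge data service: the class names `GridService` and `WorkflowEngine`, the reference tuple, and the service names are all hypothetical. The engine only forwards a small reference, and the consumer pulls the payload from the producer itself.

```python
class GridService:
    """Toy stand-in for a grid service that keeps its outputs locally."""

    def __init__(self, name):
        self.name = name
        self.outputs = {}
        self.inputs = {}

    def produce(self, key, payload):
        """Store a result locally and return a small reference to it."""
        self.outputs[key] = payload
        return (self, key)

    def fetch_from(self, reference):
        """Pull the payload directly from the producing service."""
        producer, key = reference
        self.inputs[key] = producer.outputs[key]
        return len(self.inputs[key])


class WorkflowEngine:
    """Coordinates control flow; it only ever handles references."""

    def __init__(self):
        self.messages = []

    def connect(self, reference, consumer):
        producer, key = reference
        # Record the control message; the payload itself is not touched here.
        self.messages.append((producer.name, key, consumer.name))
        return consumer.fetch_from(reference)


simulate = GridService("simulate")
analyze = GridService("analyze")
engine = WorkflowEngine()

ref = simulate.produce("field", b"x" * 1024)
transferred = engine.connect(ref, analyze)
```

In a real deployment the reference would be a network address or handle rather than a Python object, but the division of labor is the same: control messages go through the engine, bulk data does not.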
Abstract: The World Data Center (WDC) for Seismology, Beijing, has been developing in China for 20 years as of this year. A sustained and stable data sharing service system has already taken shape. This article gives an overview of the construction and development of the WDC for Seismology, Beijing. It outlines the history, facilities and technical specifications of the center. It also describes the data service and the website, and gives a brief outlook.
Funding: Supported by the National Natural Science Foundation of China (61622301, 61533002), the Beijing Natural Science Foundation (4172005), and the Major National Science and Technology Project (2017ZX07104).
Abstract: In the wastewater treatment process (WWTP), accurate and real-time monitoring of key variables is crucial for operational strategies. However, most existing methods have difficulty obtaining real-time values of some key variables in the process. To handle this issue, a data-driven intelligent monitoring system, using the soft sensor technique and a data distribution service, is developed to monitor the concentrations of effluent total phosphorus (TP) and ammonia nitrogen (NH4-N). In this intelligent monitoring system, a fuzzy neural network (FNN) is applied to design the soft sensor model, and a principal component analysis (PCA) method is used to select the input variables of the soft sensor model. Moreover, data transfer software is used to integrate the soft sensor technique into the supervisory control and data acquisition (SCADA) system. Finally, the proposed intelligent monitoring system is tested in several real plants to demonstrate the reliability and effectiveness of its monitoring performance.
Funding: Supported by the European Commission ETER Project (No. 934533-2017-AO8-CH) and the H2020 RISIS 2 project (No. 824091).
Abstract: Purpose: This paper concerns the definition of data quality procedures for knowledge organizations such as Higher Education Institutions. The main purpose is to present the flexible approach developed for monitoring the data quality of the European Tertiary Education Register (ETER) database, illustrating its functioning and highlighting the main challenges that still have to be faced in this domain. Design/methodology/approach: The proposed data quality methodology is based on two kinds of checks, one to assess the consistency of cross-sectional data and the other to evaluate the stability of multiannual data. This methodology has an operational and empirical orientation: the proposed checks do not assume any theoretical distribution when determining the threshold parameters that identify potential outliers, inconsistencies, and errors in the data. Findings: We show that the proposed cross-sectional and multiannual checks help to identify outliers and extreme observations and to detect ontological inconsistencies not described in the available metadata. For this reason, they may be a useful complement to the processing of the available information. Research limitations: The coverage of the study is limited to European Higher Education Institutions. The cross-sectional and multiannual checks are not yet completely integrated. Practical implications: Considering the quality of the available data and information is important for data quality-aware empirical investigations, as it highlights problems and areas in which to invest to improve the coverage and interoperability of data in future data collection initiatives. Originality/value: The data-driven quality checks proposed in this paper may be useful as a reference for building and monitoring the data quality of new databases, or of existing databases available for other countries or systems characterized by high heterogeneity and complexity of the units of analysis, without relying on pre-specified theoretical distributions.
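The two kinds of checks described above can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the ETER procedure: the function names, the "parts must sum to the total" rule, the field names, and the 95th-percentile cut-off are all hypothetical. What the sketch shares with the abstract is that the multiannual threshold is taken from the empirical distribution of the observed changes rather than from an assumed theoretical one.

```python
from statistics import quantiles


def cross_sectional_flags(records, parts, total, tolerance=0):
    """Flag records whose breakdown does not add up to the reported total."""
    flagged = []
    for rec in records:
        if abs(sum(rec[p] for p in parts) - rec[total]) > tolerance:
            flagged.append(rec["id"])
    return flagged


def multiannual_flags(series_by_unit, upper_percentile=95):
    """Flag (unit, year) pairs whose year-on-year relative change exceeds
    an empirical percentile of all observed changes."""
    changes = []
    for unit, series in series_by_unit.items():
        years = sorted(series)
        for prev, cur in zip(years, years[1:]):
            if series[prev] != 0:
                rel = abs(series[cur] - series[prev]) / abs(series[prev])
                changes.append((unit, cur, rel))
    if len(changes) < 2:
        return []  # too few observations to estimate a threshold
    # 99 cut points; index k-1 holds the k-th percentile.
    cuts = quantiles([rel for _, _, rel in changes], n=100, method="inclusive")
    threshold = cuts[upper_percentile - 1]
    return [(unit, year) for unit, year, rel in changes if rel > threshold]


# Invented example data: one inconsistent record, one unstable series.
records = [
    {"id": "a", "men": 40, "women": 60, "total": 100},
    {"id": "b", "men": 40, "women": 50, "total": 100},
]
series = {f"u{i}": {2020: 100, 2021: 101} for i in range(9)}
series["u9"] = {2020: 100, 2021: 200}

inconsistent = cross_sectional_flags(records, ["men", "women"], "total")
unstable = multiannual_flags(series)
```

Flagged items are candidates for review rather than confirmed errors; a large year-on-year change may reflect a real event such as a merger of institutions.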
Funding: Supported by the National Basic Research Program of China (973 Program: 2013CB329004).
Abstract: The recent emergence of diverse services has led to explosive traffic growth in cellular data networks. Understanding the service dynamics in large cellular networks is important for network design, troubleshooting, quality of service (QoE) support, and resource allocation. In this paper, we present our study of the distributions and temporal patterns of different services in a cellular data network from two perspectives, namely service request times and service duration. Our study is based on big traffic data captured over a week-long period from a tier-1 mobile operator's network in China and parsed into readable records by our Hadoop-based packet parsing platform. We propose a Zipf's ranked model to characterize the distributions of traffic volume, packets, request times and duration of cellular services. A two-stage method (Self-Organizing Map combined with k-means) is first used to cluster the time series of services into four request patterns and three duration patterns. These seven patterns are then combined to better understand the fine-grained temporal patterns of services in the cellular network. The results of our distribution models and temporal patterns give cellular network operators a better understanding of the request and duration characteristics of services, which is of great importance in network design, service generation and resource allocation.
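A rank-based Zipf fit of the kind mentioned above can be sketched as an ordinary least-squares fit in log-log space. This is the textbook form, value ≈ C · rank^(−α), and is only assumed to resemble the "Zipf's ranked model" of the paper, whose exact formulation is not given in the abstract. The function name `fit_zipf` and the synthetic data are illustrative.

```python
import math


def fit_zipf(values):
    """Fit value ~ C * rank**(-alpha) by least squares on log(rank) vs log(value).

    Returns (alpha, C). Non-positive values are ignored because their
    logarithm is undefined.
    """
    ranked = sorted((v for v in values if v > 0), reverse=True)
    if len(ranked) < 2:
        raise ValueError("need at least two positive values")
    xs = [math.log(rank) for rank in range(1, len(ranked) + 1)]
    ys = [math.log(v) for v in ranked]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    slope = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / sum(
        (x - mean_x) ** 2 for x in xs
    )
    intercept = mean_y - slope * mean_x
    return -slope, math.exp(intercept)


# Synthetic per-service traffic volumes that follow an exact 1/rank law.
volumes = [1000 / rank for rank in range(1, 51)]
alpha, scale = fit_zipf(volumes)
```

On real traffic data the fit is only approximate, and a least-squares fit in log-log space weights the long tail heavily, so the residuals are worth inspecting before the exponent is interpreted.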
Abstract: What tasks do the technological changes taking place in the world impose on tax administrations, and at the same time, what opportunities do they create for enforcing the principle of public responsibility? How can innovations such as the European Digital Identity Wallet (EUDIW) be applied in the authentication environment? What assistance can the authorities provide in ensuring the integrity of taxpayers' business data? What developments are seen in the work of the Hungarian tax administration to use transaction-based data to contribute to a more modern public administration system and, last but not least, to a fair public burden? How does blockchain as a technology platform support data integrity? How does personalized and easy-to-understand communication revolutionize customer information? These questions are answered in this article.