Since the British National Archive put forward the concept of the digital continuity in 2007,several developed countries have worked out their digital continuity action plan.However,the technologies of the digital con...Since the British National Archive put forward the concept of the digital continuity in 2007,several developed countries have worked out their digital continuity action plan.However,the technologies of the digital continuity guarantee are still lacked.At first,this paper analyzes the requirements of digital continuity guarantee for electronic record based on data quality theory,then points out the necessity of data quality guarantee for electronic record.Moreover,we convert the digital continuity guarantee of electronic record to ensure the consistency,completeness and timeliness of electronic record,and construct the first technology framework of the digital continuity guarantee for electronic record.Finally,the temporal functional dependencies technology is utilized to build the first integration method to insure the consistency,completeness and timeliness of electronic record.展开更多
Nowadays,several research projects show interest in employing volunteered geographic information(VGI)to improve their systems through using up-to-date and detailed data.The European project CAP4Access is one of the su...Nowadays,several research projects show interest in employing volunteered geographic information(VGI)to improve their systems through using up-to-date and detailed data.The European project CAP4Access is one of the successful examples of such international-wide research projects that aims to improve the accessibility of people with restricted mobility using crowdsourced data.In this project,OpenStreetMap(OSM)is used to extend OpenRouteService,a well-known routing platform.However,a basic challenge that this project tackled was the incompleteness of OSM data with regards to certain information that is required for wheelchair accessibility(e.g.sidewalk information,kerb data,etc.).In this article,we present the results of initial assessment of sidewalk data in OSM at the beginning of the project as well as our approach in awareness raising and using tools for tagging accessibility data into OSM database for enriching the sidewalk data completeness.Several experiments have been carried out in different European cities,and discussion on the results of the experiments as well as the lessons learned are provided.The lessons learned provide recommendations that help in organizing better mapping party events in the future.We conclude by reporting on how and to what extent the OSM sidewalk data completeness in these study areas have benefited from the mapping parties by the end of the project.展开更多
Digital educational content is gaining importance as an incubator of pedagogical methodologies in formal and informal online educational settings. Its educational efficiency is directly dependent on its quality, howev...Digital educational content is gaining importance as an incubator of pedagogical methodologies in formal and informal online educational settings. Its educational efficiency is directly dependent on its quality, however educational content is more than information and data. This paper presents a new data quality framework for assessing digital educational content used for teaching in distance learning environments. The model relies on the ISO2500 series quality standard and beside providing the mechanisms for multi-facet quality assessment it also supports organizations that design, create, manage and use educational content with the quality tools (expressed as quality metrics and measurement methods) to provide a more efficient distance education experience. The model describes the quality characteristics of the educational material content using data and software quality characteristics.展开更多
Availability of digital elevation models (DEMs) of a high quality is becoming more and more important in spatial studies. Standard methods for DEM creation use only intentionally acquired data sources. Two approache...Availability of digital elevation models (DEMs) of a high quality is becoming more and more important in spatial studies. Standard methods for DEM creation use only intentionally acquired data sources. Two approaches which employ various types of data sets for DEM production are proposed: (1) Method of weighted sum of different data sources with morphological enhancement that conflates any additional data sources to principal DEM, and (2) DEM updating methods of modeling absolute and relative temporal changes, considering landslides, earthquakes, quarries, watererosion, building and highway constructions, etc. Spatial modeling of environmental variables concerning both approaches for (a) quality control of data sources, considering regions, (b) pre-processing of data sources, and (c) processing of the final DEM, have been applied. The variables are called rate of karst, morphologic roughness (modeled from slope, profile curvature and elevation), characteristic features, rate of forestation, hydrological network, and rate of urbanization. Only the variables evidenced as significant were used in spatial modeling to generate homogeneous regions in spatial modeling a-c. The production process uses different regions to define high quality conflation of data sources to the final DEM. The methodology had been confirmed by case studies. The result is an overall high quality DEM with various well-known parameters.展开更多
从天地图融合数据质量检查出发,依据数据标准,通过结合具体的质检规则,研究了一种基于ArcGIS Data Reviewer模块的自动化、批量化并且可使数据在处理阶段就可进行检查的天地图融合数据检验方法,这种灵活的质检机制大大减少了数据融合过...从天地图融合数据质量检查出发,依据数据标准,通过结合具体的质检规则,研究了一种基于ArcGIS Data Reviewer模块的自动化、批量化并且可使数据在处理阶段就可进行检查的天地图融合数据检验方法,这种灵活的质检机制大大减少了数据融合过程中的人工反复处理,提高了生产单位的作业效率及成果质量,也可为其他项目的质检系统开发提供借鉴。展开更多
实体识别是数据量质融合管理中的一项关键技术,对能否提高数据质量起着决定性作用.其目的在于识别出数据中表示同一对象的不同形式;以及同一形式所代表的不同对象.随着大数据研究技术的发展,大数据上的实体识别问题受到了广泛关注.因此...实体识别是数据量质融合管理中的一项关键技术,对能否提高数据质量起着决定性作用.其目的在于识别出数据中表示同一对象的不同形式;以及同一形式所代表的不同对象.随着大数据研究技术的发展,大数据上的实体识别问题受到了广泛关注.因此,在大数据的信息集成背景下,给出了一个基于Map-Reduce框架的大数据实体识别算法(entity identification in big data based on Map-Reduce,EIBM).该算法首先通过属性值计算记录间的相似程度,而后基于图聚类的方法进行实体识别从而输出得到最终结果.最后,在Hadoop平台上对真实数据集和人造数据集进行了多组实验,实验结果验证了算法的并行程度和对于处理大数据的有效性与高效性.展开更多
基金This work is supported by the NSFC(Nos.61772280,61772454)the Changzhou Sci&Tech Program(No.CJ20179027)the PAPD fund from NUIST.Prof.Jin Wang is the corresponding author。
文摘Since the British National Archive put forward the concept of the digital continuity in 2007,several developed countries have worked out their digital continuity action plan.However,the technologies of the digital continuity guarantee are still lacked.At first,this paper analyzes the requirements of digital continuity guarantee for electronic record based on data quality theory,then points out the necessity of data quality guarantee for electronic record.Moreover,we convert the digital continuity guarantee of electronic record to ensure the consistency,completeness and timeliness of electronic record,and construct the first technology framework of the digital continuity guarantee for electronic record.Finally,the temporal functional dependencies technology is utilized to build the first integration method to insure the consistency,completeness and timeliness of electronic record.
基金supported by the European Community’s Seventh Framework Programme[FP7/2007–2013],[Grant No 612096(CAP4Access)].
文摘Nowadays,several research projects show interest in employing volunteered geographic information(VGI)to improve their systems through using up-to-date and detailed data.The European project CAP4Access is one of the successful examples of such international-wide research projects that aims to improve the accessibility of people with restricted mobility using crowdsourced data.In this project,OpenStreetMap(OSM)is used to extend OpenRouteService,a well-known routing platform.However,a basic challenge that this project tackled was the incompleteness of OSM data with regards to certain information that is required for wheelchair accessibility(e.g.sidewalk information,kerb data,etc.).In this article,we present the results of initial assessment of sidewalk data in OSM at the beginning of the project as well as our approach in awareness raising and using tools for tagging accessibility data into OSM database for enriching the sidewalk data completeness.Several experiments have been carried out in different European cities,and discussion on the results of the experiments as well as the lessons learned are provided.The lessons learned provide recommendations that help in organizing better mapping party events in the future.We conclude by reporting on how and to what extent the OSM sidewalk data completeness in these study areas have benefited from the mapping parties by the end of the project.
文摘Digital educational content is gaining importance as an incubator of pedagogical methodologies in formal and informal online educational settings. Its educational efficiency is directly dependent on its quality, however educational content is more than information and data. This paper presents a new data quality framework for assessing digital educational content used for teaching in distance learning environments. The model relies on the ISO2500 series quality standard and beside providing the mechanisms for multi-facet quality assessment it also supports organizations that design, create, manage and use educational content with the quality tools (expressed as quality metrics and measurement methods) to provide a more efficient distance education experience. The model describes the quality characteristics of the educational material content using data and software quality characteristics.
文摘Availability of digital elevation models (DEMs) of a high quality is becoming more and more important in spatial studies. Standard methods for DEM creation use only intentionally acquired data sources. Two approaches which employ various types of data sets for DEM production are proposed: (1) Method of weighted sum of different data sources with morphological enhancement that conflates any additional data sources to principal DEM, and (2) DEM updating methods of modeling absolute and relative temporal changes, considering landslides, earthquakes, quarries, watererosion, building and highway constructions, etc. Spatial modeling of environmental variables concerning both approaches for (a) quality control of data sources, considering regions, (b) pre-processing of data sources, and (c) processing of the final DEM, have been applied. The variables are called rate of karst, morphologic roughness (modeled from slope, profile curvature and elevation), characteristic features, rate of forestation, hydrological network, and rate of urbanization. Only the variables evidenced as significant were used in spatial modeling to generate homogeneous regions in spatial modeling a-c. The production process uses different regions to define high quality conflation of data sources to the final DEM. The methodology had been confirmed by case studies. The result is an overall high quality DEM with various well-known parameters.
文摘从天地图融合数据质量检查出发,依据数据标准,通过结合具体的质检规则,研究了一种基于ArcGIS Data Reviewer模块的自动化、批量化并且可使数据在处理阶段就可进行检查的天地图融合数据检验方法,这种灵活的质检机制大大减少了数据融合过程中的人工反复处理,提高了生产单位的作业效率及成果质量,也可为其他项目的质检系统开发提供借鉴。
文摘实体识别是数据量质融合管理中的一项关键技术,对能否提高数据质量起着决定性作用.其目的在于识别出数据中表示同一对象的不同形式;以及同一形式所代表的不同对象.随着大数据研究技术的发展,大数据上的实体识别问题受到了广泛关注.因此,在大数据的信息集成背景下,给出了一个基于Map-Reduce框架的大数据实体识别算法(entity identification in big data based on Map-Reduce,EIBM).该算法首先通过属性值计算记录间的相似程度,而后基于图聚类的方法进行实体识别从而输出得到最终结果.最后,在Hadoop平台上对真实数据集和人造数据集进行了多组实验,实验结果验证了算法的并行程度和对于处理大数据的有效性与高效性.