Abstract: The Soil Conservation Monitoring Information System (SCMIS) presented in this paper is oriented to soil erosion control and to resource exploitation, utilization, planning, and management for a small watershed (about 10 sq. km) on the Loess Plateau. It combines Remote Sensing (RS), Geographical Information System (GIS), and Expert System (ES) technologies into an integrated system. As a basic-level information system for the Loess Plateau, its refinement and spread will bring about a great advance in resource exploitation and management on the Loess Plateau.
Abstract: There has long been discussion about the distinctions among library science, information science, and informatics, and about how these areas differ from and overlap with computer science. Today the term data science is emerging, generating excitement and questions about how it relates to and differs from these other areas of study.
Abstract: The main areas of interest of JDIS are: 1) new theories, methods, and techniques of big-data-based data mining, knowledge discovery, and informatics, including but not limited to scientometrics, communication analysis, social network analysis, tech & industry analysis, competitive intelligence, knowledge mapping, evidence-based policy analysis, and predictive analysis.
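Abstract: After in-depth discussion and repeated review by experts in China and abroad, with the consent of the supervising and sponsoring organizations, and with formal approval from the State Administration of Press, Publication, Radio, Film and Television (approval document 新广出审[2015]1187号), the Chinese Journal of Library and Information Science (《中国文献情报(英)》, CJLIS), sponsored by the National Science Library, Chinese Academy of Sciences, will be officially renamed the Journal of Data and Information Science (《数据与情报科学学报(英)》, JDIS) starting in 2016. As the only English-language academic journal in library and information science in China, CJLIS has, since its founding in 2008, taken as its mission the publication of high-level research papers that meet international standards and the advancement of library and information science in China. By soliciting excellent manuscripts, upholding academic standards, promoting open access, and strictly controlling the review process, it has won full recognition from the field and broad ...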
Funding: This work was developed with the support of the H2020 RISIS 2 Project (No. 824091) and of the "Sapienza" Research Awards No. RM1161550376E40E of 2016 and RM11916B8853C925 of 2019. This article is a largely extended version of Bianchi et al. (2019), presented at the ISSI 2019 Conference held in Rome, 2–5 September 2019.
Abstract: Purpose: The main objective of this work is to show the potential of recently developed approaches for automatic knowledge extraction directly from universities' websites. The automatically extracted information can potentially be updated more than once per year and is safe from manipulation or misinterpretation. Moreover, this approach allows flexibility in collecting indicators about the efficiency of universities' websites and their effectiveness in disseminating key contents. These new indicators can complement traditional indicators of scientific research (e.g. number of articles and number of citations) and teaching (e.g. number of students and graduates) by introducing further dimensions that allow new insights for "profiling" the analyzed universities. Design/methodology/approach: Webometrics relies on web mining methods and techniques to perform quantitative analyses of the web. This study implements an advanced application of the webometric approach, exploiting all three categories of web mining: web content mining, web structure mining, and web usage mining. The information used to compute the indicators was extracted from the universities' websites using web scraping and text mining techniques. The scraped information was stored in a NoSQL DB in a semistructured form so that it can be retrieved efficiently by text mining techniques. This provides increased flexibility in the design of new indicators, opening the door to new types of analyses. Some data were also collected by means of batch interrogations of search engines (Bing, www.bing.com) or from a leading provider of Web analytics (SimilarWeb, http://www.similarweb.com). The information extracted from the Web was combined with university structural information taken from the European Tertiary Education Register (https://eter.joanneum.at/#/home), a database collecting information on Higher Education Institutions (HEIs) at the European level. All of the above was used to cluster 79 Italian universities on the basis of structural and digital indicators. Findings: The main findings of this study concern the evaluation of the potential for digitalization of universities, in particular by presenting techniques for the automatic extraction of information from the web to build indicators of the quality and impact of universities' websites. These indicators can complement traditional indicators and can be used, through clustering techniques, to identify groups of universities with common features. Research limitations: The results reported in this study refer to Italian universities only, but the approach could be extended to other university systems abroad. Practical implications: The approach proposed in this study and its illustration on Italian universities show the usefulness of recently introduced automatic data extraction and web scraping approaches and their practical relevance for characterizing and profiling the activities of universities on the basis of their websites. The approach could be applied to other university systems. Originality/value: This work applies for the first time to university websites some recently introduced techniques for automatic knowledge extraction based on web scraping, optical character recognition, and nontrivial text mining operations (Bruni & Bianchi, 2020).
Funding: Supported by the European Commission ETER Project (No. 934533-2017-AO8-CH) and the H2020 RISIS 2 project (No. 824091).
Abstract: Purpose: This paper addresses the definition of data quality procedures for knowledge organizations such as Higher Education Institutions. The main purpose is to present the flexible approach developed for monitoring the data quality of the European Tertiary Education Register (ETER) database, illustrating how it works and highlighting the main challenges that remain in this domain. Design/methodology/approach: The proposed data quality methodology is based on two kinds of checks, one to assess the consistency of cross-sectional data and the other to evaluate the stability of multiannual data. The methodology has an operational and empirical orientation: the proposed checks do not assume any theoretical distribution when determining the threshold parameters that identify potential outliers, inconsistencies, and errors in the data. Findings: We show that the proposed cross-sectional and multiannual checks help to identify outliers and extreme observations and to detect ontological inconsistencies not described in the available metadata. For this reason, they may be a useful complement to the processing of the available information. Research limitations: The coverage of the study is limited to European Higher Education Institutions. The cross-sectional and multiannual checks are not yet completely integrated. Practical implications: Considering the quality of the available data and information is important for data-quality-aware empirical investigations, because it highlights problems and the areas in which to invest to improve the coverage and interoperability of data in future data collection initiatives. Originality/value: The data-driven quality checks proposed in this paper may be useful as a reference for building and monitoring the data quality of new databases, or of existing databases for other countries or systems characterized by high heterogeneity and complexity of the units of analysis, without relying on pre-specified theoretical distributions.
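The two kinds of checks named in the ETER data-quality abstract can be illustrated with a minimal sketch. This is not the ETER procedure itself: the median/MAD rule, the year-on-year ratio rule, the function names, and the thresholds `k` and `max_ratio` are all assumptions chosen only because they avoid any theoretical distribution.

```python
from statistics import median


def mad_outliers(values, k=3.0):
    """Cross-sectional check (illustrative): flag values whose distance
    from the median exceeds k times the median absolute deviation."""
    med = median(values)
    mad = median(abs(v - med) for v in values)
    if mad == 0:
        # No spread to compare against, so nothing is flagged.
        return [False] * len(values)
    return [abs(v - med) / mad > k for v in values]


def unstable_years(series, max_ratio=2.0):
    """Multiannual check (illustrative): flag each year-to-year step where
    the larger value exceeds max_ratio times the smaller one."""
    flags = []
    for prev, cur in zip(series, series[1:]):
        lo, hi = sorted((prev, cur))
        # Steps involving a zero are not flagged by this simple rule.
        flags.append(lo > 0 and hi / lo > max_ratio)
    return flags


# Hypothetical student counts across five institutions in one year.
print(mad_outliers([1000, 1200, 900, 1100, 50000]))
# Hypothetical student counts for one institution over four years.
print(unstable_years([100, 110, 400, 390]))
```

Both thresholds are empirical tuning parameters; in practice they would be set by inspecting the data rather than derived from an assumed distribution.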
Abstract: Farmland irrigation is an important part of China's water conservancy work. To keep abreast of farmland water conservancy conditions and the related environmental parameters, external sensors can be used to acquire real-time data, and computer technology, networking and communication technology, database technology, geographic information system (GIS) technology, component technology, and other modern information technologies can be combined with automated water conservancy technology to build an accurate, efficient, fast, comprehensive, and standardized decision support system that integrates the collection, transmission, storage, management, and analysis of farmland water conservancy and irrigation information. This paper details the overall framework design of a GIS-based farmland water conservancy and irrigation management system, covering information collection and data input as well as information processing and data output, to provide a reference for digital water conservancy construction in China.
Abstract: To extract the boundaries of rural habitations from geographic name data and basic geographic information data, this paper proposes an extraction method based on polygon aggregation. The method extracts boundaries at three levels of rural habitation: town, administrative village, and natural village. It first extracts the boundary of each natural village by aggregating residential polygons, then extracts the boundary of each administrative village by aggregating natural village boundaries, and finally extracts the boundary of each town by aggregating administrative village boundaries. The extraction procedure for each of the three levels is described in detail through an experiment with basic geographic information data and geographic name data. The experimental results show that the method can serve as a reference for boundary extraction of rural habitations.
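The bottom-up, three-level structure of the extraction method can be sketched as follows. This is only an illustration under a strong simplifying assumption: a convex hull stands in for the paper's polygon aggregation operator, which the abstract does not specify. The function names and the sample coordinates are hypothetical.

```python
def convex_hull(points):
    """Andrew's monotone chain; returns hull vertices counter-clockwise,
    with collinear points dropped."""
    pts = sorted(set(points))
    if len(pts) <= 2:
        return pts

    def cross(o, a, b):
        return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])

    lower, upper = [], []
    for p in pts:
        while len(lower) >= 2 and cross(lower[-2], lower[-1], p) <= 0:
            lower.pop()
        lower.append(p)
    for p in reversed(pts):
        while len(upper) >= 2 and cross(upper[-2], upper[-1], p) <= 0:
            upper.pop()
        upper.append(p)
    return lower[:-1] + upper[:-1]


def aggregate(children_by_parent, boundaries):
    """Merge the boundaries of each parent's children into one boundary.
    Convex hull is a stand-in for a real polygon aggregation operator."""
    return {
        parent: convex_hull([pt for c in children for pt in boundaries[c]])
        for parent, children in children_by_parent.items()
    }


# Hypothetical residential polygons, each given as a list of (x, y) vertices.
buildings = {
    "b1": [(0, 0), (1, 0), (1, 1), (0, 1)],
    "b2": [(2, 0), (3, 0), (3, 1), (2, 1)],
    "b3": [(10, 10), (11, 10), (11, 11), (10, 11)],
}
# Level 1: residential polygons -> natural villages.
natural = aggregate({"nv1": ["b1", "b2"], "nv2": ["b3"]}, buildings)
# Level 2: natural villages -> administrative villages.
admin = aggregate({"av1": ["nv1", "nv2"]}, natural)
# Level 3: administrative villages -> town.
town = aggregate({"t1": ["av1"]}, admin)
print(natural["nv1"])
```

In this sketch the grouping of children under parents is given explicitly; in the method described above it would come from the geographic name data.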
Abstract: The rapid development and globalization of modern tourism have brought more and more tourists and tourist attractions, creating opportunities for the tourism industry. Challenges come with them, however, including the cost of travelers' trust, the satisfaction of personalized tourism needs, and the supervision and management of tourist attractions. To address these problems and improve the tourism experience, this paper designs a blockchain-based smart tourism platform. First, by adopting blockchain technology, a decentralized database is established to reduce the cost of trust for travelers. Second, through the platform, travelers can customize personalized travel routes and itineraries according to their preferences and needs. The platform uses the data interaction function of blockchain to visually display the resources and services of tourist attractions, and travelers can choose among them according to their own preferences, which enhances the personalized tourism experience. Third, the platform provides reliable and transparent data: through the decentralized nature of blockchain, the relevant data of tourist attractions are recorded and stored and cannot be tampered with, ensuring their authenticity and credibility. Regulatory authorities can easily access and review these data and can control and supervise scenic spots more effectively. Finally, this paper combines the DPOS (Delegated Proof of Stake) and PBFT (Practical Byzantine Fault Tolerance) consensus algorithms to improve the on-chain consensus efficiency of the platform. The consensus algorithm is the core mechanism that ensures the normal operation and consistency of a blockchain system, and introducing DPOS and PBFT improves the throughput and performance of the system.
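One way to picture how DPOS and PBFT can be combined is to let stake-weighted voting pick a small delegate set and then require a Byzantine quorum among those delegates. The sketch below is a heavily simplified illustration, not the paper's algorithm: the function names are hypothetical, each holder casts a single vote, and PBFT's pre-prepare, prepare, and commit message phases are collapsed into one approval count.

```python
from collections import Counter


def elect_delegates(votes, stakes, n_delegates):
    """DPOS-style step: each holder votes for one candidate and the vote
    is weighted by the holder's stake. Ties are broken by candidate name."""
    tally = Counter()
    for holder, candidate in votes.items():
        tally[candidate] += stakes[holder]
    ranked = sorted(tally.items(), key=lambda kv: (-kv[1], kv[0]))
    return [name for name, _ in ranked[:n_delegates]]


def pbft_commit(delegates, approvals):
    """PBFT-style quorum: with n delegates and at most f = (n - 1) // 3
    faulty ones, commit only if at least n - f approve
    (this equals 2f + 1 when n = 3f + 1)."""
    n = len(delegates)
    f = (n - 1) // 3
    yes = sum(1 for d in delegates if approvals.get(d, False))
    return yes >= n - f


# Hypothetical stakes and votes.
stakes = {"a": 50, "b": 30, "c": 10, "d": 5, "e": 5}
votes = {"a": "n1", "b": "n2", "c": "n3", "d": "n4", "e": "n5"}
delegates = elect_delegates(votes, stakes, n_delegates=4)
print(delegates)
print(pbft_commit(delegates, {"n1": True, "n2": True, "n3": True}))
```

Restricting the quorum round to a few elected delegates is what reduces message traffic compared with running a Byzantine agreement among all nodes.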
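Abstract: New-generation information technology has driven rapid growth in the intelligence level of cold chains, and accurately gauging the development level of smart cold chains is a prerequisite for targeted breakthroughs in technical bottlenecks and for strategic planning of development directions. On this basis, this study builds an evaluation index system for smart cold chain development in China along the dimensions of supply capacity, warehousing capacity, transport capacity, economic benefit, and informatization level. It applies the entropy weight method combined with the Technique for Order Preference by Similarity to Ideal Solution (TOPSIS) to quantitatively evaluate smart cold chain development in 30 Chinese provinces and municipalities (excluding Tibet, Hong Kong, Macao, and Taiwan) from 2017 to 2021, and uses Exploratory Spatial Data Analysis (ESDA) and Geographically Weighted Regression (GWR) to analyze how the influence of the evaluation indicators varies across provinces. The results show that economic development, supporting infrastructure, and informatization level have relatively large effects on smart cold chain construction; that smart cold chain development is generally better in the eastern coastal region, while the southwest and northwest develop slowly and lag overall; and that the evaluation indicators exhibit significant spatiotemporal heterogeneity, with their influence varying by location and over time. To raise the overall level of smart cold chain development in China, the study proposes countermeasures for strengthening supporting infrastructure and promoting informatization transformation and upgrading. The study can provide a scientific basis for the overall planning, strategic layout, and holistic advancement of China's smart cold chain.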
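The entropy weight method combined with TOPSIS can be sketched in a few lines. The version below is a generic textbook form under stated assumptions, not the study's exact procedure: all indicators are treated as positive benefit-type indicators, vector normalization is used in TOPSIS, and the function names and sample matrix are hypothetical. It needs Python 3.8 or later for `math.dist`.

```python
import math


def entropy_weights(matrix):
    """Entropy weight method. Rows are alternatives, columns are indicators.
    Assumes positive, benefit-type indicators with some variation."""
    m, n = len(matrix), len(matrix[0])
    divergence = []
    for j in range(n):
        col = [row[j] for row in matrix]
        total = sum(col)
        p = [v / total for v in col]
        # Normalized entropy in [0, 1]; zero shares contribute nothing.
        e = -sum(pi * math.log(pi) for pi in p if pi > 0) / math.log(m)
        divergence.append(1.0 - e)
    s = sum(divergence)
    return [d / s for d in divergence]


def topsis(matrix, weights):
    """TOPSIS closeness scores in [0, 1]; higher is closer to the ideal."""
    n = len(matrix[0])
    norms = [math.sqrt(sum(row[j] ** 2 for row in matrix)) for j in range(n)]
    v = [[weights[j] * row[j] / norms[j] for j in range(n)] for row in matrix]
    best = [max(r[j] for r in v) for j in range(n)]
    worst = [min(r[j] for r in v) for j in range(n)]
    scores = []
    for r in v:
        d_best, d_worst = math.dist(r, best), math.dist(r, worst)
        scores.append(d_worst / (d_best + d_worst))
    return scores


# Hypothetical data: three regions scored on two indicators.
data = [[9, 8], [5, 6], [1, 2]]
w = entropy_weights(data)
print(w)
print(topsis(data, w))
```

Indicators whose values differ more across regions carry lower entropy and therefore receive larger weights, which is why the first column in this sample is weighted more heavily than the second.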
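Abstract: Objective: To standardize the extraction of nursing-sensitive quality indicators using information systems. Methods: Variables were collected by improving and applying four information systems: the nursing quality management system, the hospital information system (HIS), the 蓝蜻蜓 (Blue Dragonfly) system, and the online survey tool "问卷星" (Wenjuanxing), with quality control performed during collection. Results: The appropriate information channel for extracting each indicator was identified, and the method for obtaining each indicator was standardized. Conclusion: Using information systems to collect the variables for nursing-sensitive quality indicators ensures the measurability, objectivity, validity, and timeliness of the indicators.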