There has long been discussion about the distinctions of library science,information science,and informatics,and how these areas differ and overlap with computer science.Today the term data science is emerging that ge...There has long been discussion about the distinctions of library science,information science,and informatics,and how these areas differ and overlap with computer science.Today the term data science is emerging that generates excitement and questions about how it relates to and differs from these other areas of study.展开更多
Traditional grid computing focuses on the movement of data to compute resources and the management of large scale simulations. Data grid computing focuses on moving the operations to the storage location and on operat...Traditional grid computing focuses on the movement of data to compute resources and the management of large scale simulations. Data grid computing focuses on moving the operations to the storage location and on operations on data collections. We present three types of data grid operations that facilitate data driven research: the manipulation of time series data, the reproducible execution of workflows, and the mapping of data access to software-defined networks. These data grid operations have been implemented as operations on collections within the NSF DataNet Federation Consortium project. The operations can be applied at the remote resource where data are stored, improving the ability of researchers to interact with large collections.展开更多
Purpose: To design an efficient high-performance algorithm for semantic annotation of biodiversity documents in Chinese.Design/methodology/approach: Data set consists of 1,000 randomly selected documents from Flora of...Purpose: To design an efficient high-performance algorithm for semantic annotation of biodiversity documents in Chinese.Design/methodology/approach: Data set consists of 1,000 randomly selected documents from Flora of China. Comparative evaluation of the proposed approach with the Na ve Bayes algorithm have been developed before for the same purpose.Findings: Experimental results show that the heuristics based algorithm outperformed the Na ve Bayes algorithm. The use of leading words helped improving the annotation performance while prioritizing rule application based on their weights had no significant impact on algorithm performance.Research limitations: The ICTCLAS was used to identify word boundaries off-shelf without optimatization for biodiversity domain. This may have not made the best use of the tool.Practical implications & Originality/value: The performance of heuristics based approach,enhanced by leading words analysis, reached an F value of 0.9216, which is sufficiently accurate for practical use.展开更多
With statistical analysis of Weibo altmetrics indicator, the paper explored features of a typical altmetrics indicator in Chinese environment. Results show that the coverage of Weibo is below 1%. However, the coverage...With statistical analysis of Weibo altmetrics indicator, the paper explored features of a typical altmetrics indicator in Chinese environment. Results show that the coverage of Weibo is below 1%. However, the coverage is underestimated due to limitation of tracking time and objects. Weibo mentions and discusses articles mainly from disciplines like 'General', 'Biochemistry, genetics and molecular biology', 'Health science', 'Medicine' and 'Life science' etc. In addition to traditional distinguished interdisciplinary journals like Nature, preprint platform and open access journal like ar Xiv, PLo S ONE and SSRN also drew much attention from Weibo. Meanwhile, 'Biology Science' and 'Medical science' have the most highlighted journals. Weibo mainly tracks latest articles, reflected in that articles tracked within 180 days occupy 68.66%, it also tracks classic articles. Weibo authors prefer to disseminate, recommend and criticize articles that are adherent to daily life, funny, useful or related to health, which conveys social value and scholarly value beyond citations. Weibo altmetrics indicator is highly scattered and concentrated. 5.1% of the articles have harvested 50% of the weibos. Besides, articles tracked by Weibo gain global attention much higher than the average level.展开更多
文摘There has long been discussion about the distinctions of library science,information science,and informatics,and how these areas differ and overlap with computer science.Today the term data science is emerging that generates excitement and questions about how it relates to and differs from these other areas of study.
文摘Traditional grid computing focuses on the movement of data to compute resources and the management of large scale simulations. Data grid computing focuses on moving the operations to the storage location and on operations on data collections. We present three types of data grid operations that facilitate data driven research: the manipulation of time series data, the reproducible execution of workflows, and the mapping of data access to software-defined networks. These data grid operations have been implemented as operations on collections within the NSF DataNet Federation Consortium project. The operations can be applied at the remote resource where data are stored, improving the ability of researchers to interact with large collections.
基金supported by the National Social Science Foundation of China (Grant No.:11BTQ024)the Foundation for Humanities and Social Sciences of the Chinese Ministry of Education (Grant No.:10YJC87004)
文摘Purpose: To design an efficient high-performance algorithm for semantic annotation of biodiversity documents in Chinese.Design/methodology/approach: Data set consists of 1,000 randomly selected documents from Flora of China. Comparative evaluation of the proposed approach with the Na ve Bayes algorithm have been developed before for the same purpose.Findings: Experimental results show that the heuristics based algorithm outperformed the Na ve Bayes algorithm. The use of leading words helped improving the annotation performance while prioritizing rule application based on their weights had no significant impact on algorithm performance.Research limitations: The ICTCLAS was used to identify word boundaries off-shelf without optimatization for biodiversity domain. This may have not made the best use of the tool.Practical implications & Originality/value: The performance of heuristics based approach,enhanced by leading words analysis, reached an F value of 0.9216, which is sufficiently accurate for practical use.
基金an outcome of the project “Theoretical and Empirical Studies of Altmetrics”(No.2014104010201)supported by Key Project of Special Funding from China University Fundamental Research Funding
文摘With statistical analysis of Weibo altmetrics indicator, the paper explored features of a typical altmetrics indicator in Chinese environment. Results show that the coverage of Weibo is below 1%. However, the coverage is underestimated due to limitation of tracking time and objects. Weibo mentions and discusses articles mainly from disciplines like 'General', 'Biochemistry, genetics and molecular biology', 'Health science', 'Medicine' and 'Life science' etc. In addition to traditional distinguished interdisciplinary journals like Nature, preprint platform and open access journal like ar Xiv, PLo S ONE and SSRN also drew much attention from Weibo. Meanwhile, 'Biology Science' and 'Medical science' have the most highlighted journals. Weibo mainly tracks latest articles, reflected in that articles tracked within 180 days occupy 68.66%, it also tracks classic articles. Weibo authors prefer to disseminate, recommend and criticize articles that are adherent to daily life, funny, useful or related to health, which conveys social value and scholarly value beyond citations. Weibo altmetrics indicator is highly scattered and concentrated. 5.1% of the articles have harvested 50% of the weibos. Besides, articles tracked by Weibo gain global attention much higher than the average level.