This paper focuses on research e-infrastructures in the open science era.We analyze some of the challenges and opportunities of cloud-based science and introduce an example of a national solution in the China Science ...This paper focuses on research e-infrastructures in the open science era.We analyze some of the challenges and opportunities of cloud-based science and introduce an example of a national solution in the China Science and Technology Cloud(CSTCloud).We selected three CSTCloud use cases in deploying open science modules,including scalable engineering in astronomical data management,integrated Earth-science resources for SDG-13 decision making,and the coupling of citizen science and artificial intelligence(AI)techniques in biodiversity.We conclude with a forecast on the future development of research e-infrastructures and introduce the idea of the Global Open Science Cloud(GOSC).We hope this analysis can provide some insights into the future development of research e-infrastructures in support of open science.展开更多
Open innovation benefits from access to cutting-edge discoveries to increase their transformation into tangible applications for the benefit of society.Improving research quality has been proposed as a primary objecti...Open innovation benefits from access to cutting-edge discoveries to increase their transformation into tangible applications for the benefit of society.Improving research quality has been proposed as a primary objective of open science by the United Nations,to increase science reproducibility,impact,and trust,leading to robust decision-making and policies.However,opening access to data and processes is insufficient for researchers to achieve open innovation in the context of globalization,for example,by gathering insights from external and internal sources.Developing the appropriate mindset to manage complexity and generate synergy among researchers in academia,industry,and the government is essential to catalyze knowledge and transform it into relevant innovations for society.To gain insights into the roles and challenges of researchers aiming to bridge the gap between open science and open innovation,a decade-plus Mapping Literature Review was conducted based on the complex thinking paradigm.Complex thinking allows for novel connections of the information collected through open science and open innovation,considering different forms of engaging with alternative means of knowledge creation that may promote innovative and critical thinking.The findings revealed:a)broad positioning of the terms in the European Union;b)open access and open data as current driving themes;c)a constant trade-off between the terms“open data”and“information protection”;d)lack of studies on researchers’complex thinking to help them manage openness;e)absence of the environmental helix in the initiatives;and(f)challenges in innovative communication and collaborative practices among public and private entities.Overall,we identified an opportunity to develop researchers’complex thinking such that the openness of information becomes a shared responsibility among partners across multiple helices.This shared responsibility can have methodological implications that permeate how open science and open innovation are theorized and,in practice,facilitate the development of fundamental collaborative research procedures.展开更多
Data Science(DS)as defined by Jim Gray is an emerging paradigm in all research areas to help finding non-obvious patterns of relevance in large distributed data collections.“Open Science by Design”(OSD),i.e.,making ...Data Science(DS)as defined by Jim Gray is an emerging paradigm in all research areas to help finding non-obvious patterns of relevance in large distributed data collections.“Open Science by Design”(OSD),i.e.,making artefacts such as data,metadata,models,and algorithms available and re-usable to peers and beyond as early as possible,is a pre-requisite for a flourishing DS landscape.However,a few major aspects can be identified hampering a fast transition:(1)The classical“Open Science by Publication”(OSP)is not sufficient any longer since it serves different functions,leads to non-acceptable delays and is associated with high curation costs.Changing data lab practices towards OSD requires more fundamental changes than OSP.(2)The classical publication-oriented models for metrics,mainly informed by citations,will not work anymore since the roles of contributors are more difficult to assess and will often change,i.e.,other ways for assigning incentives and recognition need to be found.(3)The huge investments in developing DS skills and capacities by some global companies and strong countries is leading to imbalances and fears by different stakeholders hampering the acceptance of Open Science(OS).(4)Finally,OSD will depend on the availability of a global infrastructure fostering an integrated and interoperable data domain-“one data-domain”as George Strawn calls it-which is still not visible due to differences about the technological key pillars.OS therefore is a need for DS,but it will take much more time to implement it than we may have expected.展开更多
Research Data Management(RDM)has become increasingly important for more and more academic institutions.Using the Peking University Open Research Data Repository(PKU-ORDR)project as an example,this paper will review a ...Research Data Management(RDM)has become increasingly important for more and more academic institutions.Using the Peking University Open Research Data Repository(PKU-ORDR)project as an example,this paper will review a library-based university-wide open research data repository project and related RDM services implementation process including project kickoff,needs assessment,partnerships establishment,software investigation and selection,software customization,as well as data curation services and training.Through the review,some issues revealed during the stages of the implementation process are also discussed and addressed in the paper such as awareness of research data,demands from data providers and users,data policies and requirements from home institution,requirements from funding agencies and publishers,the collaboration between administrative units and libraries,and concerns from data providers and users.The significance of the study is that the paper shows an example of creating an Open Data repository and RDM services for other Chinese academic libraries planning to implement their RDM services for their home institutions.The authors of the paper have also observed since the PKU-ORDR and RDM services implemented in 2015,the Peking University Library(PKUL)has helped numerous researchers to support the entire research life cycle and enhanced Open Science(OS)practices on campus,as well as impacted the national OS movement in China through various national events and activities hosted by the PKUL.展开更多
The introduction of a new technology or innovation is often accompanied by“ups and downs”in its fortunes.Gartner Inc.defined a so-called hype cycle to describe a general pattern that many innovations experience:tech...The introduction of a new technology or innovation is often accompanied by“ups and downs”in its fortunes.Gartner Inc.defined a so-called hype cycle to describe a general pattern that many innovations experience:technology trigger,peak of inflated expectations,trough of disillusionment,slope of enlightenment,and plateau of productivity.This article will compare the ongoing introduction of Open Science(OS)with the hype cycle model and speculate on the relevance of that model to OS.Lest the title of this article mislead the reader,be assured that the author believes that OS should happen and that it will happen.However,I also believe that the path to OS will be longer than many of us had hoped.I will give a brief history of the today’s“semi-open”science,define what I mean by OS,define the hype cycle and where OS is now on that cycle,and finally speculate what it will take to traverse the cycle and rise to its plateau of productivity(as described by Gartner).展开更多
Citizen Science(CS)is a prominent field of application for Open Science(OS),and the two have strong synergies,such as:advocating for the data and metadata generated through science to be made publicly available[1];sup...Citizen Science(CS)is a prominent field of application for Open Science(OS),and the two have strong synergies,such as:advocating for the data and metadata generated through science to be made publicly available[1];supporting more equitable collaboration between different types of scientists and citizens;and facilitating knowledge transfer to a wider range of audiences[2].While primarily targeted at CS,the EU-Citizen.Science platform can also support OS.One of its key functions is to act as a knowledge hub to aggregate,disseminate and promote experience and know-how;for example,by profiling CS projects and collecting tools,resources and training materials relevant to both fields.To do this,the platform has developed an information architecture that incorporates the public participation in scientific research(PPSR)-Common Conceptual Model.This model consists of the Project Metadata Model,the Dataset Metadata Model and the Observation Data Model,which were specifically developed for CS initiatives.By implementing these,the platform will strengthen the interoperating arrangements that exist between other,similar platforms(e.g.,BioCollect and SciStarter)to ensure that CS and OS continue to grow globally in terms of participants,impact and fields of application.展开更多
Semantic technologies have emerged as a prominent research area within Big Earth Data.These technologies have provided significant benefits for data discovery and integration.Yet,the formality of the Semantic Web,in l...Semantic technologies have emerged as a prominent research area within Big Earth Data.These technologies have provided significant benefits for data discovery and integration.Yet,the formality of the Semantic Web,in languages such as the Web Ontology Language(OWL),does not always integrate well with the numerical,statistical,and geometric methods of the geosciences.Two prominent challenges in this area are how to semantically model individual measurements and what to do when geoscience needs are not addressed by languages such as OWL.This has led to a fragmented Big Earth Data community with either no solution or incompatible semantic solutions.We use an oceanographic example to highlight the limitations and challenges surrounding the semantic encoding of observations and the use of semantics during analysis.We then present potential solutions to each challenge showing that a full end-to-end application of semantic technologies is not only feasible,but beneficial to Big Earth Data.展开更多
This article will document how the European Open Science Cloud(EOSC)emerged as one of the key policy intentions to foster Open Science(OS)in Europe.It will describe some of the typical,non-rational roadblocks on the w...This article will document how the European Open Science Cloud(EOSC)emerged as one of the key policy intentions to foster Open Science(OS)in Europe.It will describe some of the typical,non-rational roadblocks on the way to implement EOSC.The article will also argue that the only way Europe can take care of its research data in a way that fits the European specificities fully,is by supporting EOSC.展开更多
This article invites us to a concise walk through the past,offering insights defined by the major challenges science encountered during the centuries.Some lessons for today and tomorrow are enumerated in the three sec...This article invites us to a concise walk through the past,offering insights defined by the major challenges science encountered during the centuries.Some lessons for today and tomorrow are enumerated in the three sections of the article,and they go beyond the relatively few perspectives offered by today’s Data Science:Open Science(OS)is what has always happened and is nothing new,because science has always sought to be open.Esthetical values played a relevant role in the past.Former scientists recognized the intrinsic relation between the way they opened science and the way they followed the principles of beauty and the sense of esthetic.Their groundbreaking heritage still inspires us in being ready to open new ways in science.Whereas Latin was the original lingua franca of European science,and English is the recent lingua franca,the new lingua franca is software.Pieces of software are the filter,which connect researchers to the world,through layers of data.They assist in observing,in choosing,and in selecting.Open scientists should be aware of the fact that their autonomy in science depends on the quality of these pieces.Another lesson is that ethics-regarded as a source of innovative activities-must be a core component of innovative processes in OS,because society needs a responsible use of data and algorithms in corresponding practices that serve OS.展开更多
The scientific,social,and economic advantages that accrue from Open Science(OS)practices-ways of doing research that emphasize reproducibility,transparency,and accessibility at all stages of the research cycle-are now...The scientific,social,and economic advantages that accrue from Open Science(OS)practices-ways of doing research that emphasize reproducibility,transparency,and accessibility at all stages of the research cycle-are now widely recognized in nations around the world and by international bodies such as the United Nations and the Organization for Economic Cooperation and Development.However,program wide or coordinated instruction of undergraduate students in OS practices remains uncommon.At the University of British Columbia in Canada,we have started to develop a comprehensive undergraduate OS program that can be adapted to and woven into diverse subject curricula.We report on the context and planning of the pilot module of the program,“Open Science 101”,its implementation in first-year Biology in Fall 2019,and qualitative results of an attitudinal survey of students following their course.展开更多
Purpose:The open-access(OA)publishing model can help improve researchers’outreach,thanks to its accessibility and visibility to the public.Therefore,the presentation of female researchers can benefit from the O A pub...Purpose:The open-access(OA)publishing model can help improve researchers’outreach,thanks to its accessibility and visibility to the public.Therefore,the presentation of female researchers can benefit from the O A publishing model.Despite that,little is known about how gender affects OA practices.Thus,the current study explores the effects of female involvement and risk aversion on OA publishing patterns among Vietname se social sciences and humanities.Design/methodology/approach:The study employed Bayesian Mindsponge Framework(BMF)on a dataset of 3,122 Vietnamese social sciences and humanities(SS&H)publications during 2008-2019.The Mindsponge mechanism was specifically used to construct theoretical models,while Bayesian inference was utilized for fitting models.Findings:The result showed a positive association between female participation and OA publishing probability.However,the positive effect of female involvement on OA publishing probability was negated by the high ratio of female researchers in a publication.OA status was negatively associated with the JIF of the journal in which the publication was published,but the relationship was moderated by the involvement of a female researcher(s).The findings suggested that Vietnamese female researchers might be more likely to publish under the OA model in journals with high JIF for avoiding the risk of public criticism.Research limitations:The study could only provide evidence on the association between female involvement and OA publishing probability.However,whether to publish under OA terms is often determined by the first or corresponding authors,but not necessarily gender-based.Practical implications:Systematically coordinated actions are suggested to better support women and promote the OA movement in Vietnam.Originality/value:The findings show the OA publishing patterns of female researchers in Vietnamese SS&H.展开更多
Collaboration and the sharing of knowledge is at the heart of Open Science(OS).However,we need to know that the knowledge we find and share is really what it purports to be;and we need to know that the authors we hope...Collaboration and the sharing of knowledge is at the heart of Open Science(OS).However,we need to know that the knowledge we find and share is really what it purports to be;and we need to know that the authors we hope to collaborate with are really the people they claim to be.In this paper,the author argues that a prerequisite for OS is trust and that persistent identifiers help to build that trust.The persistent identifier systems must themselves be trustworthy and they must be able to connect the user or their machine to the information they need now and into the future.Infrastructure is rather like plumbing:It goes unnoticed and unappreciated until it fails.This paper puts infrastructure for persistent identifiers in the spotlight as a core component of OS.展开更多
The COVID-19 pandemic highlights the urgent need to strengthen global scientific collaboration,and to ensure the fundamental right to universal access to scientific progress and its applications.Open Science(OS)is cen...The COVID-19 pandemic highlights the urgent need to strengthen global scientific collaboration,and to ensure the fundamental right to universal access to scientific progress and its applications.Open Science(OS)is central to achieving these goals.It aims to make science accessible,transparent,and effective by providing barrier-free access to scientific publications,data,and infrastructures,along with open software,Open Educational Resources,and open technologies.OS also promotes public trust in science at a time when it has never been more important to do so.Over the past decade,momentum towards the widespread adoption of OS practices has been primarily driven by declarations(e.g.,DORA,the Leiden Manifesto).These serve an important role,but for OS to truly take root,researchers also must be fully incentivized and rewarded for its practice.This requires research funders and academic leaders to take the lead in collaborating,with researchers in designing,and implementing new incentive structures,and to actively work to socialize these throughout the research ecosystem.The US National Academies of Science,Engineering,and Medicine(NASEM)Roundtable on Aligning Research Incentives for OS is one such effort.This paper examines the strategy behind convening the Roundtable,its current participant makeup,focus,and outputs.It also explores how this approach might be expanded and adapted throughout the global OS community.展开更多
The Open Science(OS)movement has achieved extraordinary results in very few years.In this paper I argue it is now necessary to embed OS in the wider ecosystem of research and innovation,acknowledging some of the outst...The Open Science(OS)movement has achieved extraordinary results in very few years.In this paper I argue it is now necessary to embed OS in the wider ecosystem of research and innovation,acknowledging some of the outstanding issues that need to be resolved as it beds down into the way research is done in the future.By sticking to a purest approach to OS its impact and current momentum may be lost.Digital technologies and global connectivity have ensured that OS is here to stay and will continue to expand its influence in the future.However,OS cannot stand aloof from what is the reality of what is happening elsewhere otherwise it will do a disservice to itself and the challenges facing the world.展开更多
Purpose:Researchers are more likely to read and cite papers to which they have access than those that they cannot obtain.Thus,the objective of this work is to analyze the contribution of the Open Access(OA)modality to...Purpose:Researchers are more likely to read and cite papers to which they have access than those that they cannot obtain.Thus,the objective of this work is to analyze the contribution of the Open Access(OA)modality to the impact of hybrid journals.Design/methodology/approach:The“research articles”in the year 2017 from 200 hybrid journals in four subject areas,and the citations received by such articles in the period 2017-2020 in the Scopus database,were analyzed.The hybrid OA papers were compared with the paywalled ones.The journals were randomly selected from those with share of OA papers higher than some minimal value.More than 60 thousand research articles were analyzed in the sample,of which 24%under the OA modality.Findings:We obtain at journal level that cites per article in both hybrid modalities(OA and paywalled)strongly correlate.However,there is no correlation between the OA prevalence and cites per article.There is OA citation advantage in 80%of hybrid journals.Moreover,the OA citation advantage is consistent across fields and held in time.We obtain an OA citation advantage of 50%in average,and higher than 37%in half of the hybrid journals.Finally,the OA citation advantage is higher in Humanities than in Science and Social Science.Research limitations:Some of the citation advantage is likely due to more access allows more people to read and hence cite articles they otherwise would not.However,causation is difficult to establish and there are many possible bias.Several factors can affect the observed differences in citation rates.Funder mandates can be one of them.Funders are likely to have OA requirement,and well-funded studies are more likely to receive more citations than poorly funded studies.Another discussed factor is the selection bias postulate,which suggests that authors choose only their most impactful studies to be open access.Practical implications:For hybrid journals,the open access modality is positive,in the sense that it provides a greater number of potential readers.This in turn translates into a greater number of citations and an improvement in the position of the journal in the rankings by impact factor.For researchers it is also positive because it increases the potential number of readers and citations received.Originality/value:Our study refines previous results by comparing documents more similar to each other.Although it does not examine the cause of the observed citation advantage,we find that it exists in a very large sample.展开更多
Purpose: To understand how authors and reviewers are accepting and embracing Open Peer Review(OPR), one of the newest innovations in the Open Science movement.Design/methodology/approach: This research collected and a...Purpose: To understand how authors and reviewers are accepting and embracing Open Peer Review(OPR), one of the newest innovations in the Open Science movement.Design/methodology/approach: This research collected and analyzed data from the Open Access journal Peer J over its first three years(2013–2016). Web data were scraped, cleaned, and structured using several Web tools and programs. The structured data were imported into a relational database. Data analyses were conducted using analytical tools as well as programs developed by the researchers.Findings: Peer J, which supports optional OPR, has a broad international representation of authors and referees. Approximately 73.89% of articles provide full review histories. Of the articles with published review histories, 17.61% had identities of all reviewers and 52.57% had at least one signed reviewer. In total, 43.23% of all reviews were signed. The observed proportions of signed reviews have been relatively stable over the period since the Journal’s inception.Research limitations: This research is constrained by the availability of the peer review history data. Some peer reviews were not available when the authors opted out of publishing their review histories. The anonymity of reviewers made it impossible to give an accurate count of reviewers who contributed to the review process. Practical implications: These findings shed light on the current characteristics of OPR. Given the policy that authors are encouraged to make their articles’ review history public and referees are encouraged to sign their review reports, the three years of Peer J review data demonstrate that there is still some reluctance by authors to make their reviews public and by reviewers to identify themselves. Originality/value: This is the first study to closely examine Peer J as an example of an OPR model journal. As Open Science moves further towards open research, OPR is a final and critical component. Research in this area must identify the best policies and paths towards a transparent and open peer review process for scientific communication.展开更多
Purpose:This work aims to normalize the NLPCONTRIBUTIONS scheme(henceforward,NLPCONTRIBUTIONGRAPH)to structure,directly from article sentences,the contributions information in Natural Language Processing(NLP)scholarly...Purpose:This work aims to normalize the NLPCONTRIBUTIONS scheme(henceforward,NLPCONTRIBUTIONGRAPH)to structure,directly from article sentences,the contributions information in Natural Language Processing(NLP)scholarly articles via a two-stage annotation methodology:1)pilot stage-to define the scheme(described in prior work);and 2)adjudication stage-to normalize the graphing model(the focus of this paper).Design/methodology/approach:We re-annotate,a second time,the contributions-pertinent information across 50 prior-annotated NLP scholarly articles in terms of a data pipeline comprising:contribution-centered sentences,phrases,and triple statements.To this end,specifically,care was taken in the adjudication annotation stage to reduce annotation noise while formulating the guidelines for our proposed novel NLP contributions structuring and graphing scheme.Findings:The application of NLPCONTRIBUTIONGRAPH on the 50 articles resulted finally in a dataset of 900 contribution-focused sentences,4,702 contribution-information-centered phrases,and 2,980 surface-structured triples.The intra-annotation agreement between the first and second stages,in terms of F1-score,was 67.92%for sentences,41.82%for phrases,and 22.31%for triple statements indicating that with increased granularity of the information,the annotation decision variance is greater.Research limitations:NLPCONTRIBUTIONGRAPH has limited scope for structuring scholarly contributions compared with STEM(Science,Technology,Engineering,and Medicine)scholarly knowledge at large.Further,the annotation scheme in this work is designed by only an intra-annotator consensus-a single annotator first annotated the data to propose the initial scheme,following which,the same annotator reannotated the data to normalize the annotations in an adjudication stage.However,the expected goal of this work is to achieve a standardized retrospective model of capturing NLP contributions from scholarly articles.This would entail a larger initiative of enlisting multiple annotators to accommodate different worldviews into a“single”set of structures and relationships as the final scheme.Given that the initial scheme is first proposed and the complexity of the annotation task in the realistic timeframe,our intraannotation procedure is well-suited.Nevertheless,the model proposed in this work is presently limited since it does not incorporate multiple annotator worldviews.This is planned as future work to produce a robust model.Practical implications:We demonstrate NLPCONTRIBUTIONGRAPH data integrated into the Open Research Knowledge Graph(ORKG),a next-generation KG-based digital library with intelligent computations enabled over structured scholarly knowledge,as a viable aid to assist researchers in their day-to-day tasks.Originality/value:NLPCONTRIBUTIONGRAPH is a novel scheme to annotate research contributions from NLP articles and integrate them in a knowledge graph,which to the best of our knowledge does not exist in the community.Furthermore,our quantitative evaluations over the two-stage annotation tasks offer insights into task difficulty.展开更多
基金supported by funding from the National Key R&D Program of China(No.2021YFE0111500)the National Natural Science Foundation of China(No.72104229)+1 种基金the CAS Program for fostering international mega-science(No.241711KYSB20200023)the CAS President's International Fellowship Initiative(No.2021VTA0006).
文摘This paper focuses on research e-infrastructures in the open science era.We analyze some of the challenges and opportunities of cloud-based science and introduce an example of a national solution in the China Science and Technology Cloud(CSTCloud).We selected three CSTCloud use cases in deploying open science modules,including scalable engineering in astronomical data management,integrated Earth-science resources for SDG-13 decision making,and the coupling of citizen science and artificial intelligence(AI)techniques in biodiversity.We conclude with a forecast on the future development of research e-infrastructures and introduce the idea of the Global Open Science Cloud(GOSC).We hope this analysis can provide some insights into the future development of research e-infrastructures in support of open science.
基金the financial support of Tecnologico de Monterrey through the“Challenge-Based Research Funding Program 2022”.Project ID#I001-IFE001-C1-T1-E.
文摘Open innovation benefits from access to cutting-edge discoveries to increase their transformation into tangible applications for the benefit of society.Improving research quality has been proposed as a primary objective of open science by the United Nations,to increase science reproducibility,impact,and trust,leading to robust decision-making and policies.However,opening access to data and processes is insufficient for researchers to achieve open innovation in the context of globalization,for example,by gathering insights from external and internal sources.Developing the appropriate mindset to manage complexity and generate synergy among researchers in academia,industry,and the government is essential to catalyze knowledge and transform it into relevant innovations for society.To gain insights into the roles and challenges of researchers aiming to bridge the gap between open science and open innovation,a decade-plus Mapping Literature Review was conducted based on the complex thinking paradigm.Complex thinking allows for novel connections of the information collected through open science and open innovation,considering different forms of engaging with alternative means of knowledge creation that may promote innovative and critical thinking.The findings revealed:a)broad positioning of the terms in the European Union;b)open access and open data as current driving themes;c)a constant trade-off between the terms“open data”and“information protection”;d)lack of studies on researchers’complex thinking to help them manage openness;e)absence of the environmental helix in the initiatives;and(f)challenges in innovative communication and collaborative practices among public and private entities.Overall,we identified an opportunity to develop researchers’complex thinking such that the openness of information becomes a shared responsibility among partners across multiple helices.This shared responsibility can have methodological implications that permeate how open science and open innovation are theorized and,in practice,facilitate the development of fundamental collaborative research procedures.
文摘Data Science(DS)as defined by Jim Gray is an emerging paradigm in all research areas to help finding non-obvious patterns of relevance in large distributed data collections.“Open Science by Design”(OSD),i.e.,making artefacts such as data,metadata,models,and algorithms available and re-usable to peers and beyond as early as possible,is a pre-requisite for a flourishing DS landscape.However,a few major aspects can be identified hampering a fast transition:(1)The classical“Open Science by Publication”(OSP)is not sufficient any longer since it serves different functions,leads to non-acceptable delays and is associated with high curation costs.Changing data lab practices towards OSD requires more fundamental changes than OSP.(2)The classical publication-oriented models for metrics,mainly informed by citations,will not work anymore since the roles of contributors are more difficult to assess and will often change,i.e.,other ways for assigning incentives and recognition need to be found.(3)The huge investments in developing DS skills and capacities by some global companies and strong countries is leading to imbalances and fears by different stakeholders hampering the acceptance of Open Science(OS).(4)Finally,OSD will depend on the availability of a global infrastructure fostering an integrated and interoperable data domain-“one data-domain”as George Strawn calls it-which is still not visible due to differences about the technological key pillars.OS therefore is a need for DS,but it will take much more time to implement it than we may have expected.
文摘Research Data Management(RDM)has become increasingly important for more and more academic institutions.Using the Peking University Open Research Data Repository(PKU-ORDR)project as an example,this paper will review a library-based university-wide open research data repository project and related RDM services implementation process including project kickoff,needs assessment,partnerships establishment,software investigation and selection,software customization,as well as data curation services and training.Through the review,some issues revealed during the stages of the implementation process are also discussed and addressed in the paper such as awareness of research data,demands from data providers and users,data policies and requirements from home institution,requirements from funding agencies and publishers,the collaboration between administrative units and libraries,and concerns from data providers and users.The significance of the study is that the paper shows an example of creating an Open Data repository and RDM services for other Chinese academic libraries planning to implement their RDM services for their home institutions.The authors of the paper have also observed since the PKU-ORDR and RDM services implemented in 2015,the Peking University Library(PKUL)has helped numerous researchers to support the entire research life cycle and enhanced Open Science(OS)practices on campus,as well as impacted the national OS movement in China through various national events and activities hosted by the PKUL.
文摘The introduction of a new technology or innovation is often accompanied by“ups and downs”in its fortunes.Gartner Inc.defined a so-called hype cycle to describe a general pattern that many innovations experience:technology trigger,peak of inflated expectations,trough of disillusionment,slope of enlightenment,and plateau of productivity.This article will compare the ongoing introduction of Open Science(OS)with the hype cycle model and speculate on the relevance of that model to OS.Lest the title of this article mislead the reader,be assured that the author believes that OS should happen and that it will happen.However,I also believe that the path to OS will be longer than many of us had hoped.I will give a brief history of the today’s“semi-open”science,define what I mean by OS,define the hype cycle and where OS is now on that cycle,and finally speculate what it will take to traverse the cycle and rise to its plateau of productivity(as described by Gartner).
基金The EU-Citizen.Science project received funding from the EU’s Horizon 2020 Framework Program for Research and Innovation under grant agreement No.824580The research described in this paper is partly supported by the project“Citizen Science to promote creativity,scientific literacy,and innovation throughout Europe”(COST Action),which received funding from the EU’s Horizon 2020 Framework Program for Research and Innovation under grant agreement No.15212The opinions expressed are those of the authors and not necessarily those of the COST Action or the European Commission.
文摘Citizen Science(CS)is a prominent field of application for Open Science(OS),and the two have strong synergies,such as:advocating for the data and metadata generated through science to be made publicly available[1];supporting more equitable collaboration between different types of scientists and citizens;and facilitating knowledge transfer to a wider range of audiences[2].While primarily targeted at CS,the EU-Citizen.Science platform can also support OS.One of its key functions is to act as a knowledge hub to aggregate,disseminate and promote experience and know-how;for example,by profiling CS projects and collecting tools,resources and training materials relevant to both fields.To do this,the platform has developed an information architecture that incorporates the public participation in scientific research(PPSR)-Common Conceptual Model.This model consists of the Project Metadata Model,the Dataset Metadata Model and the Observation Data Model,which were specifically developed for CS initiatives.By implementing these,the platform will strengthen the interoperating arrangements that exist between other,similar platforms(e.g.,BioCollect and SciStarter)to ensure that CS and OS continue to grow globally in terms of participants,impact and fields of application.
基金This work was supported from the National Science Foundation grant Earth Cube Building Blocks:Collaborative Proposal:GeoLink-Leveraging Semantics and Linked Data for Data Sharing and Discovery in the Geosciences.The authors would like to acknowledge their respective collaborative grants numbered 1435578 and 1440139.
文摘Semantic technologies have emerged as a prominent research area within Big Earth Data.These technologies have provided significant benefits for data discovery and integration.Yet,the formality of the Semantic Web,in languages such as the Web Ontology Language(OWL),does not always integrate well with the numerical,statistical,and geometric methods of the geosciences.Two prominent challenges in this area are how to semantically model individual measurements and what to do when geoscience needs are not addressed by languages such as OWL.This has led to a fragmented Big Earth Data community with either no solution or incompatible semantic solutions.We use an oceanographic example to highlight the limitations and challenges surrounding the semantic encoding of observations and the use of semantics during analysis.We then present potential solutions to each challenge showing that a full end-to-end application of semantic technologies is not only feasible,but beneficial to Big Earth Data.
文摘This article will document how the European Open Science Cloud(EOSC)emerged as one of the key policy intentions to foster Open Science(OS)in Europe.It will describe some of the typical,non-rational roadblocks on the way to implement EOSC.The article will also argue that the only way Europe can take care of its research data in a way that fits the European specificities fully,is by supporting EOSC.
文摘This article invites us to a concise walk through the past,offering insights defined by the major challenges science encountered during the centuries.Some lessons for today and tomorrow are enumerated in the three sections of the article,and they go beyond the relatively few perspectives offered by today’s Data Science:Open Science(OS)is what has always happened and is nothing new,because science has always sought to be open.Esthetical values played a relevant role in the past.Former scientists recognized the intrinsic relation between the way they opened science and the way they followed the principles of beauty and the sense of esthetic.Their groundbreaking heritage still inspires us in being ready to open new ways in science.Whereas Latin was the original lingua franca of European science,and English is the recent lingua franca,the new lingua franca is software.Pieces of software are the filter,which connect researchers to the world,through layers of data.They assist in observing,in choosing,and in selecting.Open scientists should be aware of the fact that their autonomy in science depends on the quality of these pieces.Another lesson is that ethics-regarded as a source of innovative activities-must be a core component of innovative processes in OS,because society needs a responsible use of data and algorithms in corresponding practices that serve OS.
文摘The scientific,social,and economic advantages that accrue from Open Science(OS)practices-ways of doing research that emphasize reproducibility,transparency,and accessibility at all stages of the research cycle-are now widely recognized in nations around the world and by international bodies such as the United Nations and the Organization for Economic Cooperation and Development.However,program wide or coordinated instruction of undergraduate students in OS practices remains uncommon.At the University of British Columbia in Canada,we have started to develop a comprehensive undergraduate OS program that can be adapted to and woven into diverse subject curricula.We report on the context and planning of the pilot module of the program,“Open Science 101”,its implementation in first-year Biology in Fall 2019,and qualitative results of an attitudinal survey of students following their course.
基金supported by National Foundation for Science and Technology Development 502.01-2018.19.
文摘Purpose:The open-access(OA)publishing model can help improve researchers’outreach,thanks to its accessibility and visibility to the public.Therefore,the presentation of female researchers can benefit from the O A publishing model.Despite that,little is known about how gender affects OA practices.Thus,the current study explores the effects of female involvement and risk aversion on OA publishing patterns among Vietname se social sciences and humanities.Design/methodology/approach:The study employed Bayesian Mindsponge Framework(BMF)on a dataset of 3,122 Vietnamese social sciences and humanities(SS&H)publications during 2008-2019.The Mindsponge mechanism was specifically used to construct theoretical models,while Bayesian inference was utilized for fitting models.Findings:The result showed a positive association between female participation and OA publishing probability.However,the positive effect of female involvement on OA publishing probability was negated by the high ratio of female researchers in a publication.OA status was negatively associated with the JIF of the journal in which the publication was published,but the relationship was moderated by the involvement of a female researcher(s).The findings suggested that Vietnamese female researchers might be more likely to publish under the OA model in journals with high JIF for avoiding the risk of public criticism.Research limitations:The study could only provide evidence on the association between female involvement and OA publishing probability.However,whether to publish under OA terms is often determined by the first or corresponding authors,but not necessarily gender-based.Practical implications:Systematically coordinated actions are suggested to better support women and promote the OA movement in Vietnam.Originality/value:The findings show the OA publishing patterns of female researchers in Vietnamese SS&H.
文摘Collaboration and the sharing of knowledge is at the heart of Open Science(OS).However,we need to know that the knowledge we find and share is really what it purports to be;and we need to know that the authors we hope to collaborate with are really the people they claim to be.In this paper,the author argues that a prerequisite for OS is trust and that persistent identifiers help to build that trust.The persistent identifier systems must themselves be trustworthy and they must be able to connect the user or their machine to the information they need now and into the future.Infrastructure is rather like plumbing:It goes unnoticed and unappreciated until it fails.This paper puts infrastructure for persistent identifiers in the spotlight as a core component of OS.
文摘The COVID-19 pandemic highlights the urgent need to strengthen global scientific collaboration,and to ensure the fundamental right to universal access to scientific progress and its applications.Open Science(OS)is central to achieving these goals.It aims to make science accessible,transparent,and effective by providing barrier-free access to scientific publications,data,and infrastructures,along with open software,Open Educational Resources,and open technologies.OS also promotes public trust in science at a time when it has never been more important to do so.Over the past decade,momentum towards the widespread adoption of OS practices has been primarily driven by declarations(e.g.,DORA,the Leiden Manifesto).These serve an important role,but for OS to truly take root,researchers also must be fully incentivized and rewarded for its practice.This requires research funders and academic leaders to take the lead in collaborating,with researchers in designing,and implementing new incentive structures,and to actively work to socialize these throughout the research ecosystem.The US National Academies of Science,Engineering,and Medicine(NASEM)Roundtable on Aligning Research Incentives for OS is one such effort.This paper examines the strategy behind convening the Roundtable,its current participant makeup,focus,and outputs.It also explores how this approach might be expanded and adapted throughout the global OS community.
文摘The Open Science(OS)movement has achieved extraordinary results in very few years.In this paper I argue it is now necessary to embed OS in the wider ecosystem of research and innovation,acknowledging some of the outstanding issues that need to be resolved as it beds down into the way research is done in the future.By sticking to a purest approach to OS its impact and current momentum may be lost.Digital technologies and global connectivity have ensured that OS is here to stay and will continue to expand its influence in the future.However,OS cannot stand aloof from what is the reality of what is happening elsewhere otherwise it will do a disservice to itself and the challenges facing the world.
文摘Purpose:Researchers are more likely to read and cite papers to which they have access than those that they cannot obtain.Thus,the objective of this work is to analyze the contribution of the Open Access(OA)modality to the impact of hybrid journals.Design/methodology/approach:The“research articles”in the year 2017 from 200 hybrid journals in four subject areas,and the citations received by such articles in the period 2017-2020 in the Scopus database,were analyzed.The hybrid OA papers were compared with the paywalled ones.The journals were randomly selected from those with share of OA papers higher than some minimal value.More than 60 thousand research articles were analyzed in the sample,of which 24%under the OA modality.Findings:We obtain at journal level that cites per article in both hybrid modalities(OA and paywalled)strongly correlate.However,there is no correlation between the OA prevalence and cites per article.There is OA citation advantage in 80%of hybrid journals.Moreover,the OA citation advantage is consistent across fields and held in time.We obtain an OA citation advantage of 50%in average,and higher than 37%in half of the hybrid journals.Finally,the OA citation advantage is higher in Humanities than in Science and Social Science.Research limitations:Some of the citation advantage is likely due to more access allows more people to read and hence cite articles they otherwise would not.However,causation is difficult to establish and there are many possible bias.Several factors can affect the observed differences in citation rates.Funder mandates can be one of them.Funders are likely to have OA requirement,and well-funded studies are more likely to receive more citations than poorly funded studies.Another discussed factor is the selection bias postulate,which suggests that authors choose only their most impactful studies to be open access.Practical implications:For hybrid journals,the open access modality is positive,in the sense that it provides a greater number of potential readers.This in turn translates into a greater number of citations and an improvement in the position of the journal in the rankings by impact factor.For researchers it is also positive because it increases the potential number of readers and citations received.Originality/value:Our study refines previous results by comparing documents more similar to each other.Although it does not examine the cause of the observed citation advantage,we find that it exists in a very large sample.
文摘Purpose: To understand how authors and reviewers are accepting and embracing Open Peer Review(OPR), one of the newest innovations in the Open Science movement.Design/methodology/approach: This research collected and analyzed data from the Open Access journal Peer J over its first three years(2013–2016). Web data were scraped, cleaned, and structured using several Web tools and programs. The structured data were imported into a relational database. Data analyses were conducted using analytical tools as well as programs developed by the researchers.Findings: Peer J, which supports optional OPR, has a broad international representation of authors and referees. Approximately 73.89% of articles provide full review histories. Of the articles with published review histories, 17.61% had identities of all reviewers and 52.57% had at least one signed reviewer. In total, 43.23% of all reviews were signed. The observed proportions of signed reviews have been relatively stable over the period since the Journal’s inception.Research limitations: This research is constrained by the availability of the peer review history data. Some peer reviews were not available when the authors opted out of publishing their review histories. The anonymity of reviewers made it impossible to give an accurate count of reviewers who contributed to the review process. Practical implications: These findings shed light on the current characteristics of OPR. Given the policy that authors are encouraged to make their articles’ review history public and referees are encouraged to sign their review reports, the three years of Peer J review data demonstrate that there is still some reluctance by authors to make their reviews public and by reviewers to identify themselves. Originality/value: This is the first study to closely examine Peer J as an example of an OPR model journal. As Open Science moves further towards open research, OPR is a final and critical component. Research in this area must identify the best policies and paths towards a transparent and open peer review process for scientific communication.
基金This work was co-funded by the European Research Council for the project ScienceGRAPH(Grant agreement ID:819536)by the TIB Leibniz Information Centre for Science and Technology.
文摘Purpose:This work aims to normalize the NLPCONTRIBUTIONS scheme(henceforward,NLPCONTRIBUTIONGRAPH)to structure,directly from article sentences,the contributions information in Natural Language Processing(NLP)scholarly articles via a two-stage annotation methodology:1)pilot stage-to define the scheme(described in prior work);and 2)adjudication stage-to normalize the graphing model(the focus of this paper).Design/methodology/approach:We re-annotate,a second time,the contributions-pertinent information across 50 prior-annotated NLP scholarly articles in terms of a data pipeline comprising:contribution-centered sentences,phrases,and triple statements.To this end,specifically,care was taken in the adjudication annotation stage to reduce annotation noise while formulating the guidelines for our proposed novel NLP contributions structuring and graphing scheme.Findings:The application of NLPCONTRIBUTIONGRAPH on the 50 articles resulted finally in a dataset of 900 contribution-focused sentences,4,702 contribution-information-centered phrases,and 2,980 surface-structured triples.The intra-annotation agreement between the first and second stages,in terms of F1-score,was 67.92%for sentences,41.82%for phrases,and 22.31%for triple statements indicating that with increased granularity of the information,the annotation decision variance is greater.Research limitations:NLPCONTRIBUTIONGRAPH has limited scope for structuring scholarly contributions compared with STEM(Science,Technology,Engineering,and Medicine)scholarly knowledge at large.Further,the annotation scheme in this work is designed by only an intra-annotator consensus-a single annotator first annotated the data to propose the initial scheme,following which,the same annotator reannotated the data to normalize the annotations in an adjudication stage.However,the expected goal of this work is to achieve a standardized retrospective model of capturing NLP contributions from scholarly articles.This would entail a larger initiative of enlisting multiple annotators to accommodate different worldviews into a“single”set of structures and relationships as the final scheme.Given that the initial scheme is first proposed and the complexity of the annotation task in the realistic timeframe,our intraannotation procedure is well-suited.Nevertheless,the model proposed in this work is presently limited since it does not incorporate multiple annotator worldviews.This is planned as future work to produce a robust model.Practical implications:We demonstrate NLPCONTRIBUTIONGRAPH data integrated into the Open Research Knowledge Graph(ORKG),a next-generation KG-based digital library with intelligent computations enabled over structured scholarly knowledge,as a viable aid to assist researchers in their day-to-day tasks.Originality/value:NLPCONTRIBUTIONGRAPH is a novel scheme to annotate research contributions from NLP articles and integrate them in a knowledge graph,which to the best of our knowledge does not exist in the community.Furthermore,our quantitative evaluations over the two-stage annotation tasks offer insights into task difficulty.