Research data infrastructures form the cornerstone in both cyber and physical spaces,driving the progression of the data-intensive scientific research paradigm.This opinion paper presents an overview of global researc...Research data infrastructures form the cornerstone in both cyber and physical spaces,driving the progression of the data-intensive scientific research paradigm.This opinion paper presents an overview of global research data infrastructure,drawing insights from national roadmaps and strategic documents related to research data infrastructure.It emphasizes the pivotal role of research data infrastructures by delineating four new missions aimed at positioning them at the core of the current scientific research and communication ecosystem.The four new missions of research data infrastructures are:(1)as a pioneer,to transcend the disciplinary border and address complex,cutting-edge scientific and social challenges with problem-and data-oriented insights;(2)as an architect,to establish a digital,intelligent,flexible research and knowledge services environment;(3)as a platform,to foster the high-end academic communication;(4)as a coordinator,to balance scientific openness with ethics needs.展开更多
Research Data Management(RDM)has become increasingly important for more and more academic institutions.Using the Peking University Open Research Data Repository(PKU-ORDR)project as an example,this paper will review a ...Research Data Management(RDM)has become increasingly important for more and more academic institutions.Using the Peking University Open Research Data Repository(PKU-ORDR)project as an example,this paper will review a library-based university-wide open research data repository project and related RDM services implementation process including project kickoff,needs assessment,partnerships establishment,software investigation and selection,software customization,as well as data curation services and training.Through the review,some issues revealed during the stages of the implementation process are also discussed and addressed in the paper such as awareness of research data,demands from data providers and users,data policies and requirements from home institution,requirements from funding agencies and publishers,the collaboration between administrative units and libraries,and concerns from data providers and users.The significance of the study is that the paper shows an example of creating an Open Data repository and RDM services for other Chinese academic libraries planning to implement their RDM services for their home institutions.The authors of the paper have also observed since the PKU-ORDR and RDM services implemented in 2015,the Peking University Library(PKUL)has helped numerous researchers to support the entire research life cycle and enhanced Open Science(OS)practices on campus,as well as impacted the national OS movement in China through various national events and activities hosted by the PKUL.展开更多
The increased number of data repositories has greatly increased the availability of open data.To enable broad discovery and access to research dataset,some data repositories have begun leveraging the web architecture ...The increased number of data repositories has greatly increased the availability of open data.To enable broad discovery and access to research dataset,some data repositories have begun leveraging the web architecture by embedding structured metadata markup in dataset web landing pages using vocabularies from Schema.org and extensions.This paper aims to examine metadata interoperability for supporting global data discovery.Specifically,the paper reports a survey on which metadata schema has been adopted by participating data repositories,and presents an analysis of crosswalks from fourteen research data schemas to Schema.org.The analysis indicates most descriptive metadata are interoperable among the schemas,the most inconsistent mapping is the rights metadata,and a large gap exists in the structural metadata and controlled vocabularies to specify various property values.The analysis and collated crosswalks can serve as a reference for data repositories when they develop crosswalks from their own schemas to Schema.org,and provide the research data community a benchmark of structured metadata implementation.展开更多
The UK Catalysis Hub(UKCH)is designing a virtual research environment to support data processing and analysis,the Catalysis Research Workbench(CRW).The development of this platform requires identifying the processing ...The UK Catalysis Hub(UKCH)is designing a virtual research environment to support data processing and analysis,the Catalysis Research Workbench(CRW).The development of this platform requires identifying the processing and analysis needs of the UKCH members and mapping them to potential solutions.This paper presents a proposal for a demonstrator to analyse the use of scientific workflows for large scale data processing.The demonstrator provides a concrete target to promote further discussion of the processing and analysis needs of the UKCH community.In this paper,we will discuss the main requirements for data processing elicited and the proposed adaptations that will be incorporated in the design of the CRW and how to integrate the proposed solutions with existing practices of the UKCH.The demonstrator has been used in discussion with researchers and in presentations to the UKCH community,generating increased interest and motivating furtherdevelopment.展开更多
Federated Research Data Infrastructures aim to provide seamless access to research data along with services to facilitate the researchers in performing their data management tasks.During our research on Open Science(O...Federated Research Data Infrastructures aim to provide seamless access to research data along with services to facilitate the researchers in performing their data management tasks.During our research on Open Science(OS),we have built cross-disciplinary federated infrastructures for different types of(open)digital resources:Open Data(OD),Open Educational Resources(OER),and open access documents.In each case,our approach targeted only the resource“metadata”.Based on this experience,we identified some challenges that we had to overcome again and again:lack of(i)harvesters,(ii)common metadata models and(iii)metadata mapping tools.In this paper,we report on the challenges we faced in the federated infrastructure projects we were involved with.We structure the report based on the three challenges listed above.展开更多
Widely used in clinical research, the database is a new type of data management automation technology and the most efficient tool for data management. In this article, we first explain some basic concepts, such as the...Widely used in clinical research, the database is a new type of data management automation technology and the most efficient tool for data management. In this article, we first explain some basic concepts, such as the definition, classification, and establishment of databases. Afterward, the workflow for establishing databases, inputting data, verifying data, and managing databases is presented. Meanwhile, by discussing the application of databases in clinical research, we illuminate the important role of databases in clinical research practice. Lastly, we introduce the reanalysis of randomized controlled trials(RCTs) and cloud computing techniques, showing the most recent advancements of databases in clinical research.展开更多
The research value and market potentials of "big data" in real estate industry are well acknowledged with the development of technologies. But research in this area is far away from systematic and thorough c...The research value and market potentials of "big data" in real estate industry are well acknowledged with the development of technologies. But research in this area is far away from systematic and thorough context. Aiming at this issue, we systematically examined the research outcomes related to real estate big data. It gives a comment to current research status and proposes the future directions in this area from the four aspects, i.e. the hierarchical structuring of real estate "Big Data", integrated implementation, exchange and pricing systems, and the market operation system, in order to assist the researchers for their future works.展开更多
This article has explored the relationships between data and theory in qualitative research from an enthnographic perspective. It has also explicated the discovery of theory from data systematically obtained from enth...This article has explored the relationships between data and theory in qualitative research from an enthnographic perspective. It has also explicated the discovery of theory from data systematically obtained from enthnographic research, the Participant Observation approach based on inductive logic, that is, grounded theory.展开更多
Unlike consumers in the mall or supermarkets, online consumers are “intangible” and their purchasing behaviors are affected by multiple factors, including product pricing, promotion and discounts, quality of product...Unlike consumers in the mall or supermarkets, online consumers are “intangible” and their purchasing behaviors are affected by multiple factors, including product pricing, promotion and discounts, quality of products and brands, and the platforms where they search for the product. In this research, I study the relationship between product sales and consumer characteristics, the relationship between product sales and product qualities, demand curve analysis, and the search friction effect for different platforms. I utilized data from a randomized field experiment involving more than 400 thousand customers and 30 thousand products on JD.com, one of the world’s largest online retailing platforms. There are two focuses of the research: 1) how different consumer characteristics affect sales;2) how to set price and possible search friction for different channels. I find that JD plus membership, education level and age have no significant relationship with product sales, and higher user level leads to higher sales. Sales are highly skewed, with very high numbers of products sold making up only a small percentage of the total. Consumers living in more industrialized cities have more purchasing power. Women and singles lead to higher spending. Also, the better the product performs, the more it sells. Moderate pricing can increase product sales. Based on the research results of search volume in different channels, it is suggested that it is better to focus on app sales. By knowing the results, producers can adjust target consumers for different products and do target advertisements in order to maximize the sales. Also, an appropriate price for a product is also crucial to a seller. By the way, knowing the search friction of different channels can help producers to rearrange platform layout so that search friction can be reduced and more potential deals may be made.展开更多
基金the National Social Science Fund of China(Grant No.22CTQ031)Special Project on Library Capacity Building of the Chinese Academy of Sciences(Grant No.E2290431).
文摘Research data infrastructures form the cornerstone in both cyber and physical spaces,driving the progression of the data-intensive scientific research paradigm.This opinion paper presents an overview of global research data infrastructure,drawing insights from national roadmaps and strategic documents related to research data infrastructure.It emphasizes the pivotal role of research data infrastructures by delineating four new missions aimed at positioning them at the core of the current scientific research and communication ecosystem.The four new missions of research data infrastructures are:(1)as a pioneer,to transcend the disciplinary border and address complex,cutting-edge scientific and social challenges with problem-and data-oriented insights;(2)as an architect,to establish a digital,intelligent,flexible research and knowledge services environment;(3)as a platform,to foster the high-end academic communication;(4)as a coordinator,to balance scientific openness with ethics needs.
文摘Research Data Management(RDM)has become increasingly important for more and more academic institutions.Using the Peking University Open Research Data Repository(PKU-ORDR)project as an example,this paper will review a library-based university-wide open research data repository project and related RDM services implementation process including project kickoff,needs assessment,partnerships establishment,software investigation and selection,software customization,as well as data curation services and training.Through the review,some issues revealed during the stages of the implementation process are also discussed and addressed in the paper such as awareness of research data,demands from data providers and users,data policies and requirements from home institution,requirements from funding agencies and publishers,the collaboration between administrative units and libraries,and concerns from data providers and users.The significance of the study is that the paper shows an example of creating an Open Data repository and RDM services for other Chinese academic libraries planning to implement their RDM services for their home institutions.The authors of the paper have also observed since the PKU-ORDR and RDM services implemented in 2015,the Peking University Library(PKUL)has helped numerous researchers to support the entire research life cycle and enhanced Open Science(OS)practices on campus,as well as impacted the national OS movement in China through various national events and activities hosted by the PKUL.
文摘The increased number of data repositories has greatly increased the availability of open data.To enable broad discovery and access to research dataset,some data repositories have begun leveraging the web architecture by embedding structured metadata markup in dataset web landing pages using vocabularies from Schema.org and extensions.This paper aims to examine metadata interoperability for supporting global data discovery.Specifically,the paper reports a survey on which metadata schema has been adopted by participating data repositories,and presents an analysis of crosswalks from fourteen research data schemas to Schema.org.The analysis indicates most descriptive metadata are interoperable among the schemas,the most inconsistent mapping is the rights metadata,and a large gap exists in the structural metadata and controlled vocabularies to specify various property values.The analysis and collated crosswalks can serve as a reference for data repositories when they develop crosswalks from their own schemas to Schema.org,and provide the research data community a benchmark of structured metadata implementation.
基金funded by EPSRC grant:EP/R026939/1,EP/R026815/1,EP/R026645/1,EP/R027129/1 or EP/M013219/1(biocatalysis)part-funded by the European Regional Development Fund(ERDF)via Welsh Government.
文摘The UK Catalysis Hub(UKCH)is designing a virtual research environment to support data processing and analysis,the Catalysis Research Workbench(CRW).The development of this platform requires identifying the processing and analysis needs of the UKCH members and mapping them to potential solutions.This paper presents a proposal for a demonstrator to analyse the use of scientific workflows for large scale data processing.The demonstrator provides a concrete target to promote further discussion of the processing and analysis needs of the UKCH community.In this paper,we will discuss the main requirements for data processing elicited and the proposed adaptations that will be incorporated in the design of the CRW and how to integrate the proposed solutions with existing practices of the UKCH.The demonstrator has been used in discussion with researchers and in presentations to the UKCH community,generating increased interest and motivating furtherdevelopment.
文摘Federated Research Data Infrastructures aim to provide seamless access to research data along with services to facilitate the researchers in performing their data management tasks.During our research on Open Science(OS),we have built cross-disciplinary federated infrastructures for different types of(open)digital resources:Open Data(OD),Open Educational Resources(OER),and open access documents.In each case,our approach targeted only the resource“metadata”.Based on this experience,we identified some challenges that we had to overcome again and again:lack of(i)harvesters,(ii)common metadata models and(iii)metadata mapping tools.In this paper,we report on the challenges we faced in the federated infrastructure projects we were involved with.We structure the report based on the three challenges listed above.
基金supported by Fundamental Research Funds of State Key Laboratory of Ophthalmology (Grant No.2015QN01)Young Teacher Top-Support project of Sun Yat-sen University(Grant No.2015ykzd11)+4 种基金the Cultivation Projects for Young Teaching Staff of Sun Yat-sen University(Grant No.12ykpy61) from the Fundamental Research Funds for the Central Universitiesthe Pearl River Science and Technology New Star(Grant No.2014J2200060)Project of Guangzhou City,the Guangdong Provincial Natural Science Foundation for Distinguished Young Scholars of China(Grant No. 2014A030306030)Youth Science and Technology Innovation Talents Funds in Special Support Plan for High Level Talents in Guangdong Province(Grant No. 2014TQ01R573)Key Research Plan for National Natural Science Foundation of China in Cultivation Project (No.91546101)
文摘Widely used in clinical research, the database is a new type of data management automation technology and the most efficient tool for data management. In this article, we first explain some basic concepts, such as the definition, classification, and establishment of databases. Afterward, the workflow for establishing databases, inputting data, verifying data, and managing databases is presented. Meanwhile, by discussing the application of databases in clinical research, we illuminate the important role of databases in clinical research practice. Lastly, we introduce the reanalysis of randomized controlled trials(RCTs) and cloud computing techniques, showing the most recent advancements of databases in clinical research.
基金Funded partly by the Post-graduate Students’ ducation and Teaching Reform Program of Chongqing Education Committee(No.Yjg123089)
文摘The research value and market potentials of "big data" in real estate industry are well acknowledged with the development of technologies. But research in this area is far away from systematic and thorough context. Aiming at this issue, we systematically examined the research outcomes related to real estate big data. It gives a comment to current research status and proposes the future directions in this area from the four aspects, i.e. the hierarchical structuring of real estate "Big Data", integrated implementation, exchange and pricing systems, and the market operation system, in order to assist the researchers for their future works.
文摘This article has explored the relationships between data and theory in qualitative research from an enthnographic perspective. It has also explicated the discovery of theory from data systematically obtained from enthnographic research, the Participant Observation approach based on inductive logic, that is, grounded theory.
文摘Unlike consumers in the mall or supermarkets, online consumers are “intangible” and their purchasing behaviors are affected by multiple factors, including product pricing, promotion and discounts, quality of products and brands, and the platforms where they search for the product. In this research, I study the relationship between product sales and consumer characteristics, the relationship between product sales and product qualities, demand curve analysis, and the search friction effect for different platforms. I utilized data from a randomized field experiment involving more than 400 thousand customers and 30 thousand products on JD.com, one of the world’s largest online retailing platforms. There are two focuses of the research: 1) how different consumer characteristics affect sales;2) how to set price and possible search friction for different channels. I find that JD plus membership, education level and age have no significant relationship with product sales, and higher user level leads to higher sales. Sales are highly skewed, with very high numbers of products sold making up only a small percentage of the total. Consumers living in more industrialized cities have more purchasing power. Women and singles lead to higher spending. Also, the better the product performs, the more it sells. Moderate pricing can increase product sales. Based on the research results of search volume in different channels, it is suggested that it is better to focus on app sales. By knowing the results, producers can adjust target consumers for different products and do target advertisements in order to maximize the sales. Also, an appropriate price for a product is also crucial to a seller. By the way, knowing the search friction of different channels can help producers to rearrange platform layout so that search friction can be reduced and more potential deals may be made.