In the United States(US),the Surveillance,Epidemiology,and End Results(SEER)program is the only comprehensive source of population-based information that includes stage of cancer at the time of diagnosis and patient s...In the United States(US),the Surveillance,Epidemiology,and End Results(SEER)program is the only comprehensive source of population-based information that includes stage of cancer at the time of diagnosis and patient survival data.This program aims to provide a database about cancer incidence and survival for studies of surveillance and the development of analytical and methodological tools in the cancer field.Currently,the SEER program covers approximately half of the total cancer patients in the US.A growing number of clinical studies have applied the SEER database in various aspects.However,the intrinsic features of the SEER database,such as the huge data volume and complexity of data types,have hindered its application.In this review,we provided a systematic overview of the commonly used methodologies and study designs for retrospective epidemiological research in order to illustrate the application of the SEER database.Therefore,the goal of this review is to assist researchers in the selection of appropriate methods and study designs for enhancing the robustness and reliability of clinical studies by mining the SEER database.展开更多
Big data on product sales are an emerging resource for supporting modular product design to meet diversified customers’requirements of product specification combinations.To better facilitate decision-making of modula...Big data on product sales are an emerging resource for supporting modular product design to meet diversified customers’requirements of product specification combinations.To better facilitate decision-making of modular product design,correlations among specifications and components originated from customers’conscious and subconscious preferences can be investigated by using big data on product sales.This study proposes a framework and the associated methods for supporting modular product design decisions based on correlation analysis of product specifications and components using big sales data.The correlations of the product specifications are determined by analyzing the collected product sales data.By building the relations between the product components and specifications,a matrix for measuring the correlation among product components is formed for component clustering.Six rules for supporting the decision making of modular product design are proposed based on the frequency analysis of the specification values per component cluster.A case study of electric vehicles illustrates the application of the proposed method.展开更多
Materials development has historically been driven by human needs and desires, and this is likely to con- tinue in the foreseeable future. The global population is expected to reach ten billion by 2050, which will pro...Materials development has historically been driven by human needs and desires, and this is likely to con- tinue in the foreseeable future. The global population is expected to reach ten billion by 2050, which will promote increasingly large demands for clean and high-ef ciency energy, personalized consumer prod- ucts, secure food supplies, and professional healthcare. New functional materials that are made and tai- lored for targeted properties or behaviors will be the key to tackling this challenge. Traditionally, advanced materials are found empirically or through experimental trial-and-error approaches. As big data generated by modern experimental and computational techniques is becoming more readily avail- able, data-driven or machine learning (ML) methods have opened new paradigms for the discovery and rational design of materials. In this review article, we provide a brief introduction on various ML methods and related software or tools. Main ideas and basic procedures for employing ML approaches in materials research are highlighted. We then summarize recent important applications of ML for the large-scale screening and optimal design of polymer and porous materials, catalytic materials, and energetic mate- rials. Finally, concluding remarks and an outlook are provided.展开更多
There are quintillions of data on deoxyribonucleic acid(DNA)and protein in publicly accessible data banks,and that number is expanding at an exponential rate.Many scientific fields,such as bioinformatics and drug disc...There are quintillions of data on deoxyribonucleic acid(DNA)and protein in publicly accessible data banks,and that number is expanding at an exponential rate.Many scientific fields,such as bioinformatics and drug discovery,rely on such data;nevertheless,gathering and extracting data from these resources is a tough undertaking.This data should go through several processes,including mining,data processing,analysis,and classification.This study proposes software that extracts data from big data repositories automatically and with the particular ability to repeat data extraction phases as many times as needed without human intervention.This software simulates the extraction of data from web-based(point-and-click)resources or graphical user interfaces that cannot be accessed using command-line tools.The software was evaluated by creating a novel database of 34 parameters for 1360 physicochemical properties of antimicrobial peptides(AMP)sequences(46240 hits)from various MARVIN software panels,which can be later utilized to develop novel AMPs.Furthermore,for machine learning research,the program was validated by extracting 10,000 protein tertiary structures from the Protein Data Bank.As a result,data collection from the web will become faster and less expensive,with no need for manual data extraction.The software is critical as a first step to preparing large datasets for subsequent stages of analysis,such as those using machine and deep-learning applications.展开更多
Due to the huge amount of increasing data, the requirements of people forelectronic products such as mobile phones, tablets, and notebooks are constantlyimproving. The development and design of various software applic...Due to the huge amount of increasing data, the requirements of people forelectronic products such as mobile phones, tablets, and notebooks are constantlyimproving. The development and design of various software applications attach greatimportance to users’ experiences. The rationalized UI design should allow a user not onlyenjoy the visual design experience of the new product but also operating it morepleasingly. This process is to enhance the attractiveness and performance of the newproduct and thus to promote the active usage and consuming conduct of users. In thispaper, an UI design optimization strategy for general APP in the big data environment isproposed to get better user experience while effectively obtaining information. Anexperimental example of a library APP is designed to optimize the user experience. Theexperimental results show that the user-centered UI design is the core of optimization,and user portrait based on big data platforms is the key to UI design.展开更多
In this paper, we conduct research on the applications of big data on artistic designing and its infl uences on the design behavior. Modern art design enhance the grade of the product, increase the added value of prod...In this paper, we conduct research on the applications of big data on artistic designing and its infl uences on the design behavior. Modern art design enhance the grade of the product, increase the added value of products, promote the development of the economy, make the product of the aesthetic value and economic value of the perfect unity. In today’s society due to the value of the product are much more than the product itself contains the value of performance, usage, etc. is more of a product in gradually improve the aesthetic value, sometimes even more than the use of the product value and exchange value, therefore the product has become the dominant value. Under this basis, we propose the new idea on the design pattern that will be meaningful.展开更多
The arrival of the era of the Internet has brought about the rapid dissemination and spread of a big amount of the information and data. At present, we are surrounded by all kinds of the information, but the rich and ...The arrival of the era of the Internet has brought about the rapid dissemination and spread of a big amount of the information and data. At present, we are surrounded by all kinds of the information, but the rich and diversified information resources also brought about the chaos, so that the query of the messages is no way to start. In fact, the information resources can provide us with more convenience, but we have to spend a lot of energy to organize and filter the information, and the costs and the time of the investment are immeasurable. Usually, the information we want to query is often easy to understand, and the information design uses the more intuitive and vivid computing means to achieve the visualization of the big data, in order to reflect the beauty of the big data.展开更多
How to feed 10 billion human populations is one of the challenges that need to be addressed in the following decades,especially under an unpredicted climate change.Crop breeding,initiating from the phenotype-based sel...How to feed 10 billion human populations is one of the challenges that need to be addressed in the following decades,especially under an unpredicted climate change.Crop breeding,initiating from the phenotype-based selection by local farmers and developing into current biotechnology-based breeding,has played a critical role in securing the global food supply.However,regarding the changing environment and ever-increasing human population,can we breed outstanding crop varieties fast enough to achieve high productivity,good quality,and widespread adaptability?This review outlines the recent achievements in understanding cereal crop breeding,including the current knowledge about crop agronomic traits,newly developed techniques,crop big biological data research,and the possibility of integrating them for intelligence-driven breeding by design,which ushers in a new era of crop breeding practice and shapes the novel architecture of future crops.This review focuses on the major cereal crops,including rice,maize,and wheat,to explain how intelligence-driven breeding by design is becoming a reality.展开更多
This paper is based on the phenomenon of Tittytainment and big data control.It discusses the ways in which working youths live and work in their large suburban communities in Beijing as well as their leisure and enter...This paper is based on the phenomenon of Tittytainment and big data control.It discusses the ways in which working youths live and work in their large suburban communities in Beijing as well as their leisure and entertainment activities.Then develops a problem statement and research questions.In addition,the project explores this reality phenomenon dialectically through the analysis and speculation of the new virtual reality technology“Metaverse.”Based on a critical design approach,the phenomenon is explored through a detailed description and examination of reality,before critically suggesting the possibility of future social control by big data companies through the“metaverse”space.Furthermore,this paper alerts to the neoliberalism that dominates globalization through anti-utopian expressions,waking up young people who are compromised by big data control and addicted to Tittytainment.展开更多
To obtain the platform s big data analytics support,manufacturers in the traditional retail channel must decide whether to use the direct online channel.A retail supply chain model and a direct online supply chain mod...To obtain the platform s big data analytics support,manufacturers in the traditional retail channel must decide whether to use the direct online channel.A retail supply chain model and a direct online supply chain model are built,in which manufacturers design products alone in the retail channel,while the platform and manufacturer complete the product design in the direct online channel.These two models are analyzed using the game theoretical model and numerical simulation.The findings indicate that if the manufacturers design capabilities are not very high and the commission rate is not very low,the manufacturers will choose the direct online channel if the platform s technical efforts are within an interval.When the platform s technical efforts are exogenous,they positively influence the manufacturers decisions;however,in the endogenous case,the platform s effect on the manufacturers is reflected in the interaction of the commission rate and cost efficiency.The manufacturers and the platform should make synthetic effort decisions based on the manufacturer s development capabilities,the intensity of market competition,and the cost efficiency of the platform.展开更多
基金Guangdong Provincial Key Laboratory of Traditional Chinese Medicine Informatization(2021B1212040007)Clinical Frontier Technology Program of the First Affiliated Hospital of Jinan University,China(JNU1AF-CFTP-2022-a01235)Science and Technology Projects in Guangzhou,China(202201020054,2023A03J1032).
文摘In the United States(US),the Surveillance,Epidemiology,and End Results(SEER)program is the only comprehensive source of population-based information that includes stage of cancer at the time of diagnosis and patient survival data.This program aims to provide a database about cancer incidence and survival for studies of surveillance and the development of analytical and methodological tools in the cancer field.Currently,the SEER program covers approximately half of the total cancer patients in the US.A growing number of clinical studies have applied the SEER database in various aspects.However,the intrinsic features of the SEER database,such as the huge data volume and complexity of data types,have hindered its application.In this review,we provided a systematic overview of the commonly used methodologies and study designs for retrospective epidemiological research in order to illustrate the application of the SEER database.Therefore,the goal of this review is to assist researchers in the selection of appropriate methods and study designs for enhancing the robustness and reliability of clinical studies by mining the SEER database.
基金National Key R&D Program of China(Grant No.2018YFB1701701)Sailing Talent Program+1 种基金Guangdong Provincial Science and Technologies Program of China(Grant No.2017B090922008)Special Grand Grant from Tianjin City Government of China。
文摘Big data on product sales are an emerging resource for supporting modular product design to meet diversified customers’requirements of product specification combinations.To better facilitate decision-making of modular product design,correlations among specifications and components originated from customers’conscious and subconscious preferences can be investigated by using big data on product sales.This study proposes a framework and the associated methods for supporting modular product design decisions based on correlation analysis of product specifications and components using big sales data.The correlations of the product specifications are determined by analyzing the collected product sales data.By building the relations between the product components and specifications,a matrix for measuring the correlation among product components is formed for component clustering.Six rules for supporting the decision making of modular product design are proposed based on the frequency analysis of the specification values per component cluster.A case study of electric vehicles illustrates the application of the proposed method.
文摘Materials development has historically been driven by human needs and desires, and this is likely to con- tinue in the foreseeable future. The global population is expected to reach ten billion by 2050, which will promote increasingly large demands for clean and high-ef ciency energy, personalized consumer prod- ucts, secure food supplies, and professional healthcare. New functional materials that are made and tai- lored for targeted properties or behaviors will be the key to tackling this challenge. Traditionally, advanced materials are found empirically or through experimental trial-and-error approaches. As big data generated by modern experimental and computational techniques is becoming more readily avail- able, data-driven or machine learning (ML) methods have opened new paradigms for the discovery and rational design of materials. In this review article, we provide a brief introduction on various ML methods and related software or tools. Main ideas and basic procedures for employing ML approaches in materials research are highlighted. We then summarize recent important applications of ML for the large-scale screening and optimal design of polymer and porous materials, catalytic materials, and energetic mate- rials. Finally, concluding remarks and an outlook are provided.
基金This work was funded by the Graduate Scientific Research School at Yarmouk University under Grant Number:82/2020。
文摘There are quintillions of data on deoxyribonucleic acid(DNA)and protein in publicly accessible data banks,and that number is expanding at an exponential rate.Many scientific fields,such as bioinformatics and drug discovery,rely on such data;nevertheless,gathering and extracting data from these resources is a tough undertaking.This data should go through several processes,including mining,data processing,analysis,and classification.This study proposes software that extracts data from big data repositories automatically and with the particular ability to repeat data extraction phases as many times as needed without human intervention.This software simulates the extraction of data from web-based(point-and-click)resources or graphical user interfaces that cannot be accessed using command-line tools.The software was evaluated by creating a novel database of 34 parameters for 1360 physicochemical properties of antimicrobial peptides(AMP)sequences(46240 hits)from various MARVIN software panels,which can be later utilized to develop novel AMPs.Furthermore,for machine learning research,the program was validated by extracting 10,000 protein tertiary structures from the Protein Data Bank.As a result,data collection from the web will become faster and less expensive,with no need for manual data extraction.The software is critical as a first step to preparing large datasets for subsequent stages of analysis,such as those using machine and deep-learning applications.
基金Hunan Provincial Education Science 13th Five-Year Plan (Grant No.XJK016BXX001)Social Science Foundation of Hunan Province (Grant No.17YBA049)+1 种基金Open Foundation for the University Innovation Platform in the HunanProvince, grant number 16K013. This research work is implemented at the 2011Collaborative Innovation Center for Development and Utilization of Finance andEconomics Big Data Property, Universities of Hunan Province. Open project (Grant Nos.20181901CRP03, 20181901CRP04, 20181901CRP05)National Social Science Fund Project: Research on the Impact Mechanism of China’sCapital Space Flow on Regional Economic Development (Project No. 14BJL086).
文摘Due to the huge amount of increasing data, the requirements of people forelectronic products such as mobile phones, tablets, and notebooks are constantlyimproving. The development and design of various software applications attach greatimportance to users’ experiences. The rationalized UI design should allow a user not onlyenjoy the visual design experience of the new product but also operating it morepleasingly. This process is to enhance the attractiveness and performance of the newproduct and thus to promote the active usage and consuming conduct of users. In thispaper, an UI design optimization strategy for general APP in the big data environment isproposed to get better user experience while effectively obtaining information. Anexperimental example of a library APP is designed to optimize the user experience. Theexperimental results show that the user-centered UI design is the core of optimization,and user portrait based on big data platforms is the key to UI design.
文摘In this paper, we conduct research on the applications of big data on artistic designing and its infl uences on the design behavior. Modern art design enhance the grade of the product, increase the added value of products, promote the development of the economy, make the product of the aesthetic value and economic value of the perfect unity. In today’s society due to the value of the product are much more than the product itself contains the value of performance, usage, etc. is more of a product in gradually improve the aesthetic value, sometimes even more than the use of the product value and exchange value, therefore the product has become the dominant value. Under this basis, we propose the new idea on the design pattern that will be meaningful.
文摘The arrival of the era of the Internet has brought about the rapid dissemination and spread of a big amount of the information and data. At present, we are surrounded by all kinds of the information, but the rich and diversified information resources also brought about the chaos, so that the query of the messages is no way to start. In fact, the information resources can provide us with more convenience, but we have to spend a lot of energy to organize and filter the information, and the costs and the time of the investment are immeasurable. Usually, the information we want to query is often easy to understand, and the information design uses the more intuitive and vivid computing means to achieve the visualization of the big data, in order to reflect the beauty of the big data.
基金supported by the National Science Foundation of China(32341029)Science and Technology Innovation 2030 Major Projects(2023ZD0406804)Outstanding Youth Team Cultivation Project of Center Universities(2662023PY007)。
文摘How to feed 10 billion human populations is one of the challenges that need to be addressed in the following decades,especially under an unpredicted climate change.Crop breeding,initiating from the phenotype-based selection by local farmers and developing into current biotechnology-based breeding,has played a critical role in securing the global food supply.However,regarding the changing environment and ever-increasing human population,can we breed outstanding crop varieties fast enough to achieve high productivity,good quality,and widespread adaptability?This review outlines the recent achievements in understanding cereal crop breeding,including the current knowledge about crop agronomic traits,newly developed techniques,crop big biological data research,and the possibility of integrating them for intelligence-driven breeding by design,which ushers in a new era of crop breeding practice and shapes the novel architecture of future crops.This review focuses on the major cereal crops,including rice,maize,and wheat,to explain how intelligence-driven breeding by design is becoming a reality.
文摘This paper is based on the phenomenon of Tittytainment and big data control.It discusses the ways in which working youths live and work in their large suburban communities in Beijing as well as their leisure and entertainment activities.Then develops a problem statement and research questions.In addition,the project explores this reality phenomenon dialectically through the analysis and speculation of the new virtual reality technology“Metaverse.”Based on a critical design approach,the phenomenon is explored through a detailed description and examination of reality,before critically suggesting the possibility of future social control by big data companies through the“metaverse”space.Furthermore,this paper alerts to the neoliberalism that dominates globalization through anti-utopian expressions,waking up young people who are compromised by big data control and addicted to Tittytainment.
基金The National Natural Science Foundation of China(No.72071039)the Foundation of China Scholarship Council(No.202106090197)。
文摘To obtain the platform s big data analytics support,manufacturers in the traditional retail channel must decide whether to use the direct online channel.A retail supply chain model and a direct online supply chain model are built,in which manufacturers design products alone in the retail channel,while the platform and manufacturer complete the product design in the direct online channel.These two models are analyzed using the game theoretical model and numerical simulation.The findings indicate that if the manufacturers design capabilities are not very high and the commission rate is not very low,the manufacturers will choose the direct online channel if the platform s technical efforts are within an interval.When the platform s technical efforts are exogenous,they positively influence the manufacturers decisions;however,in the endogenous case,the platform s effect on the manufacturers is reflected in the interaction of the commission rate and cost efficiency.The manufacturers and the platform should make synthetic effort decisions based on the manufacturer s development capabilities,the intensity of market competition,and the cost efficiency of the platform.