The release of the generative pre-trained transformer(GPT)series has brought artificial general intelligence(AGI)to the forefront of the artificial intelligence(AI)field once again.However,the questions of how to defi...The release of the generative pre-trained transformer(GPT)series has brought artificial general intelligence(AGI)to the forefront of the artificial intelligence(AI)field once again.However,the questions of how to define and evaluate AGI remain unclear.This perspective article proposes that the evaluation of AGI should be rooted in dynamic embodied physical and social interactions(DEPSI).More specifically,we propose five critical characteristics to be considered as AGI benchmarks and suggest the Tong test as an AGI evaluation system.The Tong test describes a value-and ability-oriented testing system that delineates five levels of AGI milestones through a virtual environment with DEPSI,allowing for infinite task generation.We contrast the Tong test with classical AI testing systems in terms of various aspects and propose a systematic evaluation system to promote standardized,quantitative,and objective benchmarks and evaluation of AGI.展开更多
In recent years,as the infectious diseases caused by pathogens such as novel coronavirus and mpox(formerly called monkeypox)spread globally,the significance of identification,preservation,and sharing of pathogenic res...In recent years,as the infectious diseases caused by pathogens such as novel coronavirus and mpox(formerly called monkeypox)spread globally,the significance of identification,preservation,and sharing of pathogenic resources become prominent.Along with the rapid development of biorepositories,it is imperative to evaluate their operation in a scientific manner.By using the literature review and modified Delphi method,this study develops a benchmarking tool for the comprehensive evaluation of the operation of biorepositories for pathogenic resources.The effective response rates of both rounds of expert surveys were 100%.The authority coefficients(Cr)were 0.82 and 0.85,respectively,manifesting the reliability of consultation results.In the second‐round survey,the Kendall’s coefficient of concordance(Kendall’s W)of all indicators ranged from 0.09 to 0.31(P<0.001),the comprehensive score ranged from 4.02 to 4.94,the standard deviation ranged from 0.21 to 0.77,and the coefficient of variation(CV)ranged from 0.04 to 0.22,indicating that the expert opinions reached consensus.The final benchmarking tool was composed of 4 primary indicators,12 secondary indicators,and 65 tertiary indicators.The weights of the four primary indicators allocated through the rank‐sum ratio method,namely organizational structure,management requirements,biobanking capacity,and sharing capacity,were 30.50%,30.08%,25.45%,and 13.97%,respectively.The benchmarking tool established in this study provides references for the comprehensive evaluation of the operation and puts forward advice for the sustainable development of biorepositories for pathogenic resources.展开更多
Purpose: To develop and test a mission-oriented and multi-dimensional benchmarking method for a small scale university aiming for internationally first-class basic research.Design/methodology/approach: An individualiz...Purpose: To develop and test a mission-oriented and multi-dimensional benchmarking method for a small scale university aiming for internationally first-class basic research.Design/methodology/approach: An individualized evidence-based assessment scheme was employed to benchmark ShanghaiTech University against selected top research institutions,focusing on research impact and competitiveness at the institutional and disciplinary levels.Topic maps opposing ShanghaiTech and corresponding top institutions were produced for the main research disciplines of ShanghaiTech. This provides opportunities for further exploration of strengths and weakness. Findings: This study establishes a preliminary framework for assessing the mission of the university. It further provides assessment principles, assessment questions, and indicators.Analytical methods and data sources were tested and proved to be applicable and efficient.Research limitations: To better fit the selective research focuses of this university, its schema of research disciplines needs to be re-organized and benchmarking targets should include disciplinary top institutions and not necessarily those universities leading overall rankings.Current reliance on research articles and certain databases may neglect important research output types.Practical implications: This study provides a working framework and practical methods for mission-oriented, individual, and multi-dimensional benchmarking that ShanghaiTech decided to use for periodical assessments. It also offers a working reference for other institutions to adapt. Further needs are identified so that ShanghaiTech can tackle them for future benchmarking.Originality/value: This is an effort to develop a mission-oriented, individually designed,systematically structured, and multi-dimensional assessment methodology which differs from often used composite indices.展开更多
The seismic response of a benchmark highway bridge isolated with passive polynomial friction pendulum isolators (PFPIs) is investigated and subjected to six bidirectional ground motion records. The benchmark study i...The seismic response of a benchmark highway bridge isolated with passive polynomial friction pendulum isolators (PFPIs) is investigated and subjected to six bidirectional ground motion records. The benchmark study is based on a lumped mass finite-element model of the 91/5 highway overcrossing located in Southern California. The PFPI system possesses two important parameters; one is horizontal flexibility and the other is energy absorbing capacity through friction. The evaluation criteria of the benchmark bridge arc analyzed considering two parameters, time period of the isolator and coefficient of friction of the isolation surface. The results of the numerical study are compared with those obtained from the traditional friction pendulum system (FPS). Dual design performance of the PFPI system suppressed the displacement and acceleration response of the benchmark highway bridge. The dual design hysteresis loop of the PFPI system is the main advantage over the linear hysteresis loop of the FPS. The numerical result indicates that the seismic performance of the PFPI system is better than that of the traditional FPS isolated system. Further, it is observed that variations of the isolation time period and coefficient of friction of the FPS and PFPI systems have a significant effect on the peak responses of the benchmark highway bridge.展开更多
This paper presents a new measuring method based on the simulating technology and measuring technology, researches and analyzes deeply its network performance evaluating model, measuring methods, evaluating algorithms...This paper presents a new measuring method based on the simulating technology and measuring technology, researches and analyzes deeply its network performance evaluating model, measuring methods, evaluating algorithms and system implementation. Experimental results argue that this method can define expediently different models of manual simulated loads and configure neatly different environments of network applications, can develop adequately characteristics of widespread applications and accuracy in simulating technology, as well as reality, reliability and better adaptability in measuring technology.展开更多
This paper introduced a performance evaluating approach of computer communication system based on the simulation and measurement technology, and discussed its evaluating models. The result of our experiment showed tha...This paper introduced a performance evaluating approach of computer communication system based on the simulation and measurement technology, and discussed its evaluating models. The result of our experiment showed that the outcome of practical measurement on Ether-LAN fitted in well with the theoretical analysis. The approach we presented can be used to define various kinds of artificially simulated load models conveiently, build all kinds of network application environments in a flexible way, and exert sufficiently the widely-used and high-precision features of the traditional simulation technology and the reality, reliability, adaptability features of measurement technology.展开更多
基金supported by the National Key Research and Development Program of China (2022ZD0114900).
文摘The release of the generative pre-trained transformer(GPT)series has brought artificial general intelligence(AGI)to the forefront of the artificial intelligence(AI)field once again.However,the questions of how to define and evaluate AGI remain unclear.This perspective article proposes that the evaluation of AGI should be rooted in dynamic embodied physical and social interactions(DEPSI).More specifically,we propose five critical characteristics to be considered as AGI benchmarks and suggest the Tong test as an AGI evaluation system.The Tong test describes a value-and ability-oriented testing system that delineates five levels of AGI milestones through a virtual environment with DEPSI,allowing for infinite task generation.We contrast the Tong test with classical AI testing systems in terms of various aspects and propose a systematic evaluation system to promote standardized,quantitative,and objective benchmarks and evaluation of AGI.
基金the National Key R&D Program of China(2022YFC2602200)the National Science and Technology Infrastructure of China(No.National Pathogen Resource Center‐NPRC‐32).
文摘In recent years,as the infectious diseases caused by pathogens such as novel coronavirus and mpox(formerly called monkeypox)spread globally,the significance of identification,preservation,and sharing of pathogenic resources become prominent.Along with the rapid development of biorepositories,it is imperative to evaluate their operation in a scientific manner.By using the literature review and modified Delphi method,this study develops a benchmarking tool for the comprehensive evaluation of the operation of biorepositories for pathogenic resources.The effective response rates of both rounds of expert surveys were 100%.The authority coefficients(Cr)were 0.82 and 0.85,respectively,manifesting the reliability of consultation results.In the second‐round survey,the Kendall’s coefficient of concordance(Kendall’s W)of all indicators ranged from 0.09 to 0.31(P<0.001),the comprehensive score ranged from 4.02 to 4.94,the standard deviation ranged from 0.21 to 0.77,and the coefficient of variation(CV)ranged from 0.04 to 0.22,indicating that the expert opinions reached consensus.The final benchmarking tool was composed of 4 primary indicators,12 secondary indicators,and 65 tertiary indicators.The weights of the four primary indicators allocated through the rank‐sum ratio method,namely organizational structure,management requirements,biobanking capacity,and sharing capacity,were 30.50%,30.08%,25.45%,and 13.97%,respectively.The benchmarking tool established in this study provides references for the comprehensive evaluation of the operation and puts forward advice for the sustainable development of biorepositories for pathogenic resources.
文摘Purpose: To develop and test a mission-oriented and multi-dimensional benchmarking method for a small scale university aiming for internationally first-class basic research.Design/methodology/approach: An individualized evidence-based assessment scheme was employed to benchmark ShanghaiTech University against selected top research institutions,focusing on research impact and competitiveness at the institutional and disciplinary levels.Topic maps opposing ShanghaiTech and corresponding top institutions were produced for the main research disciplines of ShanghaiTech. This provides opportunities for further exploration of strengths and weakness. Findings: This study establishes a preliminary framework for assessing the mission of the university. It further provides assessment principles, assessment questions, and indicators.Analytical methods and data sources were tested and proved to be applicable and efficient.Research limitations: To better fit the selective research focuses of this university, its schema of research disciplines needs to be re-organized and benchmarking targets should include disciplinary top institutions and not necessarily those universities leading overall rankings.Current reliance on research articles and certain databases may neglect important research output types.Practical implications: This study provides a working framework and practical methods for mission-oriented, individual, and multi-dimensional benchmarking that ShanghaiTech decided to use for periodical assessments. It also offers a working reference for other institutions to adapt. Further needs are identified so that ShanghaiTech can tackle them for future benchmarking.Originality/value: This is an effort to develop a mission-oriented, individually designed,systematically structured, and multi-dimensional assessment methodology which differs from often used composite indices.
文摘The seismic response of a benchmark highway bridge isolated with passive polynomial friction pendulum isolators (PFPIs) is investigated and subjected to six bidirectional ground motion records. The benchmark study is based on a lumped mass finite-element model of the 91/5 highway overcrossing located in Southern California. The PFPI system possesses two important parameters; one is horizontal flexibility and the other is energy absorbing capacity through friction. The evaluation criteria of the benchmark bridge arc analyzed considering two parameters, time period of the isolator and coefficient of friction of the isolation surface. The results of the numerical study are compared with those obtained from the traditional friction pendulum system (FPS). Dual design performance of the PFPI system suppressed the displacement and acceleration response of the benchmark highway bridge. The dual design hysteresis loop of the PFPI system is the main advantage over the linear hysteresis loop of the FPS. The numerical result indicates that the seismic performance of the PFPI system is better than that of the traditional FPS isolated system. Further, it is observed that variations of the isolation time period and coefficient of friction of the FPS and PFPI systems have a significant effect on the peak responses of the benchmark highway bridge.
文摘This paper presents a new measuring method based on the simulating technology and measuring technology, researches and analyzes deeply its network performance evaluating model, measuring methods, evaluating algorithms and system implementation. Experimental results argue that this method can define expediently different models of manual simulated loads and configure neatly different environments of network applications, can develop adequately characteristics of widespread applications and accuracy in simulating technology, as well as reality, reliability and better adaptability in measuring technology.
基金Supported by the National 863 High-Tech Project of China!(863-30 0-02-09-99) by Key Research Project ofHubei Province!(99
文摘This paper introduced a performance evaluating approach of computer communication system based on the simulation and measurement technology, and discussed its evaluating models. The result of our experiment showed that the outcome of practical measurement on Ether-LAN fitted in well with the theoretical analysis. The approach we presented can be used to define various kinds of artificially simulated load models conveiently, build all kinds of network application environments in a flexible way, and exert sufficiently the widely-used and high-precision features of the traditional simulation technology and the reality, reliability, adaptability features of measurement technology.