This paper aims to formalize a general definition of intelligence beyond human intelligence. We accomplish this by re-imagining the concept of equality as a fundamental abstraction for relation. We discover that the c...This paper aims to formalize a general definition of intelligence beyond human intelligence. We accomplish this by re-imagining the concept of equality as a fundamental abstraction for relation. We discover that the concept of equality = limits the sensitivity of our mathematics to abstract relationships. We propose a new relation principle that does not rely on the concept of equality but is consistent with existing mathematical abstractions. In essence, this paper proposes a conceptual framework for general interaction and argues that this framework is also an abstraction that satisfies the definition of Intelligence. Hence, we define intelligence as a formalization of generality, represented by the abstraction ∆∞Ο, where each symbol represents the concepts infinitesimal, infinite, and finite respectively. In essence, this paper proposes a General Language Model (GLM), where the abstraction ∆∞Ο represents the foundational relationship of the model. This relation is colloquially termed “The theory of everything”.展开更多
Neural Networks (NN) are the functional unit of Deep Learning and are known to mimic the behavior of the human brain to solve complex data-driven problems. Whenever we train our own neural networks, we need to take ca...Neural Networks (NN) are the functional unit of Deep Learning and are known to mimic the behavior of the human brain to solve complex data-driven problems. Whenever we train our own neural networks, we need to take care of something called the generalization of the neural network. The performance of Artificial Neural Networks (ANN) mostly depends upon its generalization capability. In this paper, we propose an innovative approach to enhance the generalization capability of artificial neural networks (ANN) using structural redundancy. A novel perspective on handling input data prototypes and their impact on the development of generalization, which could improve to ANN architectures accuracy and reliability is described.展开更多
Since the 1950s,when the Turing Test was introduced,there has been notable progress in machine language intelligence.Language modeling,crucial for AI development,has evolved from statistical to neural models over the ...Since the 1950s,when the Turing Test was introduced,there has been notable progress in machine language intelligence.Language modeling,crucial for AI development,has evolved from statistical to neural models over the last two decades.Recently,transformer-based Pre-trained Language Models(PLM)have excelled in Natural Language Processing(NLP)tasks by leveraging large-scale training corpora.Increasing the scale of these models enhances performance significantly,introducing abilities like context learning that smaller models lack.The advancement in Large Language Models,exemplified by the development of ChatGPT,has made significant impacts both academically and industrially,capturing widespread societal interest.This survey provides an overview of the development and prospects from Large Language Models(LLM)to Large Multimodal Models(LMM).It first discusses the contributions and technological advancements of LLMs in the field of natural language processing,especially in text generation and language understanding.Then,it turns to the discussion of LMMs,which integrates various data modalities such as text,images,and sound,demonstrating advanced capabilities in understanding and generating cross-modal content,paving new pathways for the adaptability and flexibility of AI systems.Finally,the survey highlights the prospects of LMMs in terms of technological development and application potential,while also pointing out challenges in data integration,cross-modal understanding accuracy,providing a comprehensive perspective on the latest developments in this field.展开更多
Modeling of unsteady aerodynamic loads at high angles of attack using a small amount of experimental or simulation data to construct predictive models for unknown states can greatly improve the efficiency of aircraft ...Modeling of unsteady aerodynamic loads at high angles of attack using a small amount of experimental or simulation data to construct predictive models for unknown states can greatly improve the efficiency of aircraft unsteady aerodynamic design and flight dynamics analysis.In this paper,aiming at the problems of poor generalization of traditional aerodynamic models and intelligent models,an intelligent aerodynamic modeling method based on gated neural units is proposed.The time memory characteristics of the gated neural unit is fully utilized,thus the nonlinear flow field characterization ability of the learning and training process is enhanced,and the generalization ability of the whole prediction model is improved.The prediction and verification of the model are carried out under the maneuvering flight condition of NACA0015 airfoil.The results show that the model has good adaptability.In the interpolation prediction,the maximum prediction error of the lift and drag coefficients and the moment coefficient does not exceed 10%,which can basically represent the variation characteristics of the entire flow field.In the construction of extrapolation models,the training model based on the strong nonlinear data has good accuracy for weak nonlinear prediction.Furthermore,the error is larger,even exceeding 20%,which indicates that the extrapolation and generalization capabilities need to be further optimized by integrating physical models.Compared with the conventional state space equation model,the proposed method can improve the extrapolation accuracy and efficiency by 78%and 60%,respectively,which demonstrates the applied potential of this method in aerodynamic modeling.展开更多
This letter evaluates the article by Gravina et al on ChatGPT’s potential in providing medical information for inflammatory bowel disease patients.While promising,it highlights the need for advanced techniques like r...This letter evaluates the article by Gravina et al on ChatGPT’s potential in providing medical information for inflammatory bowel disease patients.While promising,it highlights the need for advanced techniques like reasoning+action and retrieval-augmented generation to improve accuracy and reliability.Emphasizing that simple question and answer testing is insufficient,it calls for more nuanced evaluation methods to truly gauge large language models’capabilities in clinical applications.展开更多
BACKGROUND Medication errors,especially in dosage calculation,pose risks in healthcare.Artificial intelligence(AI)systems like ChatGPT and Google Bard may help reduce errors,but their accuracy in providing medication ...BACKGROUND Medication errors,especially in dosage calculation,pose risks in healthcare.Artificial intelligence(AI)systems like ChatGPT and Google Bard may help reduce errors,but their accuracy in providing medication information remains to be evaluated.AIM To evaluate the accuracy of AI systems(ChatGPT 3.5,ChatGPT 4,Google Bard)in providing drug dosage information per Harrison's Principles of Internal Medicine.METHODS A set of natural language queries mimicking real-world medical dosage inquiries was presented to the AI systems.Responses were analyzed using a 3-point Likert scale.The analysis,conducted with Python and its libraries,focused on basic statistics,overall system accuracy,and disease-specific and organ system accuracies.RESULTS ChatGPT 4 outperformed the other systems,showing the highest rate of correct responses(83.77%)and the best overall weighted accuracy(0.6775).Disease-specific accuracy varied notably across systems,with some diseases being accurately recognized,while others demonstrated significant discrepancies.Organ system accuracy also showed variable results,underscoring system-specific strengths and weaknesses.CONCLUSION ChatGPT 4 demonstrates superior reliability in medical dosage information,yet variations across diseases emphasize the need for ongoing improvements.These results highlight AI's potential in aiding healthcare professionals,urging continuous development for dependable accuracy in critical medical situations.展开更多
The article is focused on discussing a new methodological approach to the study on specifics of transferring human beings to the posthuman cyber society.The approach in question assists in rethinking interconnected pr...The article is focused on discussing a new methodological approach to the study on specifics of transferring human beings to the posthuman cyber society.The approach in question assists in rethinking interconnected problems both of human origins in the universe and mankind’s digital future.And,besides,such an approach allows to deal with self-organising interconversions between the poles of the cardinal dual opposition of the Global Noosphere Brain and the Artificial General Intelligence.Herewith such phenomena of digital social life as Global Digitalisation,Digital Immortality,Mindcloning,and Technological Zombification being the constituents of Technological Singularity Concept,are rethought as paving the way for oncoming Posthuman Digital Era.This concept is evidently exemplified by a bifurcation resulting in two alternatives to be chosen by human beings,to wit,either to be undergone Mindcloning and become digitally immortal or being destroyed by powerful intelligent machines.The investigation in question is based on such a progressive methodology as the Law of Self-Organizing Ideals,as well as on the Method of Dual Oppositions.Rethinking interrelationships between the problem of a sense of social history and the meaning-of-life of local societies members which any intelligent machine is devoid of permits to substantiate specific regularities of Self-Transforming Homo Faber into Homo Digitalis and Technological Zombies ready to be transferred to posthuman cyberspace.展开更多
We consider a structural stochastic volatility model for the loss from a large portfolio of credit risky assets.Both the asset value and the volatility processes are correlated through systemic Brownian motions,with d...We consider a structural stochastic volatility model for the loss from a large portfolio of credit risky assets.Both the asset value and the volatility processes are correlated through systemic Brownian motions,with default determined by the asset value reaching a lower boundary.We prove that if our volatility models are picked from a class of mean-reverting diffusions,the system converges as the portfolio becomes large and,when the vol-of-vol function satisfies certain regularity and boundedness conditions,the limit of the empirical measure process has a density given in terms of a solution to a stochastic initial-boundary value problem on a half-space.The problem is defined in a special weighted Sobolev space.Regularity results are established for solutions to this problem,and then we show that there exists a unique solution.In contrast to the CIR volatility setting covered by the existing literature,our results hold even when the systemic Brownian motions are taken to be correlated.展开更多
This paper proposes and illustrates an AI embedded object-oriented methodology to formulate the computable general equilibrium (CGE) models. In this framework, a CGE model is viewed as a collection of objects embedd...This paper proposes and illustrates an AI embedded object-oriented methodology to formulate the computable general equilibrium (CGE) models. In this framework, a CGE model is viewed as a collection of objects embedded AI or namely agents in computer world, corresponding to economic agents and entities in real world, such as government, households, markets and so on. A frame representation of major objects in CGE model is used for trade and environment. Embedded Al object-oriented approach (or software agent) is used in the CGE model representation can able to narrow the gap among the semantic representation, formal CGE (mathematical) representation and computer and algorithm representation, and to improve CGE in understanding and maintenance etc. In such a system, constructing a CGE model to appear an intuitive process rather than an abstract process. This intuitive process needs more understanding of the substance of economics and the logic underlying the problem rather than mathematical notation.展开更多
2024年,世界卫生组织发布了“Ethics and governance of artificial intelligence for health.Guidance on large multi-modal models”,将其翻译成中文《卫生领域人工智能的伦理与治理:多模态大模型指南》供中国的同仁参阅,协助规划与...2024年,世界卫生组织发布了“Ethics and governance of artificial intelligence for health.Guidance on large multi-modal models”,将其翻译成中文《卫生领域人工智能的伦理与治理:多模态大模型指南》供中国的同仁参阅,协助规划与卫生领域多模态大模型有关的益处和挑战,并为适当开发、提供和使用多模态大模型提供政策和实践方面的指导。世界卫生组织咨询了20位人工智能领域的顶尖专家,他们确定了在卫生领域使用人工智能的潜在益处和潜在风险,并发布了以协商方式达成一致的六项原则,供正在使用人工智能的政府、开发者和提供者在制定政策和实践时考虑。指南提供了与指导原则相一致的企业内部、政府和国际合作的治理建议,指南的基础是考虑到人类使用卫生领域生成式人工智能独特方式的指导原则和治理建议。生成式人工智能是算法在可用于生成新内容的数据集上进行训练的一种人工智能技术。指南针对其中一种类型的生成式人工智能,即多模态大模型,这种模型可以接受一种或多种类型的数据输入,并产生不局限于输入算法的数据类型的多种输出。据预测,多模态大模型将广泛应用于医疗保健、科学研究、公共卫生和药物开发等领域。多模态大模型也被称为“通用基础模型”,尽管尚未证实多模态大模型能否完成各种任务和目的。展开更多
目的构建一种基于人工智能大语言模型(large language model,LLM)技术、可用于医学教育的新型虚拟患者(virtual patient,VP)系统,评价该系统在基层医生进修学习全科医学临床思维中的应用效果。方法选取2021年1月至2024年2月在东南大学...目的构建一种基于人工智能大语言模型(large language model,LLM)技术、可用于医学教育的新型虚拟患者(virtual patient,VP)系统,评价该系统在基层医生进修学习全科医学临床思维中的应用效果。方法选取2021年1月至2024年2月在东南大学附属中大医院进修的基层社区医生为研究对象,随机分为试验组和对照组,分别采用基于LLM的VP系统教学、传统教学方法进行授课,通过临床思维理论知识考核、临床思维能力考核、课程满意度调查评估教学效果,并对结果进行相应的统计学分析。结果共纳入124名基层社区医生,其中试验组60例、对照组64例,两组在一般基线资料上差异无统计学意义,具有可比性。课程结束后,试验组临床思维理论知识考核成绩显著高于对照组(83.83±3.15 vs.79.92±4.52,P<0.01),且不及格率显著低于对照组(0.00%vs.9.38%,P<0.05);试验组在临床思维能力3个维度(批判性、系统性、循证思维)方面教学后分数均显著高于教学前,而对照组仅在批判性思维维度上教学前后差异有统计学意义;教学后试验组在系统思维、循证思维方面分数均显著高于对照组(P<0.05),但在批判性思维上两组分数差异无统计学意义。试验组对授课的总体满意度也显著高于对照组(93.33%vs.85.48%,P<0.05)。结论基于LLM的VP系统提升了学员对临床思维理论知识的掌握程度,也促进了其临床思维能力的培养,该教学方法可为其他医学教育群体提供新的教学工具和思路。展开更多
文摘This paper aims to formalize a general definition of intelligence beyond human intelligence. We accomplish this by re-imagining the concept of equality as a fundamental abstraction for relation. We discover that the concept of equality = limits the sensitivity of our mathematics to abstract relationships. We propose a new relation principle that does not rely on the concept of equality but is consistent with existing mathematical abstractions. In essence, this paper proposes a conceptual framework for general interaction and argues that this framework is also an abstraction that satisfies the definition of Intelligence. Hence, we define intelligence as a formalization of generality, represented by the abstraction ∆∞Ο, where each symbol represents the concepts infinitesimal, infinite, and finite respectively. In essence, this paper proposes a General Language Model (GLM), where the abstraction ∆∞Ο represents the foundational relationship of the model. This relation is colloquially termed “The theory of everything”.
文摘Neural Networks (NN) are the functional unit of Deep Learning and are known to mimic the behavior of the human brain to solve complex data-driven problems. Whenever we train our own neural networks, we need to take care of something called the generalization of the neural network. The performance of Artificial Neural Networks (ANN) mostly depends upon its generalization capability. In this paper, we propose an innovative approach to enhance the generalization capability of artificial neural networks (ANN) using structural redundancy. A novel perspective on handling input data prototypes and their impact on the development of generalization, which could improve to ANN architectures accuracy and reliability is described.
基金We acknowledge funding from NSFC Grant 62306283.
文摘Since the 1950s,when the Turing Test was introduced,there has been notable progress in machine language intelligence.Language modeling,crucial for AI development,has evolved from statistical to neural models over the last two decades.Recently,transformer-based Pre-trained Language Models(PLM)have excelled in Natural Language Processing(NLP)tasks by leveraging large-scale training corpora.Increasing the scale of these models enhances performance significantly,introducing abilities like context learning that smaller models lack.The advancement in Large Language Models,exemplified by the development of ChatGPT,has made significant impacts both academically and industrially,capturing widespread societal interest.This survey provides an overview of the development and prospects from Large Language Models(LLM)to Large Multimodal Models(LMM).It first discusses the contributions and technological advancements of LLMs in the field of natural language processing,especially in text generation and language understanding.Then,it turns to the discussion of LMMs,which integrates various data modalities such as text,images,and sound,demonstrating advanced capabilities in understanding and generating cross-modal content,paving new pathways for the adaptability and flexibility of AI systems.Finally,the survey highlights the prospects of LMMs in terms of technological development and application potential,while also pointing out challenges in data integration,cross-modal understanding accuracy,providing a comprehensive perspective on the latest developments in this field.
基金supported in part by the National Natural Science Foundation of China (No. 12202363)。
文摘Modeling of unsteady aerodynamic loads at high angles of attack using a small amount of experimental or simulation data to construct predictive models for unknown states can greatly improve the efficiency of aircraft unsteady aerodynamic design and flight dynamics analysis.In this paper,aiming at the problems of poor generalization of traditional aerodynamic models and intelligent models,an intelligent aerodynamic modeling method based on gated neural units is proposed.The time memory characteristics of the gated neural unit is fully utilized,thus the nonlinear flow field characterization ability of the learning and training process is enhanced,and the generalization ability of the whole prediction model is improved.The prediction and verification of the model are carried out under the maneuvering flight condition of NACA0015 airfoil.The results show that the model has good adaptability.In the interpolation prediction,the maximum prediction error of the lift and drag coefficients and the moment coefficient does not exceed 10%,which can basically represent the variation characteristics of the entire flow field.In the construction of extrapolation models,the training model based on the strong nonlinear data has good accuracy for weak nonlinear prediction.Furthermore,the error is larger,even exceeding 20%,which indicates that the extrapolation and generalization capabilities need to be further optimized by integrating physical models.Compared with the conventional state space equation model,the proposed method can improve the extrapolation accuracy and efficiency by 78%and 60%,respectively,which demonstrates the applied potential of this method in aerodynamic modeling.
文摘This letter evaluates the article by Gravina et al on ChatGPT’s potential in providing medical information for inflammatory bowel disease patients.While promising,it highlights the need for advanced techniques like reasoning+action and retrieval-augmented generation to improve accuracy and reliability.Emphasizing that simple question and answer testing is insufficient,it calls for more nuanced evaluation methods to truly gauge large language models’capabilities in clinical applications.
文摘BACKGROUND Medication errors,especially in dosage calculation,pose risks in healthcare.Artificial intelligence(AI)systems like ChatGPT and Google Bard may help reduce errors,but their accuracy in providing medication information remains to be evaluated.AIM To evaluate the accuracy of AI systems(ChatGPT 3.5,ChatGPT 4,Google Bard)in providing drug dosage information per Harrison's Principles of Internal Medicine.METHODS A set of natural language queries mimicking real-world medical dosage inquiries was presented to the AI systems.Responses were analyzed using a 3-point Likert scale.The analysis,conducted with Python and its libraries,focused on basic statistics,overall system accuracy,and disease-specific and organ system accuracies.RESULTS ChatGPT 4 outperformed the other systems,showing the highest rate of correct responses(83.77%)and the best overall weighted accuracy(0.6775).Disease-specific accuracy varied notably across systems,with some diseases being accurately recognized,while others demonstrated significant discrepancies.Organ system accuracy also showed variable results,underscoring system-specific strengths and weaknesses.CONCLUSION ChatGPT 4 demonstrates superior reliability in medical dosage information,yet variations across diseases emphasize the need for ongoing improvements.These results highlight AI's potential in aiding healthcare professionals,urging continuous development for dependable accuracy in critical medical situations.
文摘The article is focused on discussing a new methodological approach to the study on specifics of transferring human beings to the posthuman cyber society.The approach in question assists in rethinking interconnected problems both of human origins in the universe and mankind’s digital future.And,besides,such an approach allows to deal with self-organising interconversions between the poles of the cardinal dual opposition of the Global Noosphere Brain and the Artificial General Intelligence.Herewith such phenomena of digital social life as Global Digitalisation,Digital Immortality,Mindcloning,and Technological Zombification being the constituents of Technological Singularity Concept,are rethought as paving the way for oncoming Posthuman Digital Era.This concept is evidently exemplified by a bifurcation resulting in two alternatives to be chosen by human beings,to wit,either to be undergone Mindcloning and become digitally immortal or being destroyed by powerful intelligent machines.The investigation in question is based on such a progressive methodology as the Law of Self-Organizing Ideals,as well as on the Method of Dual Oppositions.Rethinking interrelationships between the problem of a sense of social history and the meaning-of-life of local societies members which any intelligent machine is devoid of permits to substantiate specific regularities of Self-Transforming Homo Faber into Homo Digitalis and Technological Zombies ready to be transferred to posthuman cyberspace.
基金supported financially by the United Kingdom Engineering and Physical Sciences Research Council (Grant No.EP/L015811/1)by the Foundation for Education and European Culture (founded by Nicos&Lydia Tricha).
文摘We consider a structural stochastic volatility model for the loss from a large portfolio of credit risky assets.Both the asset value and the volatility processes are correlated through systemic Brownian motions,with default determined by the asset value reaching a lower boundary.We prove that if our volatility models are picked from a class of mean-reverting diffusions,the system converges as the portfolio becomes large and,when the vol-of-vol function satisfies certain regularity and boundedness conditions,the limit of the empirical measure process has a density given in terms of a solution to a stochastic initial-boundary value problem on a half-space.The problem is defined in a special weighted Sobolev space.Regularity results are established for solutions to this problem,and then we show that there exists a unique solution.In contrast to the CIR volatility setting covered by the existing literature,our results hold even when the systemic Brownian motions are taken to be correlated.
文摘This paper proposes and illustrates an AI embedded object-oriented methodology to formulate the computable general equilibrium (CGE) models. In this framework, a CGE model is viewed as a collection of objects embedded AI or namely agents in computer world, corresponding to economic agents and entities in real world, such as government, households, markets and so on. A frame representation of major objects in CGE model is used for trade and environment. Embedded Al object-oriented approach (or software agent) is used in the CGE model representation can able to narrow the gap among the semantic representation, formal CGE (mathematical) representation and computer and algorithm representation, and to improve CGE in understanding and maintenance etc. In such a system, constructing a CGE model to appear an intuitive process rather than an abstract process. This intuitive process needs more understanding of the substance of economics and the logic underlying the problem rather than mathematical notation.
文摘2024年,世界卫生组织发布了“Ethics and governance of artificial intelligence for health.Guidance on large multi-modal models”,将其翻译成中文《卫生领域人工智能的伦理与治理:多模态大模型指南》供中国的同仁参阅,协助规划与卫生领域多模态大模型有关的益处和挑战,并为适当开发、提供和使用多模态大模型提供政策和实践方面的指导。世界卫生组织咨询了20位人工智能领域的顶尖专家,他们确定了在卫生领域使用人工智能的潜在益处和潜在风险,并发布了以协商方式达成一致的六项原则,供正在使用人工智能的政府、开发者和提供者在制定政策和实践时考虑。指南提供了与指导原则相一致的企业内部、政府和国际合作的治理建议,指南的基础是考虑到人类使用卫生领域生成式人工智能独特方式的指导原则和治理建议。生成式人工智能是算法在可用于生成新内容的数据集上进行训练的一种人工智能技术。指南针对其中一种类型的生成式人工智能,即多模态大模型,这种模型可以接受一种或多种类型的数据输入,并产生不局限于输入算法的数据类型的多种输出。据预测,多模态大模型将广泛应用于医疗保健、科学研究、公共卫生和药物开发等领域。多模态大模型也被称为“通用基础模型”,尽管尚未证实多模态大模型能否完成各种任务和目的。
文摘目的构建一种基于人工智能大语言模型(large language model,LLM)技术、可用于医学教育的新型虚拟患者(virtual patient,VP)系统,评价该系统在基层医生进修学习全科医学临床思维中的应用效果。方法选取2021年1月至2024年2月在东南大学附属中大医院进修的基层社区医生为研究对象,随机分为试验组和对照组,分别采用基于LLM的VP系统教学、传统教学方法进行授课,通过临床思维理论知识考核、临床思维能力考核、课程满意度调查评估教学效果,并对结果进行相应的统计学分析。结果共纳入124名基层社区医生,其中试验组60例、对照组64例,两组在一般基线资料上差异无统计学意义,具有可比性。课程结束后,试验组临床思维理论知识考核成绩显著高于对照组(83.83±3.15 vs.79.92±4.52,P<0.01),且不及格率显著低于对照组(0.00%vs.9.38%,P<0.05);试验组在临床思维能力3个维度(批判性、系统性、循证思维)方面教学后分数均显著高于教学前,而对照组仅在批判性思维维度上教学前后差异有统计学意义;教学后试验组在系统思维、循证思维方面分数均显著高于对照组(P<0.05),但在批判性思维上两组分数差异无统计学意义。试验组对授课的总体满意度也显著高于对照组(93.33%vs.85.48%,P<0.05)。结论基于LLM的VP系统提升了学员对临床思维理论知识的掌握程度,也促进了其临床思维能力的培养,该教学方法可为其他医学教育群体提供新的教学工具和思路。