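Abstract: Fine-tuned large language models (LLMs) perform well across many tasks, but centralized training carries a risk of leaking user privacy. Federated learning (FL) avoids data sharing through local training, but the enormous number of parameters in LLMs challenges resource-constrained devices and communication bandwidth, making deployment in edge networks difficult. Combined with split learning (SL), federated split learning can effectively address this problem. Based on the findings that the weights of deeper model layers have a more significant influence and that training only some layers yields accuracy slightly lower than training the whole model, this paper splits the model by Transformer layer and introduces low-rank adaptation (LoRA) to further reduce resource overhead and improve security. On the device side, therefore, only the last few layers undergo low-rank adaptation and training, and the results are then uploaded to the server for aggregation. To reduce overhead while preserving model performance, this paper proposes a fine-tuning method for the pre-trained RoBERTa model based on federated split learning and LoRA. By jointly optimizing the computation frequency of the edge devices and the rank used for fine-tuning, the method maximizes the rank under resource constraints and improves model accuracy. Simulation results show that, when only the last 3 layers of the LLM are trained, increasing the rank within a certain range (1 to 32) improves model accuracy. Increasing the per-round tolerable latency of the model and the energy threshold of the devices further improves accuracy.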
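The low-rank update and server-side aggregation mentioned in this abstract can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names, the `alpha / r` scaling (a common LoRA convention), the plain element-wise averaging used for aggregation, and the 768 x 768 layer shape in the usage note are all chosen for the example.

```python
def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def lora_update(w, b, a, alpha):
    """Return W + (alpha / r) * B @ A, where r is the shared inner rank.

    W has shape (d, k), B has shape (d, r), A has shape (r, k).
    Only B and A would be trained; W stays frozen.
    """
    r = len(a)
    delta = matmul(b, a)
    scale = alpha / r
    return [[w_ij + scale * d_ij for w_ij, d_ij in zip(w_row, d_row)]
            for w_row, d_row in zip(w, delta)]


def trainable_params(d, k, r):
    """Count trainable entries for full fine-tuning versus rank-r LoRA."""
    return {"full": d * k, "lora": r * (d + k)}


def fed_avg(client_matrices):
    """Element-wise average of same-shaped matrices uploaded by clients.

    Assumption: the server simply averages; the abstract does not state
    the aggregation rule.
    """
    n = len(client_matrices)
    return [[sum(vals) / n for vals in zip(*rows)] for rows in zip(*client_matrices)]
```

For a hypothetical 768 x 768 weight matrix, `trainable_params(768, 768, 8)` gives 589,824 entries for full fine-tuning against 12,288 for rank 8, which is why raising the rank trades accuracy against device resources.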
Funding: We acknowledge funding from NSFC Grant 62306283.
Abstract: Since the Turing Test was introduced in the 1950s, machine language intelligence has made notable progress. Language modeling, which is crucial for AI development, has evolved from statistical to neural models over the last two decades. Recently, transformer-based Pre-trained Language Models (PLMs) have excelled in Natural Language Processing (NLP) tasks by leveraging large-scale training corpora. Increasing the scale of these models enhances performance significantly and introduces abilities, such as in-context learning, that smaller models lack. The advancement of Large Language Models (LLMs), exemplified by the development of ChatGPT, has had a significant academic and industrial impact and has captured widespread societal interest. This survey provides an overview of the development and prospects from LLMs to Large Multimodal Models (LMMs). It first discusses the contributions and technological advancements of LLMs in natural language processing, especially in text generation and language understanding. It then turns to LMMs, which integrate various data modalities such as text, images, and sound, demonstrate advanced capabilities in understanding and generating cross-modal content, and open new pathways for the adaptability and flexibility of AI systems. Finally, the survey highlights the prospects of LMMs in terms of technological development and application potential, while also pointing out challenges in data integration and cross-modal understanding accuracy, providing a comprehensive perspective on the latest developments in this field.
Abstract: Visual Query Language on Spatial Information (SIVQL) is a visual query language based on an extension of Query by Example (QBE). It operates visually on graphics or media objects, such as point, line, and area elements. In this paper, the relation calculation and query function of SIVQL are studied and discussed using set theory and relational algebra. The theoretical foundation of SIVQL is investigated mathematically. Finally, application examples are given for a specific information system.
Abstract: This study investigates the effect of using L1 (Arabic) while teaching a target language (English) on the General English achievement of foundation year students at King Abdulaziz University. To this end, the researcher used an experimental design with an experimental group and a control group. The independent variable is the use of L1 while teaching English in very limited and specified areas. The dependent variable is students' achievement in General English. The statistical test used is the t-test. The population of the study was all students enrolled in the foundation year 1431/1432 in the ELI (English Language Institute) at King Abdulaziz University. The sample consisted of 50 students taking North Star in sections A and B as a university requirement in the foundation year 1431/1432. The results favored banning Arabic in the English language classroom, as shown by the mean scores of the control and experimental groups in the tables. It is recommended that teachers and instructors be trained in teaching strategies that help them use English only in the English language classroom.
Abstract: English language (EL) and French language (FL) are very similar in their language attributes: lexicon, syntax, and tenses. During FL acquisition, especially in the early stages, FL learners are strongly influenced by EL forms. The main aim of this paper is to identify the differences and similarities between EL and FL, and then to suggest strategies for effectively restraining negative transfer, which would be helpful in third language acquisition.
Abstract: This article analyzes methods for creating an automated design system and presents a design system for a house foundation built from blocks. The creation methods are described with the Unified Modeling Language. The analyzed object classes are block, specification, and model. The graphical system can design the foundation, form a specification of objects, and create a 3D model of the house foundation. Concrete blocks come in several types and dimensions. The program arranges the selected blocks optimally so that the monolithic parts have minimal volume. The program selects house foundation blocks from a database using ActiveX Data Objects technology, which programmatically connects the drawing and the database. The drawing's graphical objects carry additional data through which data is exchanged between the graphical system and the database. The visualization system and an example house foundation project with specifications are presented. Problems in creating automated design systems are discussed and conclusions are drawn.
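The abstract does not say how the program arranges blocks, so the following is only one plausible sketch of the idea for a single straight wall segment: choose block lengths from an unlimited supply so that the uncovered remainder, which would have to be cast as monolithic concrete, is as short as possible. The function name, the integer length units, and the one-dimensional simplification are assumptions for illustration.

```python
def best_block_fill(wall_length, block_lengths):
    """Cover as much of wall_length as possible with the given block lengths.

    Lengths are positive integers in the same unit (for example centimeters).
    Returns (gap, blocks): gap is the uncovered length left for monolithic
    concrete, and blocks is one list of block lengths that achieves it.
    """
    # reachable[c] holds one block list whose lengths sum to exactly c.
    reachable = {0: []}
    for covered in range(1, wall_length + 1):
        for length in block_lengths:
            prev = covered - length
            if prev in reachable:
                reachable[covered] = reachable[prev] + [length]
                break
    best = max(reachable)
    return wall_length - best, reachable[best]
```

A real system would also have to handle corners, several block heights, and bonding rules between courses, none of which this sketch attempts.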
Abstract: The aim of the paper is to present various aspects of stereotyping in the context of FL (foreign language) learning and teaching and to discuss practical solutions for use in an FL classroom to teach the world about the world by questioning the stereotypes learners have of other nations and languages. The paper presents some ideas about FL teachers' role in developing students' socio-cultural competence, with the aim of raising their cross-cultural awareness and questioning the stereotypes students bring into an FL classroom. The methodology was an analysis of fragments of tapescripts from listening comprehension activities in a course book preparing Polish secondary students for the school leaving exam. The topics discussed concern opinions about, attitudes towards, and judgments of various cultural aspects, be it drinking tea or discussing the weather, impressions people have of other nations, or the languages people speak.
Abstract: Language anxiety and language motivation have been studied extensively by experts in foreign language (FL) learning. Both constructs have been found to correlate closely with FL achievement. This article reviews foreign language anxiety, language motivation, and the relationship between the two.
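Abstract: This paper surveys the current state of generative artificial intelligence (GenAI) in China and abroad, focusing on the gaps between China and the United States in computing power, data, algorithms, and ecosystems. To change China's lagging position in generative AI, it proposes countermeasures including energy-efficient computing infrastructure, federated data, domain-specific models, and a TAO-based federated ecosystem. It also discusses AI safety governance in the era of large models and offers an outlook on the future development of artificial general intelligence (AGI).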
Funding: Support from the Defense Threat Reduction Agency (DTRA) under Grant No. HDTRA12110012, with Dr. Richard Fry as the Program Officer, and partial project support from the Air Force Office of Scientific Research (AFOSR) under Grant No. FA9550-24-1-0017, with Dr. Chiping Li as the Program Officer.
Abstract: This research explores the integration of large language models (LLMs) into scientific data assimilation, focusing on combustion science as a case study. Leveraging foundation models integrated with a Retrieval-Augmented Generation (RAG) framework, the study introduces an approach to process diverse combustion research data, spanning experimental studies, simulations, and literature. The multifaceted nature of combustion research emphasizes the critical role of knowledge processing in navigating and extracting valuable information from a vast and diverse pool of sources. The developed approach minimizes computational and economic expenses while optimizing data privacy and accuracy. It incorporates prompt engineering and offline open-source LLMs, offering user autonomy in selecting base models. The study provides a thorough examination of text segmentation strategies, conducts comparative studies between LLMs, and explores various optimized prompts to demonstrate the effectiveness of the framework. By incorporating an external vector database, the framework outperforms a conventional LLM in generating accurate responses and constructing robust arguments. Additionally, the study investigates optimized prompt templates for efficient extraction of information from scientific literature. Furthermore, we present a targeted scaling study to quantify the algorithmic performance of the framework as the number of prompt tokens increases. The research addresses concerns related to hallucinations and false research articles by introducing a custom workflow with a detection algorithm that filters out inaccuracies. Despite identified areas for improvement, the framework consistently delivers accurate domain-specific responses with minimal human oversight. The prompt-agnostic approach introduced holds promise for future improvements. The study underscores the significance of integrating LLMs and knowledge processing techniques in scientific research, providing a foundation for advancements in data assimilation and utilization.
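The retrieval step behind a RAG framework of this kind (segment text, store vectors, retrieve the closest chunks, build a prompt) can be sketched as follows. This is a toy illustration and not the study's pipeline: the bag-of-words "embedding" stands in for a real embedding model and vector database, and all function names, the chunk size, and the prompt wording are assumptions.

```python
import math
import re
from collections import Counter


def split_into_chunks(text, max_words=40):
    """Segment text into chunks of at most max_words words."""
    words = text.split()
    return [" ".join(words[i:i + max_words]) for i in range(0, len(words), max_words)]


def embed(text):
    """Toy embedding: lowercase bag-of-words counts (stand-in for a real model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(u, v):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(u[t] * v[t] for t in u if t in v)
    norm = math.sqrt(sum(c * c for c in u.values())) * math.sqrt(sum(c * c for c in v.values()))
    return dot / norm if norm else 0.0


def retrieve(query, chunks, top_k=1):
    """Return the top_k chunks most similar to the query."""
    q = embed(query)
    ranked = sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)
    return ranked[:top_k]


def build_prompt(query, chunks, top_k=1):
    """Assemble a prompt that restricts the model to the retrieved context."""
    context = "\n".join(retrieve(query, chunks, top_k))
    return f"Answer using only the context below.\n\nContext:\n{context}\n\nQuestion: {query}"
```

Grounding the prompt in retrieved text is what lets such a framework answer from the indexed literature rather than from the model's parameters alone; the segmentation strategy decides what a "chunk" is and therefore what can be retrieved.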
Funding: The German Research Foundation (DFG), project number 442146713.
Abstract: In recent years, large language models have achieved breakthroughs on a wide range of benchmarks in natural language processing and continue to increase in performance. Recently, the advances of large language models have raised interest outside the natural language processing community and could have a large impact on daily life. In this paper, we pose the question: How will large language models and other foundation models shape the future product development process? We provide the reader with an overview of the subject by summarizing both recent advances in natural language processing and the use of information technology in the engineering design process. We argue that discourse should be regarded as the core of engineering design processes and should therefore be represented in a digital artifact. On this basis, we describe how foundation models such as large language models could contribute to the design discourse by automating parts of it that involve creativity and reasoning and were previously reserved for humans. We describe how simulations, experiments, topology optimizations, and other process steps can be integrated into a machine-actionable, discourse-centric design process. As an example, we present a design discourse on the optimization of wind turbine blades. Finally, we outline the future research that will be necessary to implement the conceptualized framework.
Funding: Supported by the National Natural Science Foundation of China (72088101, 42372175) and the PetroChina Science and Technology Innovation Fund Program (2021DQ02-0904).
Abstract: This article elucidates the concept of large model technology, summarizes the research status of large model technology both domestically and internationally, provides an overview of the application status of large models in vertical industries, outlines the challenges and issues confronted in applying large models in the oil and gas sector, and offers prospects for the application of large models in the oil and gas industry. Existing large models can be broadly divided into three categories: large language models, visual large models, and multimodal large models. The application of large models in the oil and gas industry is still in its infancy. Building on open-source large language models, some oil and gas enterprises have released large language model products using methods such as fine-tuning and retrieval-augmented generation. Scholars have attempted to develop scenario-specific models for oil and gas operations using visual or multimodal foundation models. A few researchers have constructed pre-trained foundation models for seismic data processing and interpretation, as well as core analysis. The application of large models in the oil and gas industry faces challenges such as data quantity and quality that are currently insufficient to support the training of large models, high research and development costs, and poor algorithm autonomy and control. The application of large models should be guided by the needs of the oil and gas business, taking it as an opportunity to improve data lifecycle management, enhance data governance capabilities, promote the construction of computing power, strengthen the construction of "artificial intelligence + energy" composite teams, and boost the autonomy and control of large model technology.
Abstract: EFL (English as a Foreign Language) speaking is a very demanding skill that requires learners' socio-pragmatic as well as strategic competence in any interactional situation, and lexis plays a crucial role in this process. However, few studies have investigated how EFL teachers and learners view and analyze situations in which learners do not produce enough spoken language in class, and the reasons behind them. The present study pinpoints the significant role of lexis in Moroccan learners' speaking production. To this end, 40 EFL teachers and 200 Moroccan high school students were surveyed and interviewed to reveal their perceptions of the speaking skill and the importance of lexis in it. Results show that both teachers and learners identify vocabulary deficiency as the main factor behind students' inability to speak English. Among the many suggestions that could be proposed to deal with this situation, the paper argues that one efficient way would be to assist students during L2 (second language) vocabulary learning through vocabulary learning strategy instruction. Pedagogical and research implications are given in response to the difficulties identified in this area by the EFL teachers and learners surveyed.