Since the 1950s,when the Turing Test was introduced,there has been notable progress in machine language intelligence.Language modeling,crucial for AI development,has evolved from statistical to neural models over the ...Since the 1950s,when the Turing Test was introduced,there has been notable progress in machine language intelligence.Language modeling,crucial for AI development,has evolved from statistical to neural models over the last two decades.Recently,transformer-based Pre-trained Language Models(PLM)have excelled in Natural Language Processing(NLP)tasks by leveraging large-scale training corpora.Increasing the scale of these models enhances performance significantly,introducing abilities like context learning that smaller models lack.The advancement in Large Language Models,exemplified by the development of ChatGPT,has made significant impacts both academically and industrially,capturing widespread societal interest.This survey provides an overview of the development and prospects from Large Language Models(LLM)to Large Multimodal Models(LMM).It first discusses the contributions and technological advancements of LLMs in the field of natural language processing,especially in text generation and language understanding.Then,it turns to the discussion of LMMs,which integrates various data modalities such as text,images,and sound,demonstrating advanced capabilities in understanding and generating cross-modal content,paving new pathways for the adaptability and flexibility of AI systems.Finally,the survey highlights the prospects of LMMs in terms of technological development and application potential,while also pointing out challenges in data integration,cross-modal understanding accuracy,providing a comprehensive perspective on the latest developments in this field.展开更多
Huntington'sdisease(HD)isahereditary neurodegenerative disorder for which there is currently no effectivetreatmentavailable.Consequently,the development of appropriate disease models is critical to thoroughly inve...Huntington'sdisease(HD)isahereditary neurodegenerative disorder for which there is currently no effectivetreatmentavailable.Consequently,the development of appropriate disease models is critical to thoroughly investigate disease progression.The genetic basis of HD involves the abnormal expansion of CAG repeats in the huntingtin(HTT)gene,leading to the expansion of a polyglutamine repeat in the HTT protein.Mutant HTT carrying the expanded polyglutamine repeat undergoes misfolding and forms aggregates in the brain,which precipitate selective neuronal loss in specific brain regions.Animal models play an important role in elucidating the pathogenesis of neurodegenerative disorders such as HD and in identifying potential therapeutic targets.Due to the marked species differences between rodents and larger animals,substantial efforts have been directed toward establishing large animal models for HD research.These models are pivotal for advancing the discovery of novel therapeutic targets,enhancing effective drug delivery methods,and improving treatment outcomes.We have explored the advantages of utilizing large animal models,particularly pigs,in previous reviews.Since then,however,significant progress has been made in developing more sophisticated animal models that faithfully replicate the typical pathology of HD.In the current review,we provide a comprehensive overview of large animal models of HD,incorporating recent findings regarding the establishment of HD knock-in(KI)pigs and their genetic therapy.We also explore the utilization of large animal models in HD research,with a focus on sheep,non-human primates(NHPs),and pigs.Our objective is to provide valuable insights into the application of these large animal models for the investigation and treatment of neurodegenerative disorders.展开更多
This article elucidates the concept of large model technology,summarizes the research status of large model technology both domestically and internationally,provides an overview of the application status of large mode...This article elucidates the concept of large model technology,summarizes the research status of large model technology both domestically and internationally,provides an overview of the application status of large models in vertical industries,outlines the challenges and issues confronted in applying large models in the oil and gas sector,and offers prospects for the application of large models in the oil and gas industry.The existing large models can be briefly divided into three categories:large language models,visual large models,and multimodal large models.The application of large models in the oil and gas industry is still in its infancy.Based on open-source large language models,some oil and gas enterprises have released large language model products using methods like fine-tuning and retrieval augmented generation.Scholars have attempted to develop scenario-specific models for oil and gas operations by using visual/multimodal foundation models.A few researchers have constructed pre-trained foundation models for seismic data processing and interpretation,as well as core analysis.The application of large models in the oil and gas industry faces challenges such as current data quantity and quality being difficult to support the training of large models,high research and development costs,and poor algorithm autonomy and control.The application of large models should be guided by the needs of oil and gas business,taking the application of large models as an opportunity to improve data lifecycle management,enhance data governance capabilities,promote the construction of computing power,strengthen the construction of“artificial intelligence+energy”composite teams,and boost the autonomy and control of large model technology.展开更多
Modern technological advancements have made social media an essential component of daily life.Social media allow individuals to share thoughts,emotions,and ideas.Sentiment analysis plays the function of evaluating whe...Modern technological advancements have made social media an essential component of daily life.Social media allow individuals to share thoughts,emotions,and ideas.Sentiment analysis plays the function of evaluating whether the sentiment of the text is positive,negative,neutral,or any other personal emotion to understand the sentiment context of the text.Sentiment analysis is essential in business and society because it impacts strategic decision-making.Sentiment analysis involves challenges due to lexical variation,an unlabeled dataset,and text distance correlations.The execution time increases due to the sequential processing of the sequence models.However,the calculation times for the Transformer models are reduced because of the parallel processing.This study uses a hybrid deep learning strategy to combine the strengths of the Transformer and Sequence models while ignoring their limitations.In particular,the proposed model integrates the Decoding-enhanced with Bidirectional Encoder Representations from Transformers(BERT)attention(DeBERTa)and the Gated Recurrent Unit(GRU)for sentiment analysis.Using the Decoding-enhanced BERT technique,the words are mapped into a compact,semantic word embedding space,and the Gated Recurrent Unit model can capture the distance contextual semantics correctly.The proposed hybrid model achieves F1-scores of 97%on the Twitter Large Language Model(LLM)dataset,which is much higher than the performance of new techniques.展开更多
This opinion paper explores the transformative potential of large language models(LLMs)in laparoscopic surgery and argues for their integration to enhance surgical education,decision support,reporting,and patient care...This opinion paper explores the transformative potential of large language models(LLMs)in laparoscopic surgery and argues for their integration to enhance surgical education,decision support,reporting,and patient care.LLMs can revolutionize surgical education by providing personalized learning experiences and accelerating skill acquisition.Intelligent decision support systems powered by LLMs can assist surgeons in making complex decisions,optimizing surgical workflows,and improving patient outcomes.Moreover,LLMs can automate surgical reporting and generate personalized patient education materials,streamlining documentation and improving patient engagement.However,challenges such as data scarcity,surgical semantic capture,real-time inference,and integration with existing systems need to be addressed for successful LLM integration.The future of laparoscopic surgery lies in the seamless integration of LLMs,enabling autonomous robotic surgery,predictive surgical planning,intraoperative decision support,virtual surgical assistants,and continuous learning.By harnessing the power of LLMs,laparoscopic surgery can be transformed,empowering surgeons and ultimately benefiting patients.展开更多
High-angle annular dark field(HAADF)imaging in scanning transmission electron microscopy(STEM)has become an indispensable tool in materials science due to its ability to offer sub-°A resolution and provide chemic...High-angle annular dark field(HAADF)imaging in scanning transmission electron microscopy(STEM)has become an indispensable tool in materials science due to its ability to offer sub-°A resolution and provide chemical information through Z-contrast.This study leverages large language models(LLMs)to conduct a comprehensive bibliometric analysis of a large amount of HAADF-related literature(more than 41000 papers).By using LLMs,specifically ChatGPT,we were able to extract detailed information on applications,sample preparation methods,instruments used,and study conclusions.The findings highlight the capability of LLMs to provide a new perspective into HAADF imaging,underscoring its increasingly important role in materials science.Moreover,the rich information extracted from these publications can be harnessed to develop AI models that enhance the automation and intelligence of electron microscopes.展开更多
In the process of constructing domain-specific knowledge graphs,the task of relational triple extraction plays a critical role in transforming unstructured text into structured information.Existing relational triple e...In the process of constructing domain-specific knowledge graphs,the task of relational triple extraction plays a critical role in transforming unstructured text into structured information.Existing relational triple extraction models facemultiple challenges when processing domain-specific data,including insufficient utilization of semantic interaction information between entities and relations,difficulties in handling challenging samples,and the scarcity of domain-specific datasets.To address these issues,our study introduces three innovative components:Relation semantic enhancement,data augmentation,and a voting strategy,all designed to significantly improve the model’s performance in tackling domain-specific relational triple extraction tasks.We first propose an innovative attention interaction module.This method significantly enhances the semantic interaction capabilities between entities and relations by integrating semantic information fromrelation labels.Second,we propose a voting strategy that effectively combines the strengths of large languagemodels(LLMs)and fine-tuned small pre-trained language models(SLMs)to reevaluate challenging samples,thereby improving the model’s adaptability in specific domains.Additionally,we explore the use of LLMs for data augmentation,aiming to generate domain-specific datasets to alleviate the scarcity of domain data.Experiments conducted on three domain-specific datasets demonstrate that our model outperforms existing comparative models in several aspects,with F1 scores exceeding the State of the Art models by 2%,1.6%,and 0.6%,respectively,validating the effectiveness and generalizability of our approach.展开更多
Accurately recommending candidate news to users is a basic challenge of personalized news recommendation systems.Traditional methods are usually difficult to learn and acquire complex semantic information in news text...Accurately recommending candidate news to users is a basic challenge of personalized news recommendation systems.Traditional methods are usually difficult to learn and acquire complex semantic information in news texts,resulting in unsatisfactory recommendation results.Besides,these traditional methods are more friendly to active users with rich historical behaviors.However,they can not effectively solve the long tail problem of inactive users.To address these issues,this research presents a novel general framework that combines Large Language Models(LLM)and Knowledge Graphs(KG)into traditional methods.To learn the contextual information of news text,we use LLMs’powerful text understanding ability to generate news representations with rich semantic information,and then,the generated news representations are used to enhance the news encoding in traditional methods.In addition,multi-hops relationship of news entities is mined and the structural information of news is encoded using KG,thus alleviating the challenge of long-tail distribution.Experimental results demonstrate that compared with various traditional models,on evaluation indicators such as AUC,MRR,nDCG@5 and nDCG@10,the framework significantly improves the recommendation performance.The successful integration of LLM and KG in our framework has established a feasible way for achieving more accurate personalized news recommendation.Our code is available at https://github.com/Xuan-ZW/LKPNR.展开更多
Modeling of unsteady aerodynamic loads at high angles of attack using a small amount of experimental or simulation data to construct predictive models for unknown states can greatly improve the efficiency of aircraft ...Modeling of unsteady aerodynamic loads at high angles of attack using a small amount of experimental or simulation data to construct predictive models for unknown states can greatly improve the efficiency of aircraft unsteady aerodynamic design and flight dynamics analysis.In this paper,aiming at the problems of poor generalization of traditional aerodynamic models and intelligent models,an intelligent aerodynamic modeling method based on gated neural units is proposed.The time memory characteristics of the gated neural unit is fully utilized,thus the nonlinear flow field characterization ability of the learning and training process is enhanced,and the generalization ability of the whole prediction model is improved.The prediction and verification of the model are carried out under the maneuvering flight condition of NACA0015 airfoil.The results show that the model has good adaptability.In the interpolation prediction,the maximum prediction error of the lift and drag coefficients and the moment coefficient does not exceed 10%,which can basically represent the variation characteristics of the entire flow field.In the construction of extrapolation models,the training model based on the strong nonlinear data has good accuracy for weak nonlinear prediction.Furthermore,the error is larger,even exceeding 20%,which indicates that the extrapolation and generalization capabilities need to be further optimized by integrating physical models.Compared with the conventional state space equation model,the proposed method can improve the extrapolation accuracy and efficiency by 78%and 60%,respectively,which demonstrates the applied potential of this method in aerodynamic modeling.展开更多
Students are considered one of the groups most affected by psychological pro-blems.Given the highly dangerous nature of mental illnesses and the increasing-ly serious state of global mental health,it is imperative for...Students are considered one of the groups most affected by psychological pro-blems.Given the highly dangerous nature of mental illnesses and the increasing-ly serious state of global mental health,it is imperative for us to explore new me-thods and approaches concerning the prevention and treatment of mental illne-sses.Large multimodal models(LMMs),as the most advanced artificial intelligen-ce models(i.e.ChatGPT-4),have brought new hope to the accurate prevention,diagnosis,and treatment of psychiatric disorders.The assistance of these models in the promotion of mental health is critical,as the latter necessitates a strong foundation of medical knowledge and professional skills,emotional support,stigma mitigation,the encouragement of more honest patient self-disclosure,reduced health care costs,improved medical efficiency,and greater mental health service coverage.However,these models must address challenges related to health,safety,hallucinations,and ethics simultaneously.In the future,we should address these challenges by developing relevant usage manuals,accountability rules,and legal regulations;implementing a human-centered approach;and intelligently upgrading LMMs through the deep optimization of such models,their algorithms,and other means.This effort will thus substantially contribute not only to the maintenance of students’health but also to the achievement of global sustainable development goals.展开更多
This letter evaluates the article by Gravina et al on ChatGPT’s potential in providing medical information for inflammatory bowel disease patients.While promising,it highlights the need for advanced techniques like r...This letter evaluates the article by Gravina et al on ChatGPT’s potential in providing medical information for inflammatory bowel disease patients.While promising,it highlights the need for advanced techniques like reasoning+action and retrieval-augmented generation to improve accuracy and reliability.Emphasizing that simple question and answer testing is insufficient,it calls for more nuanced evaluation methods to truly gauge large language models’capabilities in clinical applications.展开更多
The problematic use of social media has numerous negative impacts on individuals'daily lives,interpersonal relationships,physical and mental health,and more.Currently,there are few methods and tools to alleviate p...The problematic use of social media has numerous negative impacts on individuals'daily lives,interpersonal relationships,physical and mental health,and more.Currently,there are few methods and tools to alleviate problematic social media,and their potential is yet to be fully realized.Emerging large language models(LLMs)are becoming increasingly popular for providing information and assistance to people and are being applied in many aspects of life.In mitigating problematic social media use,LLMs such as ChatGPT can play a positive role by serving as conversational partners and outlets for users,providing personalized information and resources,monitoring and intervening in problematic social media use,and more.In this process,we should recognize both the enormous potential and endless possibilities of LLMs such as ChatGPT,leveraging their advantages to better address problematic social media use,while also acknowledging the limitations and potential pitfalls of ChatGPT technology,such as errors,limitations in issue resolution,privacy and security concerns,and potential overreliance.When we leverage the advantages of LLMs to address issues in social media usage,we must adopt a cautious and ethical approach,being vigilant of the potential adverse effects that LLMs may have in addressing problematic social media use to better harness technology to serve individuals and society.展开更多
The recent interest in the deployment of Generative AI applications that use large language models (LLMs) has brought to the forefront significant privacy concerns, notably the leakage of Personally Identifiable Infor...The recent interest in the deployment of Generative AI applications that use large language models (LLMs) has brought to the forefront significant privacy concerns, notably the leakage of Personally Identifiable Information (PII) and other confidential or protected information that may have been memorized during training, specifically during a fine-tuning or customization process. We describe different black-box attacks from potential adversaries and study their impact on the amount and type of information that may be recovered from commonly used and deployed LLMs. Our research investigates the relationship between PII leakage, memorization, and factors such as model size, architecture, and the nature of attacks employed. The study utilizes two broad categories of attacks: PII leakage-focused attacks (auto-completion and extraction attacks) and memorization-focused attacks (various membership inference attacks). The findings from these investigations are quantified using an array of evaluative metrics, providing a detailed understanding of LLM vulnerabilities and the effectiveness of different attacks.展开更多
Large Language Models (LLMs) have revolutionized Generative Artificial Intelligence (GenAI) tasks, becoming an integral part of various applications in society, including text generation, translation, summarization, a...Large Language Models (LLMs) have revolutionized Generative Artificial Intelligence (GenAI) tasks, becoming an integral part of various applications in society, including text generation, translation, summarization, and more. However, their widespread usage emphasizes the critical need to enhance their security posture to ensure the integrity and reliability of their outputs and minimize harmful effects. Prompt injections and training data poisoning attacks are two of the most prominent vulnerabilities in LLMs, which could potentially lead to unpredictable and undesirable behaviors, such as biased outputs, misinformation propagation, and even malicious content generation. The Common Vulnerability Scoring System (CVSS) framework provides a standardized approach to capturing the principal characteristics of vulnerabilities, facilitating a deeper understanding of their severity within the security and AI communities. By extending the current CVSS framework, we generate scores for these vulnerabilities such that organizations can prioritize mitigation efforts, allocate resources effectively, and implement targeted security measures to defend against potential risks.展开更多
Objective:Early and accurate identification of large vessel occlusion(LVO)acute ischemic stroke(AIS)patients is critically important for stroke management.Practicable scales with simple items can facilitate prehospita...Objective:Early and accurate identification of large vessel occlusion(LVO)acute ischemic stroke(AIS)patients is critically important for stroke management.Practicable scales with simple items can facilitate prehospital paramedics distinguishing LVO-AIS patients with high efficiency and help to avoid unnecessary and costly delays.The current study aims to develop a screening tool to predict AIS-LVO patients based on prehospital available data.Method:A total of 251 suspected stroke patients who were transported to the emergency department of our hospital via emergency medical services were consecutively enrolled from August,2020 to January,2022.Data including demographic information,medical history,clinical manifestations,and vital signs were collected.A multivariate logistic regression model was developed based on statistically significant variables selected from univariate analysis.Result:Forty-two patients(16.7%)were diagnosed as LVO-AIS based on imaging validation at admission.A comprehensive model was developed with past medical history factors such as atrial fibrillation and coronary heart disease,vital signs such as systolic blood pressure,and prominent symptoms and signs such as gaze palsy,facial paralysis,and dysarthria.The model showed better diagnostic performance in terms of area under the receiver operating characteristic curves(0.884,95%CI,0.830-0.939),which was higher than other common prehospital prediction scales such as the Face,Arm,Speech,Time test(FAST),the Field Assessment Stroke Triage for Emergency Destination(FAST-ED)scale,and the Gaze-Face-Arm-Speech-Time test(G-FAST).Calibration curve analysis,decision curve analysis,and clinical impact curve analysis further validated the reliability,net benefit,and potential clinical impact of the prediction model,respectively.Conclusion:We conducted a prediction model based on prehospital accessible factors including past history of atrial fibrillation and coronary heart disease,systolic blood pressure,and signs such as gaze palsy,facial palsy,and dysarthria.The prediction model showed good diagnostic power and accuracy for identification of the high-risk patients with LVO and may become an effective tool for the LVO recognition in prehospital settings.Future studies are warranted to refine and validate the model further in order to enhance the accuracy and objectivity of clinical judgments.展开更多
With the rapid development of artificial intelligence, large language models (LLMs) have demonstrated remarkable capabilities in natural language understanding and generation. These models have great potential to enha...With the rapid development of artificial intelligence, large language models (LLMs) have demonstrated remarkable capabilities in natural language understanding and generation. These models have great potential to enhance database query systems, enabling more intuitive and semantic query mechanisms. Our model leverages LLM’s deep learning architecture to interpret and process natural language queries and translate them into accurate database queries. The system integrates an LLM-powered semantic parser that translates user input into structured queries that can be understood by the database management system. First, the user query is pre-processed, the text is normalized, and the ambiguity is removed. This is followed by semantic parsing, where the LLM interprets the pre-processed text and identifies key entities and relationships. This is followed by query generation, which converts the parsed information into a structured query format and tailors it to the target database schema. Finally, there is query execution and feedback, where the resulting query is executed on the database and the results are returned to the user. The system also provides feedback mechanisms to improve and optimize future query interpretations. By using advanced LLMs for model implementation and fine-tuning on diverse datasets, the experimental results show that the proposed method significantly improves the accuracy and usability of database queries, making data retrieval easy for users without specialized knowledge.展开更多
In order to improve rib stability,failure criteria and instability mode of a thick coal seam with inter-band rock layer are analysed in this study.A three-dimensional mechanical model is established for the rib by con...In order to improve rib stability,failure criteria and instability mode of a thick coal seam with inter-band rock layer are analysed in this study.A three-dimensional mechanical model is established for the rib by considering the rock layer.A safety factor is defined foy the rib,and it is observed that the safety factor exhibits a positive correlation with the thickness and strength of the inter-band rock.A calculation method for determining critical parameters of the rock layer is presented to ensure the rib stability.It is revealed that incomplete propagation of the fracture at the hard rock constitutes a fundamental prerequisite for ensuring the rib stability.The influence of the position of the inter-band rock in the coal seam on failure mechanism of the rib was thoroughly investigated by developing a series of physical models for the rib at the face area.The best position for the inter-band rock in the coal seam is at a height of 1.5 m away from the roof line,which tends to provide a good stability state for the rib.For different inter-band rock positions,two ways of controlling rib by increasing supports stiffness and flexible grouting reinforcement are proposed.展开更多
A study was conducted to analyze the deformation mechanism of strongly weathered quartz schist in the Daliangshan Tunnel,located in the western Transverse Mountain area.A large deformation problem was experienced duri...A study was conducted to analyze the deformation mechanism of strongly weathered quartz schist in the Daliangshan Tunnel,located in the western Transverse Mountain area.A large deformation problem was experienced during the tunnel construction.To mitigate this problem,a support system was designed incorporating negative Poisson ratio(NPR)anchor cables with negative Poisson ratio effect.Physical model experiments,field experiments,and numerical simulation experiments were conducted to investigate the compensation mechanical behavior of NPR anchor cables.The large deformations of soft rocks in the Daliangshan Tunnel are caused by a high ground stress,a high degree of joint fracture development,and a high degree of surrounding rock fragmentation.A compensation mechanics support system combining long and short NPR anchor cables was suggested to provide sufficient counter-support force(approximately 350 kN)for the surrounding rock inside the tunnel.Comparing the NPR anchor cable support system with the original support system used in the Daliangshan tunnel showed that an NPR anchor cable support system,combining cables of 6.3 m and 10.3 m in length,effectively prevented convergence of surrounding rock deformation,and the integrated settlement convergence value remained below 300 mm.This study provides an effective scientific basis for resolving large deformation problems in deeply buried soft rocks in western transverse mountain areas.展开更多
Large animal models of cardiac ischemia-reperfusion are critical for evaluation of the efficacy of cardioprotective interventions prior to clinical translation.Nonetheless,current cardioprotective strategies/intervent...Large animal models of cardiac ischemia-reperfusion are critical for evaluation of the efficacy of cardioprotective interventions prior to clinical translation.Nonetheless,current cardioprotective strategies/interventions formulated in preclinical cardiovascular research are often limited to small animal models,which are not transferable or reproducible in large animal models due to different factors such as:(i)complex and varied features of human ischemic cardiac disease(ICD),which are challenging to mimic in animal models,(ii)significant differences in surgical techniques applied,and(iii)differences in cardiovascular anatomy and physiology between small versus large animals.This article highlights the advantages and disadvantages of different large animal models of preclinical cardiac ischemic reperfusion injury(IRI),as well as the different methods used to induce and assess IRI,and the obstacles faced in using large animals for translational research in the settings of cardiac IR.展开更多
Drying is a complicated physical process which involves simultaneous heat and mass transfer in the removal of solvents inside propellants.Inappropriate drying techniques may result in the formation of a hard skin laye...Drying is a complicated physical process which involves simultaneous heat and mass transfer in the removal of solvents inside propellants.Inappropriate drying techniques may result in the formation of a hard skin layer near the surface to block the free access of most solvent through for long stick propellants with large web thickness,which lead to lower drying efficiency and worse drying quality.This study aims to gain a comprehensive understanding of drying process and clarify the mechanism of the blocked layer near the propellant surface.A new three-dimensional coupled heat and mass transfer(3D-CHMT)model was successfully developed under transient conditions.The drying experiment results show that the 3DCHMT model could be applied to describe the drying process well since the relative error of the content of solvent between simulation and experiment values is only 5.5%.The solvent behavior simulation demonstrates that the mass transfer process can be divided into super-fast(SF)and subsequent minorfast(MF)stages,and the SF stage is vital to the prevention of the blocked layer against the free access for solvent molecules inside propellant grains.The effective solvent diffusion coefficient(Deff)of the propellant surface initially increases from 3.4×10^(-6)to 5.3×10^(-6)m^(2)/s as the temperature increases,and then decreases to 4.1×10^(-8)m^(2)/s at 60-100 min.The value of Deffof surface between 0-1.4 mm has a unique trend of change compared with other regions,and it is much lower than that of the internal at100 min under simulation conditions.Meanwhile,the temperature of the propellant surface increases rapidly at the SF stage(0-100 min)and then very slowly thereafter.Both the evolution of Deffand temperature distribution demonstrate that the blocked layer near the propellant surface has been formed in the time period of approximately 0-100 min and its thickness is about 1.4 mm.To mitigate the formation of blocked layer and improve its drying quality of finial propellant products effectively,it should be initially dried at lower drying temperature(30-40℃)in 0-100 min and then dried at higher drying temperature(50-60℃)to reduce drying time for later drying process in double base gun propellants.The present results can provide theoretical guidance for drying process and optimization of drying parameters for long stick propellants with large web thickness.展开更多
基金We acknowledge funding from NSFC Grant 62306283.
文摘Since the 1950s,when the Turing Test was introduced,there has been notable progress in machine language intelligence.Language modeling,crucial for AI development,has evolved from statistical to neural models over the last two decades.Recently,transformer-based Pre-trained Language Models(PLM)have excelled in Natural Language Processing(NLP)tasks by leveraging large-scale training corpora.Increasing the scale of these models enhances performance significantly,introducing abilities like context learning that smaller models lack.The advancement in Large Language Models,exemplified by the development of ChatGPT,has made significant impacts both academically and industrially,capturing widespread societal interest.This survey provides an overview of the development and prospects from Large Language Models(LLM)to Large Multimodal Models(LMM).It first discusses the contributions and technological advancements of LLMs in the field of natural language processing,especially in text generation and language understanding.Then,it turns to the discussion of LMMs,which integrates various data modalities such as text,images,and sound,demonstrating advanced capabilities in understanding and generating cross-modal content,paving new pathways for the adaptability and flexibility of AI systems.Finally,the survey highlights the prospects of LMMs in terms of technological development and application potential,while also pointing out challenges in data integration,cross-modal understanding accuracy,providing a comprehensive perspective on the latest developments in this field.
基金supported by the National Key Research and Development Program of China (2021YFA0805300,2021YFA0805200)National Natural Science Foundation of China (32170981,82371874,82394422,82171244,82071421,82271902)+1 种基金Guangzhou Key Research Program on Brain Science (202007030008)Department of Science and Technology of Guangdong Province (2021ZT09Y007,2020B121201006,2018B030337001)。
文摘Huntington'sdisease(HD)isahereditary neurodegenerative disorder for which there is currently no effectivetreatmentavailable.Consequently,the development of appropriate disease models is critical to thoroughly investigate disease progression.The genetic basis of HD involves the abnormal expansion of CAG repeats in the huntingtin(HTT)gene,leading to the expansion of a polyglutamine repeat in the HTT protein.Mutant HTT carrying the expanded polyglutamine repeat undergoes misfolding and forms aggregates in the brain,which precipitate selective neuronal loss in specific brain regions.Animal models play an important role in elucidating the pathogenesis of neurodegenerative disorders such as HD and in identifying potential therapeutic targets.Due to the marked species differences between rodents and larger animals,substantial efforts have been directed toward establishing large animal models for HD research.These models are pivotal for advancing the discovery of novel therapeutic targets,enhancing effective drug delivery methods,and improving treatment outcomes.We have explored the advantages of utilizing large animal models,particularly pigs,in previous reviews.Since then,however,significant progress has been made in developing more sophisticated animal models that faithfully replicate the typical pathology of HD.In the current review,we provide a comprehensive overview of large animal models of HD,incorporating recent findings regarding the establishment of HD knock-in(KI)pigs and their genetic therapy.We also explore the utilization of large animal models in HD research,with a focus on sheep,non-human primates(NHPs),and pigs.Our objective is to provide valuable insights into the application of these large animal models for the investigation and treatment of neurodegenerative disorders.
基金Supported by the National Natural Science Foundation of China(72088101,42372175)PetroChina Science and Technology Innovation Fund Program(2021DQ02-0904)。
文摘This article elucidates the concept of large model technology,summarizes the research status of large model technology both domestically and internationally,provides an overview of the application status of large models in vertical industries,outlines the challenges and issues confronted in applying large models in the oil and gas sector,and offers prospects for the application of large models in the oil and gas industry.The existing large models can be briefly divided into three categories:large language models,visual large models,and multimodal large models.The application of large models in the oil and gas industry is still in its infancy.Based on open-source large language models,some oil and gas enterprises have released large language model products using methods like fine-tuning and retrieval augmented generation.Scholars have attempted to develop scenario-specific models for oil and gas operations by using visual/multimodal foundation models.A few researchers have constructed pre-trained foundation models for seismic data processing and interpretation,as well as core analysis.The application of large models in the oil and gas industry faces challenges such as current data quantity and quality being difficult to support the training of large models,high research and development costs,and poor algorithm autonomy and control.The application of large models should be guided by the needs of oil and gas business,taking the application of large models as an opportunity to improve data lifecycle management,enhance data governance capabilities,promote the construction of computing power,strengthen the construction of“artificial intelligence+energy”composite teams,and boost the autonomy and control of large model technology.
文摘Modern technological advancements have made social media an essential component of daily life.Social media allow individuals to share thoughts,emotions,and ideas.Sentiment analysis plays the function of evaluating whether the sentiment of the text is positive,negative,neutral,or any other personal emotion to understand the sentiment context of the text.Sentiment analysis is essential in business and society because it impacts strategic decision-making.Sentiment analysis involves challenges due to lexical variation,an unlabeled dataset,and text distance correlations.The execution time increases due to the sequential processing of the sequence models.However,the calculation times for the Transformer models are reduced because of the parallel processing.This study uses a hybrid deep learning strategy to combine the strengths of the Transformer and Sequence models while ignoring their limitations.In particular,the proposed model integrates the Decoding-enhanced with Bidirectional Encoder Representations from Transformers(BERT)attention(DeBERTa)and the Gated Recurrent Unit(GRU)for sentiment analysis.Using the Decoding-enhanced BERT technique,the words are mapped into a compact,semantic word embedding space,and the Gated Recurrent Unit model can capture the distance contextual semantics correctly.The proposed hybrid model achieves F1-scores of 97%on the Twitter Large Language Model(LLM)dataset,which is much higher than the performance of new techniques.
文摘This opinion paper explores the transformative potential of large language models(LLMs)in laparoscopic surgery and argues for their integration to enhance surgical education,decision support,reporting,and patient care.LLMs can revolutionize surgical education by providing personalized learning experiences and accelerating skill acquisition.Intelligent decision support systems powered by LLMs can assist surgeons in making complex decisions,optimizing surgical workflows,and improving patient outcomes.Moreover,LLMs can automate surgical reporting and generate personalized patient education materials,streamlining documentation and improving patient engagement.However,challenges such as data scarcity,surgical semantic capture,real-time inference,and integration with existing systems need to be addressed for successful LLM integration.The future of laparoscopic surgery lies in the seamless integration of LLMs,enabling autonomous robotic surgery,predictive surgical planning,intraoperative decision support,virtual surgical assistants,and continuous learning.By harnessing the power of LLMs,laparoscopic surgery can be transformed,empowering surgeons and ultimately benefiting patients.
基金National Research Foundation(NRF)Singapore,under its NRF Fellowship(Grant No.NRFNRFF11-2019-0002).
文摘High-angle annular dark field(HAADF)imaging in scanning transmission electron microscopy(STEM)has become an indispensable tool in materials science due to its ability to offer sub-°A resolution and provide chemical information through Z-contrast.This study leverages large language models(LLMs)to conduct a comprehensive bibliometric analysis of a large amount of HAADF-related literature(more than 41000 papers).By using LLMs,specifically ChatGPT,we were able to extract detailed information on applications,sample preparation methods,instruments used,and study conclusions.The findings highlight the capability of LLMs to provide a new perspective into HAADF imaging,underscoring its increasingly important role in materials science.Moreover,the rich information extracted from these publications can be harnessed to develop AI models that enhance the automation and intelligence of electron microscopes.
基金Science and Technology Innovation 2030-Major Project of“New Generation Artificial Intelligence”granted by Ministry of Science and Technology,Grant Number 2020AAA0109300.
文摘In the process of constructing domain-specific knowledge graphs,the task of relational triple extraction plays a critical role in transforming unstructured text into structured information.Existing relational triple extraction models facemultiple challenges when processing domain-specific data,including insufficient utilization of semantic interaction information between entities and relations,difficulties in handling challenging samples,and the scarcity of domain-specific datasets.To address these issues,our study introduces three innovative components:Relation semantic enhancement,data augmentation,and a voting strategy,all designed to significantly improve the model’s performance in tackling domain-specific relational triple extraction tasks.We first propose an innovative attention interaction module.This method significantly enhances the semantic interaction capabilities between entities and relations by integrating semantic information fromrelation labels.Second,we propose a voting strategy that effectively combines the strengths of large languagemodels(LLMs)and fine-tuned small pre-trained language models(SLMs)to reevaluate challenging samples,thereby improving the model’s adaptability in specific domains.Additionally,we explore the use of LLMs for data augmentation,aiming to generate domain-specific datasets to alleviate the scarcity of domain data.Experiments conducted on three domain-specific datasets demonstrate that our model outperforms existing comparative models in several aspects,with F1 scores exceeding the State of the Art models by 2%,1.6%,and 0.6%,respectively,validating the effectiveness and generalizability of our approach.
基金supported by National Key R&D Program of China(2022QY2000-02).
文摘Accurately recommending candidate news to users is a basic challenge of personalized news recommendation systems.Traditional methods are usually difficult to learn and acquire complex semantic information in news texts,resulting in unsatisfactory recommendation results.Besides,these traditional methods are more friendly to active users with rich historical behaviors.However,they can not effectively solve the long tail problem of inactive users.To address these issues,this research presents a novel general framework that combines Large Language Models(LLM)and Knowledge Graphs(KG)into traditional methods.To learn the contextual information of news text,we use LLMs’powerful text understanding ability to generate news representations with rich semantic information,and then,the generated news representations are used to enhance the news encoding in traditional methods.In addition,multi-hops relationship of news entities is mined and the structural information of news is encoded using KG,thus alleviating the challenge of long-tail distribution.Experimental results demonstrate that compared with various traditional models,on evaluation indicators such as AUC,MRR,nDCG@5 and nDCG@10,the framework significantly improves the recommendation performance.The successful integration of LLM and KG in our framework has established a feasible way for achieving more accurate personalized news recommendation.Our code is available at https://github.com/Xuan-ZW/LKPNR.
基金supported in part by the National Natural Science Foundation of China (No. 12202363)。
文摘Modeling of unsteady aerodynamic loads at high angles of attack using a small amount of experimental or simulation data to construct predictive models for unknown states can greatly improve the efficiency of aircraft unsteady aerodynamic design and flight dynamics analysis.In this paper,aiming at the problems of poor generalization of traditional aerodynamic models and intelligent models,an intelligent aerodynamic modeling method based on gated neural units is proposed.The time memory characteristics of the gated neural unit is fully utilized,thus the nonlinear flow field characterization ability of the learning and training process is enhanced,and the generalization ability of the whole prediction model is improved.The prediction and verification of the model are carried out under the maneuvering flight condition of NACA0015 airfoil.The results show that the model has good adaptability.In the interpolation prediction,the maximum prediction error of the lift and drag coefficients and the moment coefficient does not exceed 10%,which can basically represent the variation characteristics of the entire flow field.In the construction of extrapolation models,the training model based on the strong nonlinear data has good accuracy for weak nonlinear prediction.Furthermore,the error is larger,even exceeding 20%,which indicates that the extrapolation and generalization capabilities need to be further optimized by integrating physical models.Compared with the conventional state space equation model,the proposed method can improve the extrapolation accuracy and efficiency by 78%and 60%,respectively,which demonstrates the applied potential of this method in aerodynamic modeling.
文摘Students are considered one of the groups most affected by psychological pro-blems.Given the highly dangerous nature of mental illnesses and the increasing-ly serious state of global mental health,it is imperative for us to explore new me-thods and approaches concerning the prevention and treatment of mental illne-sses.Large multimodal models(LMMs),as the most advanced artificial intelligen-ce models(i.e.ChatGPT-4),have brought new hope to the accurate prevention,diagnosis,and treatment of psychiatric disorders.The assistance of these models in the promotion of mental health is critical,as the latter necessitates a strong foundation of medical knowledge and professional skills,emotional support,stigma mitigation,the encouragement of more honest patient self-disclosure,reduced health care costs,improved medical efficiency,and greater mental health service coverage.However,these models must address challenges related to health,safety,hallucinations,and ethics simultaneously.In the future,we should address these challenges by developing relevant usage manuals,accountability rules,and legal regulations;implementing a human-centered approach;and intelligently upgrading LMMs through the deep optimization of such models,their algorithms,and other means.This effort will thus substantially contribute not only to the maintenance of students’health but also to the achievement of global sustainable development goals.
文摘This letter evaluates the article by Gravina et al on ChatGPT’s potential in providing medical information for inflammatory bowel disease patients.While promising,it highlights the need for advanced techniques like reasoning+action and retrieval-augmented generation to improve accuracy and reliability.Emphasizing that simple question and answer testing is insufficient,it calls for more nuanced evaluation methods to truly gauge large language models’capabilities in clinical applications.
文摘The problematic use of social media has numerous negative impacts on individuals'daily lives,interpersonal relationships,physical and mental health,and more.Currently,there are few methods and tools to alleviate problematic social media,and their potential is yet to be fully realized.Emerging large language models(LLMs)are becoming increasingly popular for providing information and assistance to people and are being applied in many aspects of life.In mitigating problematic social media use,LLMs such as ChatGPT can play a positive role by serving as conversational partners and outlets for users,providing personalized information and resources,monitoring and intervening in problematic social media use,and more.In this process,we should recognize both the enormous potential and endless possibilities of LLMs such as ChatGPT,leveraging their advantages to better address problematic social media use,while also acknowledging the limitations and potential pitfalls of ChatGPT technology,such as errors,limitations in issue resolution,privacy and security concerns,and potential overreliance.When we leverage the advantages of LLMs to address issues in social media usage,we must adopt a cautious and ethical approach,being vigilant of the potential adverse effects that LLMs may have in addressing problematic social media use to better harness technology to serve individuals and society.
文摘The recent interest in the deployment of Generative AI applications that use large language models (LLMs) has brought to the forefront significant privacy concerns, notably the leakage of Personally Identifiable Information (PII) and other confidential or protected information that may have been memorized during training, specifically during a fine-tuning or customization process. We describe different black-box attacks from potential adversaries and study their impact on the amount and type of information that may be recovered from commonly used and deployed LLMs. Our research investigates the relationship between PII leakage, memorization, and factors such as model size, architecture, and the nature of attacks employed. The study utilizes two broad categories of attacks: PII leakage-focused attacks (auto-completion and extraction attacks) and memorization-focused attacks (various membership inference attacks). The findings from these investigations are quantified using an array of evaluative metrics, providing a detailed understanding of LLM vulnerabilities and the effectiveness of different attacks.
文摘Large Language Models (LLMs) have revolutionized Generative Artificial Intelligence (GenAI) tasks, becoming an integral part of various applications in society, including text generation, translation, summarization, and more. However, their widespread usage emphasizes the critical need to enhance their security posture to ensure the integrity and reliability of their outputs and minimize harmful effects. Prompt injections and training data poisoning attacks are two of the most prominent vulnerabilities in LLMs, which could potentially lead to unpredictable and undesirable behaviors, such as biased outputs, misinformation propagation, and even malicious content generation. The Common Vulnerability Scoring System (CVSS) framework provides a standardized approach to capturing the principal characteristics of vulnerabilities, facilitating a deeper understanding of their severity within the security and AI communities. By extending the current CVSS framework, we generate scores for these vulnerabilities such that organizations can prioritize mitigation efforts, allocate resources effectively, and implement targeted security measures to defend against potential risks.
基金sponsored by National Natural Science Foundation of China(No.82101389 and 81971114)Beijing Nova Program(No.20230484286)General Project of Science and Technology of Beijing Municipal Education Commission(No.KM202110025018).
文摘Objective:Early and accurate identification of large vessel occlusion(LVO)acute ischemic stroke(AIS)patients is critically important for stroke management.Practicable scales with simple items can facilitate prehospital paramedics distinguishing LVO-AIS patients with high efficiency and help to avoid unnecessary and costly delays.The current study aims to develop a screening tool to predict AIS-LVO patients based on prehospital available data.Method:A total of 251 suspected stroke patients who were transported to the emergency department of our hospital via emergency medical services were consecutively enrolled from August,2020 to January,2022.Data including demographic information,medical history,clinical manifestations,and vital signs were collected.A multivariate logistic regression model was developed based on statistically significant variables selected from univariate analysis.Result:Forty-two patients(16.7%)were diagnosed as LVO-AIS based on imaging validation at admission.A comprehensive model was developed with past medical history factors such as atrial fibrillation and coronary heart disease,vital signs such as systolic blood pressure,and prominent symptoms and signs such as gaze palsy,facial paralysis,and dysarthria.The model showed better diagnostic performance in terms of area under the receiver operating characteristic curves(0.884,95%CI,0.830-0.939),which was higher than other common prehospital prediction scales such as the Face,Arm,Speech,Time test(FAST),the Field Assessment Stroke Triage for Emergency Destination(FAST-ED)scale,and the Gaze-Face-Arm-Speech-Time test(G-FAST).Calibration curve analysis,decision curve analysis,and clinical impact curve analysis further validated the reliability,net benefit,and potential clinical impact of the prediction model,respectively.Conclusion:We conducted a prediction model based on prehospital accessible factors including past history of atrial fibrillation and coronary heart disease,systolic blood pressure,and signs such as gaze palsy,facial palsy,and dysarthria.The prediction model showed good diagnostic power and accuracy for identification of the high-risk patients with LVO and may become an effective tool for the LVO recognition in prehospital settings.Future studies are warranted to refine and validate the model further in order to enhance the accuracy and objectivity of clinical judgments.
文摘With the rapid development of artificial intelligence, large language models (LLMs) have demonstrated remarkable capabilities in natural language understanding and generation. These models have great potential to enhance database query systems, enabling more intuitive and semantic query mechanisms. Our model leverages LLM’s deep learning architecture to interpret and process natural language queries and translate them into accurate database queries. The system integrates an LLM-powered semantic parser that translates user input into structured queries that can be understood by the database management system. First, the user query is pre-processed, the text is normalized, and the ambiguity is removed. This is followed by semantic parsing, where the LLM interprets the pre-processed text and identifies key entities and relationships. This is followed by query generation, which converts the parsed information into a structured query format and tailors it to the target database schema. Finally, there is query execution and feedback, where the resulting query is executed on the database and the results are returned to the user. The system also provides feedback mechanisms to improve and optimize future query interpretations. By using advanced LLMs for model implementation and fine-tuning on diverse datasets, the experimental results show that the proposed method significantly improves the accuracy and usability of database queries, making data retrieval easy for users without specialized knowledge.
基金financial support from the National Key Research and Development Program of China (No.2023YFC2907501)the National Natural Science Foundation of China (No.52374106)the Fundamental Research Funds for the Central Universities (No.2023ZKPYNY01)。
文摘In order to improve rib stability,failure criteria and instability mode of a thick coal seam with inter-band rock layer are analysed in this study.A three-dimensional mechanical model is established for the rib by considering the rock layer.A safety factor is defined foy the rib,and it is observed that the safety factor exhibits a positive correlation with the thickness and strength of the inter-band rock.A calculation method for determining critical parameters of the rock layer is presented to ensure the rib stability.It is revealed that incomplete propagation of the fracture at the hard rock constitutes a fundamental prerequisite for ensuring the rib stability.The influence of the position of the inter-band rock in the coal seam on failure mechanism of the rib was thoroughly investigated by developing a series of physical models for the rib at the face area.The best position for the inter-band rock in the coal seam is at a height of 1.5 m away from the roof line,which tends to provide a good stability state for the rib.For different inter-band rock positions,two ways of controlling rib by increasing supports stiffness and flexible grouting reinforcement are proposed.
基金Project(41941018)supported by the National Natural Science Foundation of China for the Special Project FundingProject(22-JKCF-08)supported by the Study on in-situ Stress Database and 3D in-situ Stress Inversion Technology of Highway Tunnel in Shanxi Province,China+1 种基金Project(2022-JKKJ-6)supported by the Study on Disaster Mechanism and NPR Anchor Cable Prevention and Control of Coal Mining Caving Subsidence in Operating Tunnel in Mountainous Area,ChinaProject(BBJ2024032)supported by the Fundamental Research Funds for the Central Universities(PhD Top Innovative Talents Fund of CUMTB),China。
文摘A study was conducted to analyze the deformation mechanism of strongly weathered quartz schist in the Daliangshan Tunnel,located in the western Transverse Mountain area.A large deformation problem was experienced during the tunnel construction.To mitigate this problem,a support system was designed incorporating negative Poisson ratio(NPR)anchor cables with negative Poisson ratio effect.Physical model experiments,field experiments,and numerical simulation experiments were conducted to investigate the compensation mechanical behavior of NPR anchor cables.The large deformations of soft rocks in the Daliangshan Tunnel are caused by a high ground stress,a high degree of joint fracture development,and a high degree of surrounding rock fragmentation.A compensation mechanics support system combining long and short NPR anchor cables was suggested to provide sufficient counter-support force(approximately 350 kN)for the surrounding rock inside the tunnel.Comparing the NPR anchor cable support system with the original support system used in the Daliangshan tunnel showed that an NPR anchor cable support system,combining cables of 6.3 m and 10.3 m in length,effectively prevented convergence of surrounding rock deformation,and the integrated settlement convergence value remained below 300 mm.This study provides an effective scientific basis for resolving large deformation problems in deeply buried soft rocks in western transverse mountain areas.
基金supported by the Early Career Scheme(ECS)2022/23(CUHK 24110822)from the Research Grants Council of Hong Kongthe Direct Grant for Research 2020/21(2020.035)+3 种基金Project Impact Enhancement Fund(PIEF)(PIEF/Ph2/COVID/08)Improvement on Competitiveness in Hiring New Faculties Funding Scheme from CUHK as well as the Centre for Cardiovascular Genomics and Medicine(CCGM)of the Lui Che Woo Institute of Innovative Medicine CUHK(to S.B.O.)a CUHK Department of Medicine&Therapeutics(MEDT)-funded PhD studenta CUHK Vice-Chancellor’s PhD Scholarship holder。
文摘Large animal models of cardiac ischemia-reperfusion are critical for evaluation of the efficacy of cardioprotective interventions prior to clinical translation.Nonetheless,current cardioprotective strategies/interventions formulated in preclinical cardiovascular research are often limited to small animal models,which are not transferable or reproducible in large animal models due to different factors such as:(i)complex and varied features of human ischemic cardiac disease(ICD),which are challenging to mimic in animal models,(ii)significant differences in surgical techniques applied,and(iii)differences in cardiovascular anatomy and physiology between small versus large animals.This article highlights the advantages and disadvantages of different large animal models of preclinical cardiac ischemic reperfusion injury(IRI),as well as the different methods used to induce and assess IRI,and the obstacles faced in using large animals for translational research in the settings of cardiac IR.
基金supported by the National Natural Science Foundation of China(Grant No.22075146)。
文摘Drying is a complicated physical process which involves simultaneous heat and mass transfer in the removal of solvents inside propellants.Inappropriate drying techniques may result in the formation of a hard skin layer near the surface to block the free access of most solvent through for long stick propellants with large web thickness,which lead to lower drying efficiency and worse drying quality.This study aims to gain a comprehensive understanding of drying process and clarify the mechanism of the blocked layer near the propellant surface.A new three-dimensional coupled heat and mass transfer(3D-CHMT)model was successfully developed under transient conditions.The drying experiment results show that the 3DCHMT model could be applied to describe the drying process well since the relative error of the content of solvent between simulation and experiment values is only 5.5%.The solvent behavior simulation demonstrates that the mass transfer process can be divided into super-fast(SF)and subsequent minorfast(MF)stages,and the SF stage is vital to the prevention of the blocked layer against the free access for solvent molecules inside propellant grains.The effective solvent diffusion coefficient(Deff)of the propellant surface initially increases from 3.4×10^(-6)to 5.3×10^(-6)m^(2)/s as the temperature increases,and then decreases to 4.1×10^(-8)m^(2)/s at 60-100 min.The value of Deffof surface between 0-1.4 mm has a unique trend of change compared with other regions,and it is much lower than that of the internal at100 min under simulation conditions.Meanwhile,the temperature of the propellant surface increases rapidly at the SF stage(0-100 min)and then very slowly thereafter.Both the evolution of Deffand temperature distribution demonstrate that the blocked layer near the propellant surface has been formed in the time period of approximately 0-100 min and its thickness is about 1.4 mm.To mitigate the formation of blocked layer and improve its drying quality of finial propellant products effectively,it should be initially dried at lower drying temperature(30-40℃)in 0-100 min and then dried at higher drying temperature(50-60℃)to reduce drying time for later drying process in double base gun propellants.The present results can provide theoretical guidance for drying process and optimization of drying parameters for long stick propellants with large web thickness.