Based on the data from experiment, several mathematical models for describing the ecophysiological characteristics ofPlagiomnium acutum and the relationship between photosynthetic active radiation (R)P, temperature of...Based on the data from experiment, several mathematical models for describing the ecophysiological characteristics ofPlagiomnium acutum and the relationship between photosynthetic active radiation (R)P, temperature of atmospheric (T a), relative humidity (H r), concentration of carbon dioxide were established with the method of regression. The biological meanings of the models were analyzed primarily, which showed significance in both theory and application.展开更多
针对干果图像信息量大、分类精度低和耗时多的特点,提出利用Bag of Words模型提取图片的代表特征,并采用朴素贝叶斯分类器指导特征矩阵分类。结果表明,图像分类精度能达到80%,分类处理时间约为2 s。通过增加学习样本来进一步提高分类精...针对干果图像信息量大、分类精度低和耗时多的特点,提出利用Bag of Words模型提取图片的代表特征,并采用朴素贝叶斯分类器指导特征矩阵分类。结果表明,图像分类精度能达到80%,分类处理时间约为2 s。通过增加学习样本来进一步提高分类精度,将Bag of Words应用于干果图像识别和分类是可行的。展开更多
The paper relies on the language management model. First of all, the thousands of new words coined each year in China can be roughly divided into eight groups. Besides, it exemplifies the problems that have occurred i...The paper relies on the language management model. First of all, the thousands of new words coined each year in China can be roughly divided into eight groups. Besides, it exemplifies the problems that have occurred in communication. Last but not least, to reflect language management acts/processes, it reports the efforts the language authorities and governmental bodies have made in the struggle to solve the problems. It attempts to remind the language management agencies of the point that for the use of new words, the language users are always right as long as communication flows without problems in understanding.展开更多
Newtonian, Quemada and Casson blood viscosity models are implemented in order to simulate the rheological behavior of blood under pulsating flow conditions in a patient specific iliac bifurcation. The influence of the...Newtonian, Quemada and Casson blood viscosity models are implemented in order to simulate the rheological behavior of blood under pulsating flow conditions in a patient specific iliac bifurcation. The influence of the applied blood constitutive equations is monitored via the wall shear stress (WSS) distribution, magnitude and oscillations, non-Newtonian importance factors, and viscosity values according to the shear rate. The distribution of WSS on the vascular wall follows a pattern which is independent of the theological model chosen. On the other hand, the WSS magnitude and oscillations are directly related to the blood constitutive equations applied and the shear rate. It is concluded that the Newtonian approximation is satisfactory only in high shear and flow rates. Moreover, the Newtonian model seems to overestimate the possibility for the formation of atherosclerotic lesions or aneurysms at sites of the vascular wall where the WSS are oscillating.展开更多
Trained on a large corpus,pretrained models(PTMs)can capture different levels of concepts in context and hence generate universal language representations,which greatly benefit downstream natural language processing(N...Trained on a large corpus,pretrained models(PTMs)can capture different levels of concepts in context and hence generate universal language representations,which greatly benefit downstream natural language processing(NLP)tasks.In recent years,PTMs have been widely used in most NLP applications,especially for high-resource languages,such as English and Chinese.However,scarce resources have discouraged the progress of PTMs for low-resource languages.Transformer-based PTMs for the Khmer language are presented in this work for the first time.We evaluate our models on two downstream tasks:Part-of-speech tagging and news categorization.The dataset for the latter task is self-constructed.Experiments demonstrate the effectiveness of the Khmer models.In addition,we find that the current Khmer word segmentation technology does not aid performance improvement.We aim to release our models and datasets to the community in hopes of facilitating the future development of Khmer NLP applications.展开更多
The essay tends to analyze the translation of Chinese Culture-loaded words from the perspective of Relevance Theory.The theory gains its prominence by studying the translation process and transcending the conflicts be...The essay tends to analyze the translation of Chinese Culture-loaded words from the perspective of Relevance Theory.The theory gains its prominence by studying the translation process and transcending the conflicts between literal and free translation. It incorporates recent work in cognitive linguistics, with ostensive-inference as its key model. Under the influence of Relevance theory, the translation of culture-loaded words is reader-oriented. Translators are obliged to help target readers to establish new assumptions to achieve equivalent response.展开更多
Recently, the emergence of pre-trained models(PTMs) has brought natural language processing(NLP) to a new era. In this survey, we provide a comprehensive review of PTMs for NLP. We first briefly introduce language rep...Recently, the emergence of pre-trained models(PTMs) has brought natural language processing(NLP) to a new era. In this survey, we provide a comprehensive review of PTMs for NLP. We first briefly introduce language representation learning and its research progress. Then we systematically categorize existing PTMs based on a taxonomy from four different perspectives. Next,we describe how to adapt the knowledge of PTMs to downstream tasks. Finally, we outline some potential directions of PTMs for future research. This survey is purposed to be a hands-on guide for understanding, using, and developing PTMs for various NLP tasks.展开更多
A novel framework for fuzzy modeling and model-based control design is described. Based on the theory of fuzzy constraint processing, the fuzzy model can be viewed as a generalized Takagi-Sugeno (TS) fuzzy model wit...A novel framework for fuzzy modeling and model-based control design is described. Based on the theory of fuzzy constraint processing, the fuzzy model can be viewed as a generalized Takagi-Sugeno (TS) fuzzy model with fuzzy functional consequences. It uses multivariate antecedent membership functions obtained by granular-prototype fuzzy clustering methods and consequent fuzzy equations obtained by fuzzy regression techniques. Constrained optimization is used to estimate the consequent parameters, where the constraints are based on control-relevant a priori knowledge about the modeled process. The fuzzy-constraint-based approach provides the following features. 1) The knowledge base of a constraint-based fuzzy model can incorporate information with various types of fuzzy predicates. Consequently, it is easy to provide a fusion of different types of knowledge. The knowledge can be from data-driven approaches and/or from controlrelevant physical models. 2) A corresponding inference mechanism for the proposed model can deal with heterogeneous information granules. 3) Both numerical and linguistic inputs can be accepted for predicting new outputs. The proposed techniques are demonstrated by means of two examples: a nonlinear function-fitting problem and the well-known Box-Jenkins gas furnace process. The first example shows that the proposed model uses fewer fuzzy predicates achieving similar results with the traditional rule-based approach, while the second shows the performance can be significantly improved when the control-relevant constraints are considered.展开更多
针对畜禽疫病文本语料匮乏、文本内包含大量疫病名称及短语等未登录词问题,提出了一种结合词典匹配的BERT-BiLSTM-CRF畜禽疫病文本分词模型。以羊疫病为研究对象,构建了常见疫病文本数据集,将其与通用语料PKU结合,利用BERT(Bidirectiona...针对畜禽疫病文本语料匮乏、文本内包含大量疫病名称及短语等未登录词问题,提出了一种结合词典匹配的BERT-BiLSTM-CRF畜禽疫病文本分词模型。以羊疫病为研究对象,构建了常见疫病文本数据集,将其与通用语料PKU结合,利用BERT(Bidirectional encoder representation from transformers)预训练语言模型进行文本向量化表示;通过双向长短时记忆网络(Bidirectional long short-term memory network,BiLSTM)获取上下文语义特征;由条件随机场(Conditional random field,CRF)输出全局最优标签序列。基于此,在CRF层后加入畜禽疫病领域词典进行分词匹配修正,减少在分词过程中出现的疫病名称及短语等造成的歧义切分,进一步提高了分词准确率。实验结果表明,结合词典匹配的BERT-BiLSTM-CRF模型在羊常见疫病文本数据集上的F1值为96.38%,与jieba分词器、BiLSTM-Softmax模型、BiLSTM-CRF模型、未结合词典匹配的本文模型相比,分别提升11.01、10.62、8.3、0.72个百分点,验证了方法的有效性。与单一语料相比,通用语料PKU和羊常见疫病文本数据集结合的混合语料,能够同时对畜禽疫病专业术语及疫病文本中常用词进行准确切分,在通用语料及疫病文本数据集上F1值都达到95%以上,具有较好的模型泛化能力。该方法可用于畜禽疫病文本分词。展开更多
文摘Based on the data from experiment, several mathematical models for describing the ecophysiological characteristics ofPlagiomnium acutum and the relationship between photosynthetic active radiation (R)P, temperature of atmospheric (T a), relative humidity (H r), concentration of carbon dioxide were established with the method of regression. The biological meanings of the models were analyzed primarily, which showed significance in both theory and application.
文摘针对干果图像信息量大、分类精度低和耗时多的特点,提出利用Bag of Words模型提取图片的代表特征,并采用朴素贝叶斯分类器指导特征矩阵分类。结果表明,图像分类精度能达到80%,分类处理时间约为2 s。通过增加学习样本来进一步提高分类精度,将Bag of Words应用于干果图像识别和分类是可行的。
文摘The paper relies on the language management model. First of all, the thousands of new words coined each year in China can be roughly divided into eight groups. Besides, it exemplifies the problems that have occurred in communication. Last but not least, to reflect language management acts/processes, it reports the efforts the language authorities and governmental bodies have made in the struggle to solve the problems. It attempts to remind the language management agencies of the point that for the use of new words, the language users are always right as long as communication flows without problems in understanding.
基金supported by the National Strategic Reference Framework(NSRF)2007-2013 project DEKA:“Integrated prognostic system for risk assessment in stent implantations for Abdominal Aortic Aneurysm repair”(Grant No.09SYN-12-1153)
文摘Newtonian, Quemada and Casson blood viscosity models are implemented in order to simulate the rheological behavior of blood under pulsating flow conditions in a patient specific iliac bifurcation. The influence of the applied blood constitutive equations is monitored via the wall shear stress (WSS) distribution, magnitude and oscillations, non-Newtonian importance factors, and viscosity values according to the shear rate. The distribution of WSS on the vascular wall follows a pattern which is independent of the theological model chosen. On the other hand, the WSS magnitude and oscillations are directly related to the blood constitutive equations applied and the shear rate. It is concluded that the Newtonian approximation is satisfactory only in high shear and flow rates. Moreover, the Newtonian model seems to overestimate the possibility for the formation of atherosclerotic lesions or aneurysms at sites of the vascular wall where the WSS are oscillating.
基金supported by the Major Projects of Guangdong Education Department for Foundation Research and Applied Research(No.2017KZDXM031)Guangzhou Science and Technology Plan Project(No.202009010021)。
文摘Trained on a large corpus,pretrained models(PTMs)can capture different levels of concepts in context and hence generate universal language representations,which greatly benefit downstream natural language processing(NLP)tasks.In recent years,PTMs have been widely used in most NLP applications,especially for high-resource languages,such as English and Chinese.However,scarce resources have discouraged the progress of PTMs for low-resource languages.Transformer-based PTMs for the Khmer language are presented in this work for the first time.We evaluate our models on two downstream tasks:Part-of-speech tagging and news categorization.The dataset for the latter task is self-constructed.Experiments demonstrate the effectiveness of the Khmer models.In addition,we find that the current Khmer word segmentation technology does not aid performance improvement.We aim to release our models and datasets to the community in hopes of facilitating the future development of Khmer NLP applications.
文摘The essay tends to analyze the translation of Chinese Culture-loaded words from the perspective of Relevance Theory.The theory gains its prominence by studying the translation process and transcending the conflicts between literal and free translation. It incorporates recent work in cognitive linguistics, with ostensive-inference as its key model. Under the influence of Relevance theory, the translation of culture-loaded words is reader-oriented. Translators are obliged to help target readers to establish new assumptions to achieve equivalent response.
基金the National Natural Science Foundation of China(Grant Nos.61751201 and 61672162)the Shanghai Municipal Science and Technology Major Project(Grant No.2018SHZDZX01)and ZJLab。
文摘Recently, the emergence of pre-trained models(PTMs) has brought natural language processing(NLP) to a new era. In this survey, we provide a comprehensive review of PTMs for NLP. We first briefly introduce language representation learning and its research progress. Then we systematically categorize existing PTMs based on a taxonomy from four different perspectives. Next,we describe how to adapt the knowledge of PTMs to downstream tasks. Finally, we outline some potential directions of PTMs for future research. This survey is purposed to be a hands-on guide for understanding, using, and developing PTMs for various NLP tasks.
文摘A novel framework for fuzzy modeling and model-based control design is described. Based on the theory of fuzzy constraint processing, the fuzzy model can be viewed as a generalized Takagi-Sugeno (TS) fuzzy model with fuzzy functional consequences. It uses multivariate antecedent membership functions obtained by granular-prototype fuzzy clustering methods and consequent fuzzy equations obtained by fuzzy regression techniques. Constrained optimization is used to estimate the consequent parameters, where the constraints are based on control-relevant a priori knowledge about the modeled process. The fuzzy-constraint-based approach provides the following features. 1) The knowledge base of a constraint-based fuzzy model can incorporate information with various types of fuzzy predicates. Consequently, it is easy to provide a fusion of different types of knowledge. The knowledge can be from data-driven approaches and/or from controlrelevant physical models. 2) A corresponding inference mechanism for the proposed model can deal with heterogeneous information granules. 3) Both numerical and linguistic inputs can be accepted for predicting new outputs. The proposed techniques are demonstrated by means of two examples: a nonlinear function-fitting problem and the well-known Box-Jenkins gas furnace process. The first example shows that the proposed model uses fewer fuzzy predicates achieving similar results with the traditional rule-based approach, while the second shows the performance can be significantly improved when the control-relevant constraints are considered.
文摘针对畜禽疫病文本语料匮乏、文本内包含大量疫病名称及短语等未登录词问题,提出了一种结合词典匹配的BERT-BiLSTM-CRF畜禽疫病文本分词模型。以羊疫病为研究对象,构建了常见疫病文本数据集,将其与通用语料PKU结合,利用BERT(Bidirectional encoder representation from transformers)预训练语言模型进行文本向量化表示;通过双向长短时记忆网络(Bidirectional long short-term memory network,BiLSTM)获取上下文语义特征;由条件随机场(Conditional random field,CRF)输出全局最优标签序列。基于此,在CRF层后加入畜禽疫病领域词典进行分词匹配修正,减少在分词过程中出现的疫病名称及短语等造成的歧义切分,进一步提高了分词准确率。实验结果表明,结合词典匹配的BERT-BiLSTM-CRF模型在羊常见疫病文本数据集上的F1值为96.38%,与jieba分词器、BiLSTM-Softmax模型、BiLSTM-CRF模型、未结合词典匹配的本文模型相比,分别提升11.01、10.62、8.3、0.72个百分点,验证了方法的有效性。与单一语料相比,通用语料PKU和羊常见疫病文本数据集结合的混合语料,能够同时对畜禽疫病专业术语及疫病文本中常用词进行准确切分,在通用语料及疫病文本数据集上F1值都达到95%以上,具有较好的模型泛化能力。该方法可用于畜禽疫病文本分词。