18,690 articles found
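A Study of the Sentence and Its Structure: Taking Works of the Classical Period as Examples
1
Authors: 符方泽, 杨伟杰 《北方音乐》 2024, Issue 1, pp. 131-140 (10 pages)
Taking the sentence as its object of study, this article first compares, distinguishes, and reviews the concept as treated in Chinese and international textbooks on musical form and in related literature, clarifying its basic definition and internal structure. It then uses works of the Classical period as analytical examples to discuss the "structural paradigm" of this particular theme type (phrase structure) and goes on to classify its "structural deformations".
Keywords: sentence; Classical style; paradigm; deformation; presentation phrase; continuation phrase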
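Patent Technology Topic Clustering Based on Sentence-BERT: The Case of Artificial Intelligence  Cited by: 2
2
Authors: 阮光册, 周萌葳 《情报杂志》 PKU Core 2024, Issue 2, pp. 110-117 (8 pages)
[Research purpose] To apply the Sentence-BERT model to the clustering of patent technology topics, addressing the sparsity of semantic features in word vectors that arises because patent documents, to emphasize novelty, often use idiosyncratic technical terms. [Research method] The experimental data are 22,370 patents in the field of artificial intelligence from 2015 to 2019. First, the Sentence-BERT algorithm is used to produce vector representations of the patent abstracts; next, the dimensionality of the vector matrix is reduced and HDBSCAN is used to find high-density clusters in the original data; finally, topic features are identified within each cluster's text collection and the topics are presented. [Research conclusion] Compared with the LDA topic model, K-means, doc2vec, and other methods, the experimental results improve the granularity and precision of topic partitioning and achieve good topic coherence. How to use a fine-tuning strategy to further improve the model is a direction for future work on this method.
Keywords: Sentence-BERT; patent text; topic identification; text clustering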
Review of Research on English Translation of Chinese Running Sentences
3
Author: ZHANG Wen-hui 《Journal of Literature and Art Studies》 2024, Issue 7, pp. 624-627 (4 pages)
To convey a complete meaning, Chinese often uses a series of running sentences. Xu Jingning (2023, p. 66) states, "In communication, a complete expression of meaning often requires more than one clause, which is common in human languages." Domestic research on running sentences includes discussions on defining the concept and structural features of running sentences, sentence properties, sentence pattern classifications and their criteria, as well as issues related to translating running sentences into English. This article primarily focuses on scholarly research into the English translation of running sentences in China, highlighting recent achievements and identifying existing issues in the study of running sentence translation. A review of the literature on translating running sentences shows, however, that current academic research on non-core running sentences is limited. This paper therefore proposes relevant strategies to address this issue.
Keywords: Chinese running sentences; topics; English-Chinese translation
Classification of Conversational Sentences Using an Ensemble Pre-Trained Language Model with the Fine-Tuned Parameter
4
Authors: R. Sujatha, K. Nimala 《Computers, Materials & Continua》 SCIE EI 2024, Issue 2, pp. 1669-1686 (18 pages)
Sentence classification is the process of categorizing a sentence based on the context of the sentence. Sentence categorization requires more semantic highlights than other tasks, such as dependence parsing, which requires more syntactic elements. Most existing strategies focus on the general semantics of a conversation without involving the context of the sentence, recognizing the progress and comparing impacts. An ensemble pre-trained language model was taken up here to classify the conversation sentences from the conversation corpus. The conversational sentences are classified into four categories: information, question, directive, and commission. These classification label sequences are for analyzing the conversation progress and predicting the pecking order of the conversation. An ensemble of Bidirectional Encoder for Representation of Transformer (BERT), Robustly Optimized BERT pretraining Approach (RoBERTa), Generative Pre-Trained Transformer (GPT), DistilBERT, and Generalized Autoregressive Pretraining for Language Understanding (XLNet) models is trained on the conversation corpus with hyperparameters. Hyperparameter tuning is carried out for better performance on sentence classification. This Ensemble of Pre-trained Language Models with Hyperparameter Tuning (EPLM-HT) system is trained on an annotated conversation dataset. The proposed approach outperformed the base BERT, GPT, DistilBERT, and XLNet transformer models. The proposed ensemble model with the fine-tuned parameters achieved an F1 score of 0.88.
Keywords: Bidirectional encoder for representation of transformer; conversation; ensemble model; fine-tuning; generalized autoregressive pretraining for language understanding; generative pre-trained transformer; hyperparameter tuning; natural language processing; robustly optimized BERT pretraining approach; sentence classification; transformer models
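Topic Identification in Review Texts by Fusing Sentence-BERT and LDA  Cited by: 6
5
Authors: 阮光册, 黄韵莹 《现代情报》 2023, Issue 5, pp. 46-53 (8 pages)
[Purpose/significance] To address insufficient semantic description and weak semantic coherence of learned topics in topic identification for review texts, this paper combines the Sentence-BERT sentence embedding model with the LDA model to improve the semantic quality of review-text topics. [Method/process] Sentence-BERT is used to obtain sentence-level vector features of the review texts, while LDA is used to obtain their probabilistic topic vectors. An autoencoder then joins the two sets of vectors, K-means clusters the resulting latent-space vectors, and contextual topic information is extracted from the clusters. [Result/conclusion] Experiments on a review-text dataset show that the method obtains topic words that carry semantic information well. Combining Sentence-BERT with LDA increases model complexity. In comparison, the topic coherence metric (Coherence) obtained by the method is better than that of currently common methods for review-text topic identification.
Keywords: Sentence-BERT; LDA model; review text; topic identification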
Markedness and UG in Chinese Children's Acquisition of One-word and Negative Sentences  Cited by: 1
6
Authors: Yu Shanzhi (Department of Foreign Languages, Henan University, Kaifeng 475001, P. R. China, sZyu@mail.henu.edu.cn); Zhang Xinhong (Faculty of English Language and Culture, Guangdong University of Foreign Studies, Guangzhou 510420, P. R. China, bbjohnson@163.net) 《现代外语》 CSSCI PKU Core 1999, Issue 4, pp. 379-381 (3 pages)
The present study is an investigation and analysis of the relationship between markedness and first language acquisition sequence, as shown in the cases of one-word and negative sentences. Here our objectives are to argue for the priority of unmarkedness over markedness in the acquisition sequ...
Keywords: markedness; UG; acquisition; one-word sentence; negative sentence
Next Words Prediction and Sentence Completion in Bangla Language Using GRU-Based RNN on N-Gram Language Model
7
Authors: Afranul Hoque, Busrat Jahan, Shaikat Chandra Paul, Zinat Ara Zabu, Rakhi Mondal, Papeya Akter 《Journal of Data Analysis and Information Processing》 2023, Issue 4, pp. 388-399 (12 pages)
We use a lot of devices in our daily life to communicate with others. In this modern world, people use email, Facebook, Twitter, and many other social network sites for exchanging information. People lose valuable time misspelling and retyping, and some people are not happy to type long sentences because they run into unnecessary words or grammatical issues. For this reason, word prediction systems help all people exchange textual information more quickly, easily, and comfortably. These systems predict the most probable next words and let users choose the needed word from the suggestions. Word prediction can help the writer by predicting the next word and helping complete the sentence correctly. This research aims to forecast the most suitable next word to complete a sentence for any given context. In this research, we have worked on the Bangla language. We present a process that can predict the most probable and appropriate next words and suggest a complete sentence using the predicted words. A GRU-based RNN has been used on the N-gram dataset to develop the proposed model. We collected a large dataset from multiple sources in the Bangla language and compared the model with other approaches that have been used, such as LSTM and Naive Bayes. The proposed approach provides better accuracy than the others. The unigram, bigram, trigram, 4-gram, and 5-gram models achieve average accuracies of 88.22%, 99.24%, 97.69%, 99.43%, and 99.78%, respectively. We think the proposed method could have a profound impact on Bangla search engines.
Keywords: Bangla language; words prediction; sentence completion; GRU; RNN; corpus; N-gram
A Sentence Retrieval Generation Network Guided Video Captioning
8
Authors: Ou Ye, Mimi Wang, Zhenhua Yu, Yan Fu, Shun Yi, Jun Deng 《Computers, Materials & Continua》 SCIE EI 2023, Issue 6, pp. 5675-5696 (22 pages)
Currently, video captioning models based on an encoder-decoder mainly rely on a single video input source. The contents of video captioning are limited since few studies have employed external corpus information to guide the generation of video captioning, which is not conducive to the accurate description and understanding of video content. To address this issue, a novel video captioning method guided by a sentence retrieval generation network (ED-SRG) is proposed in this paper. First, a ResNeXt network model, an efficient convolutional network for online video understanding (ECO) model, and a long short-term memory (LSTM) network model are integrated to construct an encoder-decoder, which is utilized to extract the 2D features, 3D features, and object features of video data respectively. These features are decoded to generate textual sentences that conform to video content for sentence retrieval. Then, a sentence-transformer network model is employed to retrieve different sentences in an external corpus that are semantically similar to the above textual sentences. The candidate sentences are screened out through similarity measurement. Finally, a novel GPT-2 network model is constructed based on the GPT-2 network structure. The model introduces a designed random selector to randomly select predicted words with a high probability in the corpus, which is used to guide and generate textual sentences that are more in line with human natural language expressions. The proposed method is compared with several existing works by experiments. The results show that the indicators BLEU-4, CIDEr, ROUGE_L, and METEOR are improved by 3.1%, 1.3%, 0.3%, and 1.5% on the public dataset MSVD and by 1.3%, 0.5%, 0.2%, and 1.9% on the public dataset MSR-VTT, respectively. It can be seen that the proposed method can generate video captioning with richer semantics than several state-of-the-art approaches.
Keywords: video captioning; encoder-decoder; sentence retrieval; external corpus; RS; GPT-2 network model
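Research Progress on the Functions of 2'-Fucosyllactose and the Construction of Microbial Production Strains
9
Authors: 刘琳, 赵藤, 高俊哲, 李俊众, 孙雪, 宗剑飞, 刘逸寒, 李玉, 李庆刚 《食品研究与开发》 CAS 2024, Issue 8, pp. 207-216 (10 pages)
2'-Fucosyllactose (2'-FL) is the most abundant human milk oligosaccharide, accounting for up to 30% of the total. 2'-FL effectively promotes infant brain development and strengthens infant immunity, and it also benefits adolescents, adults, and the elderly; it is already widely used in infant formula, functional foods and beverages, and medical adjuvants. Microbial synthesis of 2'-FL has advantages such as easy scale-up and the ability to use inexpensive raw materials as substrates. At present the main industrial production strain for 2'-FL is Escherichia coli; other food-grade strains such as Saccharomyces cerevisiae and Bacillus subtilis have also been engineered to produce 2'-FL, with some success. This article reviews the mechanisms by which 2'-FL acts on the human body, its fields of application, and the current state of strain construction for 2'-FL synthesis, and discusses future trends.
Keywords: 2'-fucosyllactose; human milk oligosaccharides; metabolic engineering; industrial microorganisms; strain construction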
A Study of Nominal Predicate Sentences Under the Framework of Cognitive Grammar
10
Authors: ZOU Wen-jie, GAO Wen-cheng 《Journal of Literature and Art Studies》 2023, Issue 9, pp. 704-707 (4 pages)
Cognitive grammar, as a linguistic theory that attaches importance to the relationship between language and thinking, provides a more comprehensive way to understand the structure, semantics, and cognitive processing of noun predicate sentences. Under the framework of cognitive grammar, this paper therefore analyzes the semantic connection and cognitive process in noun predicate sentences from a semantic perspective and with the method of example theory, and discusses the motivation for the formation of this construction, so as to provide a reference for in-depth analysis of the cognitive laws behind noun predicate sentences.
Keywords: nominal predicate sentences; cognitive grammar; semantics
Windowing and Gapping in Imperative Sentences: on the Basis of Talmy's "Causal-chain Windowing" Approach  Cited by: 1
11
Author: 吕思琪 《海外英语》 2013, Issue 12X, pp. 237-238, 264 (3 pages)
This paper analyzes the six types of English imperative sentences proposed by Chen (1984) from the perspective of causal-chain windowing. It concludes that Talmy's causal-chain windowing approach, as well as the cognitive underpinnings of causal windowing and gapping, is applicable to English imperative structures, and that, generally speaking, the final portion of an imperative sentence is always windowed while the intermediate portions are gapped.
Keywords: windowing of attention; causal-chain windowing; impe
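Phthalonitrile Resin Modified with 3,3',4,4'-Biphenyltetramine and the Properties of Its Composites
12
Authors: 董俊宇, 赵星诺, 章宇琳, 刘小僮, 王文蓓, 周权, 吴霄 《高分子材料科学与工程》 EI CAS CSCD PKU Core 2024, Issue 2, pp. 65-73 (9 pages)
A modified biphenyl-type phthalonitrile resin prepolymer (BLBS) was prepared by prepolymerizing 3,3',4,4'-biphenyltetramine (LBS) with biphenyl-type phthalonitrile resin (BPh). The curing behavior, rheological properties, and heat resistance of BLBS were studied by infrared spectroscopy, differential scanning calorimetry, rotational rheometry, and thermogravimetric analysis. The results show that BLBS can begin to cure below 240 °C at the lowest, but still requires a higher temperature to cure completely. The temperatures at 5% mass loss (Td5) of cured BLBS-3 in nitrogen and in air were 549.4 °C and 555.6 °C, respectively. The quartz-fiber-reinforced BLBS composite (QF/BLBS) has excellent mechanical properties and heat resistance, with a glass transition temperature (T_g) above 500 °C. Its flexural strength and interlaminar shear strength were 711.5 MPa and 48.5 MPa at room temperature, and 680.5 MPa and 31.3 MPa after heat treatment at 400 °C for 2 h.
Keywords: biphenyl-type phthalonitrile; 3,3',4,4'-biphenyltetramine; heat resistance; mechanical properties; composites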
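Synthesis, Structure, and Magnetic Properties of a Nickel Complex of 3,5-Di(2',5'-dicarboxylphenyl)benzoic Acid
13
Authors: 高玲玲, 张政委, 陈勇强 《广州化工》 CAS 2024, Issue 9, pp. 28-31 (4 pages)
Using the symmetric aromatic polycarboxylic acid 3,5-di(2',5'-dicarboxylphenyl)benzoic acid (H₅L) and the transition metal nickel, a novel metal-organic complex, {Ni(H₃L)(H₂O)}ₙ (1), was designed and synthesized under hydrothermal conditions. Complex 1 was structurally characterized by single-crystal X-ray diffraction, infrared spectroscopy (IR), thermogravimetric analysis (TG), and powder diffraction. Structural analysis shows that 1 is an infinitely extended 1D chain structure based on polynuclear [Ni(μ₂-H₂O)(μ₂-COO)(μ₁-COO)₂]ₙ SBUs, which further stacks into a three-dimensional network through π…π interactions. Magnetic analysis shows antiferromagnetic coupling between the Ni(II) ions in complex 1.
Keywords: 3,5-di(2',5'-dicarboxylphenyl)benzoic acid; metal-organic complex; magnetism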
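Synthesis of 2-Amino-5-fluoro-3',4'-dichlorobiphenyl Using Microchannel Technology
14
Authors: 严泽华, 何星帅, 吴浩, 尹凯 《世界农药》 CAS 2024, Issue 5, pp. 54-60 (7 pages)
2-Amino-5-fluoro-3',4'-dichlorobiphenyl is an important intermediate of bixafen, a succinate dehydrogenase inhibitor (SDHI) fungicide. Using microchannel technology and 3,4-dichloroaniline as the starting material, 2-amino-5-fluoro-3',4'-dichlorobiphenyl was obtained by diazotization, neutralization, and coupling with p-fluoroaniline. The factors affecting the three steps (diazotization, neutralization, and coupling) were examined systematically, giving the following optimized conditions: hydrochloric acid to 3,4-dichloroaniline equivalent ratio 6:1; sodium nitrite to 3,4-dichloroaniline equivalent ratio 1.1:1; salt formation temperature 60 °C; diazotization temperature 25 °C; sodium hydroxide equivalent ratio 6.5:1; neutralization temperature 25 °C; optimal coupling temperature 120 °C; p-fluoroaniline equivalent ratio 10:1. Under these conditions the crude 2-amino-5-fluoro-3',4'-dichlorobiphenyl had a content of 85%, rising to 98.5% after refining, with a total yield of 65% over 2 steps.
Keywords: 2-amino-5-fluoro-3',4'-dichlorobiphenyl; bixafen; microchannel technology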
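The Chinese "句子" (juzi) and the English Sentence  Cited by: 20
15
Author: 姜望琪 《解放军外国语学院学报》 PKU Core 2005, Issue 1, pp. 10-15 (6 pages)
The Chinese "句子" (juzi) is not equivalent to the English sentence; it is closer to the utterance. In Western linguistics, represented by the study of English, the sentence is an abstract unit. Chinese linguistics, by contrast, has always emphasized units of actual use and neglected abstract units. In particular, study at the level corresponding to the sentence began late, so that the Chinese "句子" remains a concrete unit, or "dynamic unit", to this day. The Chinese unit that corresponds to the sentence is the "词组" (cizu, word group), not the "句子". The "词组", also called "短语" (duanyu, phrase), is the largest structural unit, grammatical unit, or "static unit" of Chinese.
Keywords: 句子 (juzi, sentence); utterance; dynamic unit; static unit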
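Automatic Construction of Consultation Question Prompt Lists Based on Sentence-BERT Semantic Representation: The Case of Diabetes Consultation  Cited by: 14
16
Authors: 唐晓波, 刘亚岚 《现代情报》 CSSCI 2021, Issue 8, pp. 3-15 (13 pages)
[Purpose/significance] A consultation question prompt list can guide users as they consult intelligent question-answering and intelligent consultation systems, and it provides a basis for dynamic consultation guidance. Existing research on constructing question prompt lists mostly relies on expert consultation and interviews, which cannot meet the requirements of intelligent consultation services. Taking diabetes questions and answers from the website 有问必答网 as an example, this paper proposes a model for automatically constructing consultation question prompt lists based on Sentence-BERT semantic representation. [Method/process] First, a diabetes category system is established from a survey and analysis of the diabetes literature, and consultation questions are manually labeled by category. Next, the LDA model clusters each category's question set into topics. Then, within each topic, questions are represented semantically with a pre-trained Sentence-BERT model, and the textRank algorithm computes and ranks question importance. Finally, after redundancy removal, the consultation question prompt list is built. [Result/conclusion] Experiments show that the proposed model can effectively build prompt lists with high information quality and rich content, which helps guide consultation.
Keywords: question prompt list; intelligent question answering; intelligent consultation; Q&A community; diabetes consultation; LDA; Sentence-BERT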
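The ranking step described in the entry above (embed each question, then rank by textRank) can be illustrated with a minimal sketch. This is not the authors' implementation: the 2-D vectors are hypothetical stand-ins for Sentence-BERT embeddings, and the function names `cosine` and `textrank` are chosen here for illustration only.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def textrank(vectors, damping=0.85, iterations=50):
    """Weighted PageRank over a similarity graph; returns one score per vector."""
    n = len(vectors)
    # Edge weights: cosine similarity between distinct items, clipped at 0.
    weights = [
        [max(cosine(vectors[i], vectors[j]), 0.0) if i != j else 0.0
         for j in range(n)]
        for i in range(n)
    ]
    out_sums = [sum(row) for row in weights]
    scores = [1.0 / n] * n
    for _ in range(iterations):
        scores = [
            (1 - damping) / n
            + damping * sum(
                weights[j][i] / out_sums[j] * scores[j]
                for j in range(n)
                if out_sums[j] > 0
            )
            for i in range(n)
        ]
    return scores


# Hypothetical embeddings: three similar questions and one outlier.
question_vectors = [[1.0, 0.0], [0.9, 0.1], [0.8, 0.2], [0.0, 1.0]]
scores = textrank(question_vectors)
ranking = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
```

Questions that resemble many others accumulate score, so the outlier ends up at the bottom of `ranking`; in a real pipeline the vectors would come from a sentence embedding model rather than being written by hand.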
On Sentence Complexity in THE TIMES: A Comparative Study of Sentence Length and Sentence Complexity in the News Section and the Sports Section
17
Author: 赤列德吉 《海外英语》 2014, Issue 15, pp. 3-4 (2 pages)
My investigation will serve two purposes. First, I shall investigate the function of the subclauses in the corpus in relation to their complexity, and I shall establish whether there is a correlation between sentence length and sentence complexity. Second, I shall analyse the complexity of the subclauses collected from the two sections and compare the results from these sections, focusing on finite subclauses and non-finite subclauses. I hope to be able to point out some differences in style between the news and sports sections concerning the use of subordinate clauses in various syntactic functions in order to examine how the choice of linguistic structures differs in different sections of The Times.
Keywords: sentence length; sentence complexity; style marker; t
Developing Sentence Sense
18
Authors: 霍鑫红, 商洋 《海外英语》 2018, Issue 6, pp. 212-213 (2 pages)
Many people have great difficulty understanding long, complex English sentences. It also seems evident that people have difficulty using complete sentences. This paper therefore aims to address these problems by developing sentence sense, with the help of a chart.
Keywords: sentence sense; finite verbs; non-finite verbs; chart
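Image Sentence Annotation Based on Sentence-Rank
19
Authors: 徐守坤, 徐坚, 李宁, 周佳, 刘楚秋 《计算机工程与应用》 CSCD PKU Core 2019, Issue 2, pp. 121-127 (7 pages)
Traditional semantic sentence annotation of images describes image content with sentence templates, but the resulting sentences rarely conform to the logic of natural language. To address this, a statistical approach is proposed that selects the single best sentence from a corpus to describe the image content, and a Sentence-Rank algorithm built mainly on the N-gram algorithm is designed to generate the annotation sentence. First, machine-vision feature learning is performed, and the fused HSV-LBP-HOG feature, which gives the best annotation performance, is chosen to classify the image and obtain annotation keywords. Then a string matching algorithm lists the corpus sentences that contain all of the annotation keywords, the Sentence-Rank algorithm ranks these sentences by value, and the highest-scoring sentence is chosen to describe the image. Experimental results show that the annotation sentences obtained have low perplexity and that the method largely resolves the language-logic problem.
Keywords: machine learning; natural language processing; feature fusion; Sentence-Rank; N-gram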
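The selection step in the entry above (keep the corpus sentences that contain every annotation keyword, then prefer the one an N-gram model finds most fluent) can be sketched roughly as follows. This is a simplified illustration, not the paper's Sentence-Rank algorithm: it assumes a bigram model with add-one smoothing, whitespace tokenization, and a tiny made-up corpus, and all function names are hypothetical.

```python
import math
from collections import Counter


def train_bigram(corpus):
    """Count unigrams and bigrams over whitespace-tokenized sentences."""
    unigrams, bigrams = Counter(), Counter()
    for sentence in corpus:
        tokens = ["<s>"] + sentence.lower().split() + ["</s>"]
        unigrams.update(tokens)
        bigrams.update(zip(tokens, tokens[1:]))
    return unigrams, bigrams


def perplexity(sentence, unigrams, bigrams):
    """Bigram perplexity with add-one (Laplace) smoothing."""
    tokens = ["<s>"] + sentence.lower().split() + ["</s>"]
    vocab_size = len(unigrams)
    log_prob = 0.0
    for prev, cur in zip(tokens, tokens[1:]):
        p = (bigrams[(prev, cur)] + 1) / (unigrams[prev] + vocab_size)
        log_prob += math.log(p)
    return math.exp(-log_prob / (len(tokens) - 1))


def rank_candidates(keywords, corpus):
    """Keep sentences containing every keyword, lowest perplexity first."""
    unigrams, bigrams = train_bigram(corpus)
    wanted = {k.lower() for k in keywords}
    candidates = [s for s in corpus if wanted <= set(s.lower().split())]
    return sorted(candidates, key=lambda s: perplexity(s, unigrams, bigrams))


# Toy corpus; the keywords would come from the image classifier.
corpus = [
    "a dog runs on the grass",
    "the dog runs on the grass",
    "grass dog the on runs a",
    "a cat sleeps on the sofa",
]
ranked = rank_candidates(["dog", "grass"], corpus)
```

Lower perplexity means the model finds the word order more typical of the corpus, so the first element of `ranked` plays the role of the chosen annotation sentence in this sketch.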
Masked Sentence Model Based on BERT for Move Recognition in Medical Scientific Abstracts  Cited by: 19
20
Authors: Gaihong Yu, Zhixiong Zhang, Huan Liu, Liangping Ding 《Journal of Data and Information Science》 CSCD 2019, Issue 4, pp. 42-55 (14 pages)
Purpose: Move recognition in scientific abstracts is an NLP task of classifying sentences of the abstracts into different types of language units. To improve the performance of move recognition in scientific abstracts, a novel model of move recognition is proposed that outperforms the BERT-based method. Design/methodology/approach: Prevalent models based on BERT for sentence classification often classify sentences without considering the context of the sentences. In this paper, inspired by the BERT masked language model (MLM), we propose a novel model called the masked sentence model that integrates the content and contextual information of the sentences in move recognition. Experiments are conducted on the benchmark dataset PubMed 20K RCT in three steps. Then, we compare our model with HSLN-RNN, BERT-based, and SciBERT models using the same dataset. Findings: Compared with the BERT-based and SciBERT models, the F1 score of our model is higher by 4.96% and 4.34%, respectively, which shows the feasibility and effectiveness of the novel model, and the result of our model comes closest to the current state-of-the-art results of HSLN-RNN. Research limitations: The sequential features of move labels are not considered, which might be one of the reasons why HSLN-RNN has better performance. Our model is restricted to dealing with biomedical English literature because we use a dataset from PubMed, which is a typical biomedical database, to fine-tune our model. Practical implications: The proposed model is better and simpler in identifying move structures in scientific abstracts and is worthy of text classification experiments for capturing contextual features of sentences. Originality/value: The study proposes a masked sentence model based on BERT that considers the contextual features of the sentences in abstracts in a new way. The performance of this classification model is significantly improved by rebuilding the input layer without changing the structure of neural networks.
Keywords: move recognition; BERT; masked sentence model; scientific abstracts