期刊文献+
共找到11,930篇文章
< 1 2 250 >
每页显示 20 50 100
Sentence及其结构研究——以古典主义时期作品为例
1
作者 符方泽 杨伟杰 《北方音乐》 2024年第1期131-140,共10页
文章以sentence为研究对象,一方面对国内外曲式学教材及相关文献展开概念比较、辨析与理论梳理,阐明其基本概念与内部结构;另一方面则以古典主义时期作品为分析实例,讨论这种特定主题(句法)的“结构范型”、并进一步对其“结构变形”的... 文章以sentence为研究对象,一方面对国内外曲式学教材及相关文献展开概念比较、辨析与理论梳理,阐明其基本概念与内部结构;另一方面则以古典主义时期作品为分析实例,讨论这种特定主题(句法)的“结构范型”、并进一步对其“结构变形”的情况进行归类。 展开更多
关键词 sentence 古典风格 范型 变形 陈述短句 延续短句
下载PDF
基于Sentence-BERT的专利技术主题聚类研究——以人工智能领域为例 被引量:3
2
作者 阮光册 周萌葳 《情报杂志》 CSSCI 北大核心 2024年第2期110-117,共8页
[研究目的]将Sentence-BERT模型应用于专利技术主题聚类,解决专利文献为突出新颖性,常使用独特技术术语造成词汇向量语义特征稀疏的问题。[研究方法]以人工智能领域2015年-2019年的22370篇专利为实验数据。首先,采用Sentence-BERT算法... [研究目的]将Sentence-BERT模型应用于专利技术主题聚类,解决专利文献为突出新颖性,常使用独特技术术语造成词汇向量语义特征稀疏的问题。[研究方法]以人工智能领域2015年-2019年的22370篇专利为实验数据。首先,采用Sentence-BERT算法对专利文献摘要文本进行向量化表示;其次,对向量化矩阵进行数据降维,利用HDBSCAN方式寻找原始数据中的高密度簇;最后,识别类簇文本集合中的主题特征,并完成主题呈现。[研究结论]对比LDA主题模型、K-means、doc2vec等方法,本文的实验结果提高了主题划分的细粒度和精确度,获得了较好的主题一致性。如何采用fine-tune策略进一步提升模型的效果,是未来该方法进一步深入探索的方向。 展开更多
关键词 sentence-BERT 专利文本 主题识别 文本聚类
下载PDF
Review of Research on English Translation of Chinese Running Sentences
3
作者 ZHANG Wen-hui 《Journal of Literature and Art Studies》 2024年第7期624-627,共4页
In order to convey complete meanings,there is a phenomenon in Chinese of using multiple running sentences.Xu Jingning(2023,p.66)states,“In communication,a complete expression of meaning often requires more than one c... In order to convey complete meanings,there is a phenomenon in Chinese of using multiple running sentences.Xu Jingning(2023,p.66)states,“In communication,a complete expression of meaning often requires more than one clause,which is common in human languages.”Domestic research on running sentences includes discussions on defining the concept and structural features of running sentences,sentence properties,sentence pattern classifications and their criteria,as well as issues related to translating running sentences into English.This article primarily focuses on scholarly research into the English translation of running sentences in China,highlighting recent achievements and identifying existing issues in the study of running sentence translation.However,by reviewing literature on the translation of running sentences,it is found that current research in the academic community on non-core running sentences is limited.Therefore,this paper proposes relevant strategies to address this issue. 展开更多
关键词 Chinese running sentences TOPICS English-Chinese translation
下载PDF
Classification of Conversational Sentences Using an Ensemble Pre-Trained Language Model with the Fine-Tuned Parameter
4
作者 R.Sujatha K.Nimala 《Computers, Materials & Continua》 SCIE EI 2024年第2期1669-1686,共18页
Sentence classification is the process of categorizing a sentence based on the context of the sentence.Sentence categorization requires more semantic highlights than other tasks,such as dependence parsing,which requir... Sentence classification is the process of categorizing a sentence based on the context of the sentence.Sentence categorization requires more semantic highlights than other tasks,such as dependence parsing,which requires more syntactic elements.Most existing strategies focus on the general semantics of a conversation without involving the context of the sentence,recognizing the progress and comparing impacts.An ensemble pre-trained language model was taken up here to classify the conversation sentences from the conversation corpus.The conversational sentences are classified into four categories:information,question,directive,and commission.These classification label sequences are for analyzing the conversation progress and predicting the pecking order of the conversation.Ensemble of Bidirectional Encoder for Representation of Transformer(BERT),Robustly Optimized BERT pretraining Approach(RoBERTa),Generative Pre-Trained Transformer(GPT),DistilBERT and Generalized Autoregressive Pretraining for Language Understanding(XLNet)models are trained on conversation corpus with hyperparameters.Hyperparameter tuning approach is carried out for better performance on sentence classification.This Ensemble of Pre-trained Language Models with a Hyperparameter Tuning(EPLM-HT)system is trained on an annotated conversation dataset.The proposed approach outperformed compared to the base BERT,GPT,DistilBERT and XLNet transformer models.The proposed ensemble model with the fine-tuned parameters achieved an F1_score of 0.88. 展开更多
关键词 Bidirectional encoder for representation of transformer conversation ensemble model fine-tuning generalized autoregressive pretraining for language understanding generative pre-trained transformer hyperparameter tuning natural language processing robustly optimized BERT pretraining approach sentence classification transformer models
下载PDF
融合Sentence-BERT和LDA的评论文本主题识别 被引量:11
5
作者 阮光册 黄韵莹 《现代情报》 CSSCI 2023年第5期46-53,共8页
[目的/意义]为了解决评论文本主题识别时语义描述不充分以及学习到的主题语义连贯性不强等问题。本文将Sentence-BERT句子嵌入模型和LDA模型相结合,提升评论文本主题的语义性。[方法/过程]采用Sentence-BERT模型获取评论文本句子层面的... [目的/意义]为了解决评论文本主题识别时语义描述不充分以及学习到的主题语义连贯性不强等问题。本文将Sentence-BERT句子嵌入模型和LDA模型相结合,提升评论文本主题的语义性。[方法/过程]采用Sentence-BERT模型获取评论文本句子层面的向量特征,同时,采用LDA模型获取评论文本的概率主题向量,随后使用自动编码器连接两组向量,运用K-means算法对潜在空间向量进行聚类,从类簇中获取上下文主题信息。[结果/结论]通过对评论文本数据集的实验,本文方法可以较好地获得具有语义信息的主题词。Sentence-BERT模型与LDA结合,增加了模型的复杂性。通过对比,本文方法获得的主题一致性指标(Coherence)优于目前常见的评论文本主题识别方法。 展开更多
关键词 sentence-BERT LDA模型 评论文本 主题识别
下载PDF
Next Words Prediction and Sentence Completion in Bangla Language Using GRU-Based RNN on N-Gram Language Model
6
作者 Afranul Hoque Busrat Jahan +3 位作者 Shaikat Chandra Paul Zinat Ara Zabu Rakhi Mondal Papeya Akter 《Journal of Data Analysis and Information Processing》 2023年第4期388-399,共12页
We use a lot of devices in our daily life to communicate with others. In this modern world, people use email, Facebook, Twitter, and many other social network sites for exchanging information. People lose their valuab... We use a lot of devices in our daily life to communicate with others. In this modern world, people use email, Facebook, Twitter, and many other social network sites for exchanging information. People lose their valuable time misspelling and retyping, and some people are not happy to type large sentences because they face unnecessary words or grammatical issues. So, for this reason, word predictive systems help to exchange textual information more quickly, easier, and comfortably for all people. These systems predict the next most probable words and give users to choose of the needed word from these suggested words. Word prediction can help the writer by predicting the next word and helping complete the sentence correctly. This research aims to forecast the most suitable next word to complete a sentence for any given context. In this research, we have worked on the Bangla language. We have presented a process that can expect the next maximum probable and proper words and suggest a complete sentence using predicted words. In this research, GRU-based RNN has been used on the N-gram dataset to develop the proposed model. We collected a large dataset using multiple sources in the Bangla language and also compared it to the other approaches that have been used such as LSTM, and Naive Bayes. But this suggested approach provides excellent exactness than others. Here, the Unigram model provides 88.22%, Bi-gram model is 99.24%, Tri-gram model is 97.69%, and 4-gram and 5-gram models provide 99.43% and 99.78% on average accurateness. We think that our proposed method profound impression on Bangla search engines. 展开更多
关键词 Bangla Language Words Prediction sentence Completion GRU RNN Corpus N-Gram
下载PDF
A Sentence Retrieval Generation Network Guided Video Captioning
7
作者 Ou Ye Mimi Wang +3 位作者 Zhenhua Yu Yan Fu Shun Yi Jun Deng 《Computers, Materials & Continua》 SCIE EI 2023年第6期5675-5696,共22页
Currently,the video captioning models based on an encoder-decoder mainly rely on a single video input source.The contents of video captioning are limited since few studies employed external corpus information to guide... Currently,the video captioning models based on an encoder-decoder mainly rely on a single video input source.The contents of video captioning are limited since few studies employed external corpus information to guide the generation of video captioning,which is not conducive to the accurate descrip-tion and understanding of video content.To address this issue,a novel video captioning method guided by a sentence retrieval generation network(ED-SRG)is proposed in this paper.First,a ResNeXt network model,an efficient convolutional network for online video understanding(ECO)model,and a long short-term memory(LSTM)network model are integrated to construct an encoder-decoder,which is utilized to extract the 2D features,3D features,and object features of video data respectively.These features are decoded to generate textual sentences that conform to video content for sentence retrieval.Then,a sentence-transformer network model is employed to retrieve different sentences in an external corpus that are semantically similar to the above textual sentences.The candidate sentences are screened out through similarity measurement.Finally,a novel GPT-2 network model is constructed based on GPT-2 network structure.The model introduces a designed random selector to randomly select predicted words with a high probability in the corpus,which is used to guide and generate textual sentences that are more in line with human natural language expressions.The proposed method in this paper is compared with several existing works by experiments.The results show that the indicators BLEU-4,CIDEr,ROUGE_L,and METEOR are improved by 3.1%,1.3%,0.3%,and 1.5%on a public dataset MSVD and 1.3%,0.5%,0.2%,1.9%on a public dataset MSR-VTT respectively.It can be seen that the proposed method in this paper can generate video captioning with richer semantics than several state-of-the-art approaches. 展开更多
关键词 Video captioning encoder-decoder sentence retrieval external corpus RS GPT-2 network model
下载PDF
A Study of Nominal Predicate Sentences Under the Framework of Cognitive Grammar
8
作者 ZOU Wen-jie GAO Wen-cheng 《Journal of Literature and Art Studies》 2023年第9期704-707,共4页
Cognitive grammar,as a linguistic theory that attaches importance to the relationship between language and thinking,provides us with a more comprehensive way to understand the structure,semantics and cognitive process... Cognitive grammar,as a linguistic theory that attaches importance to the relationship between language and thinking,provides us with a more comprehensive way to understand the structure,semantics and cognitive processing of noun predicate sentences.Therefore,under the framework of cognitive grammar,this paper tries to analyze the semantic connection and cognitive process in noun predicate sentences from the semantic perspective and the method of example theory,and discusses the motivation of the formation of this construction,so as to provide references for in-depth analysis of the cognitive laws behind noun predicate sentences. 展开更多
关键词 nominal predicate sentences cognitive grammar SEMANTICS
下载PDF
椎间孔镜BEIS技术联合甲钴胺治疗腰椎间盘突出症患者的临床效果
9
作者 刘二涛 秦晓东 +2 位作者 郭张建 丁正美 朱小建 《大医生》 2024年第21期9-12,共4页
目的探究椎间孔镜BEIS技术联合甲钴胺对腰椎间盘突出症患者的效果,为临床治疗提供参考。方法选取2021年2月至2023年2月南通市第四人民医院收治的100例腰椎间盘突出症患者为研究对象,按照随机数字表法分为对照组和观察组,各50例。对照组... 目的探究椎间孔镜BEIS技术联合甲钴胺对腰椎间盘突出症患者的效果,为临床治疗提供参考。方法选取2021年2月至2023年2月南通市第四人民医院收治的100例腰椎间盘突出症患者为研究对象,按照随机数字表法分为对照组和观察组,各50例。对照组患者接受椎板开窗术治疗,观察组患者接受椎间孔镜BEIS技术联合甲钴胺治疗。比较两组患者视觉模拟量表(VAS)疼痛评分、日本骨科协会(JOA)评分、Oswestry功能障碍指数问卷表(ODI)评分、炎症因子[C反应蛋白(CRP)、红细胞沉降率(ESR)]水平、相关恢复指标。结果两组患者VAS疼痛评分具有时间、组间、交互效应差异(F_(时间)=23.469,P时间<0.05;F_(组间)=19.861,P_(组间)<0.05;F_(交互)=10.256,P_(交互)<0.05)。两组患者术前及术后24 h、7 d、1个月、3个月的VAS疼痛评分均呈下降趋势,且观察组术后24 h~3个月的VAS疼痛评分均低于对照组(均P<0.05)。术后3个月,两组患者JOA评分均升高,且观察组高于对照组;两者患者ODI评分均降低,且观察组低于对照组(均P<0.05)。两组患者炎症因子水平均具有时间、组间、交互效应差异(CRP:F_(时间)=21.143,P时间<0.05;F_(组间)=18.251,P_(组间)<0.05;F_(交互)=24.452,P_(交互)<0.05。ESR:F_(时间)=21.127,P时间<0.05;F_(组间)=17.341,P_(组间)<0.05;F_(交互)=11.243,P_(交互)<0.05)。两组患者术前及术后7 d、3个月CRP水平及ESR均呈下降趋势,且观察组术后7 d~3个月均低于对照组(均P<0.05)。观察组患者独立行走时间、日常活动恢复时间、社会活动恢复时间均短于对照组(均P<0.05)。结论椎间孔镜BEIS技术联合甲钴胺对腰椎间盘突出症患者的临床效果较好,可改善缓解患者疼痛,促进腰椎功能的恢复,抑制炎症反应,缩短恢复时间,值得临床应用。 展开更多
关键词 椎间孔镜beiS技术 甲钴胺 腰椎间盘突出症
下载PDF
椎间孔镜BEIS技术治疗腰椎间盘突出症的临床研究
10
作者 刘兆斌 刘元涛 +2 位作者 杨倩倩 王绍飞 蒋海 《临床研究》 2024年第6期21-25,共5页
目的探讨椎间孔镜宽视野、简易、即刻手术(BEIS)技术治疗腰椎间盘突出症(LDH)的临床优势。方法采用随机数字表法将2015年1月至2022年12月临沂市中心医院疼痛科收治的400例LDH患者分为两组,各200例,对照组予传统椎板开窗减压髓核摘除术治... 目的探讨椎间孔镜宽视野、简易、即刻手术(BEIS)技术治疗腰椎间盘突出症(LDH)的临床优势。方法采用随机数字表法将2015年1月至2022年12月临沂市中心医院疼痛科收治的400例LDH患者分为两组,各200例,对照组予传统椎板开窗减压髓核摘除术治疗,研究组予椎间孔镜BEIS技术治疗,观察两组手术时间、术中出血量、术后首次下床活动时间、住院时间、并发症发生率及复发率;并比较手术前后两组患者视觉模拟评分法(VAS)、日本骨科协会(JOA)评分标准评定脊柱功能、Oswestry功能障碍指数(ODI),半年后使用改良MacNab标准评价疗效。结果研究组手术时间和术后首次下床活动时间、住院时间短于对照组,术中出血量少于对照组,差异有统计学意义(P<0.05);研究组术后并发症发生率低于对照组,差异有统计学意义(P<0.05),但两组随访复发率差异无统计学意义(P>0.05);两组术前VAS评分差异无统计学意义(P>0.05),研究组术后24 h、术后48 h、术后1周时VAS评分低于对照组,差异有统计学意义(P<0.05);组间术前JOA评分与ODI评分差异无统计学意义(P>0.05);两组术后JOA评分与ODI评分较术前均有改善,其中研究组JOA评分高于对照组,ODI评分低于对照组,差异有统计学意义(P<0.05);研究组治疗有效率(94.27%)高于对照组(85.13%),差异有统计学意义(P<0.05)。结论椎间孔镜BEIS技术是治疗LDH的有效术式,不仅疗效满意,且安全性高,术后恢复快,值得临床应用。 展开更多
关键词 椎间孔镜beiS技术 腰椎间盘突出症 并发症
下载PDF
基于BEI技术的孔院教师胜任力模型构建
11
作者 王珺琪 《新潮电子》 2024年第10期253-255,共3页
孔院作为国际中文教育龙头品牌,自然肩负重要的使命,孔院的教师更是推动孔院未来发展的中坚力量。结合时代背景需求,本文基于胜任力素质理论,从五洲分别选取一个国家,将加拿大、西班牙、韩国、新西兰与佛得角五国孔院教师胜任力作为研... 孔院作为国际中文教育龙头品牌,自然肩负重要的使命,孔院的教师更是推动孔院未来发展的中坚力量。结合时代背景需求,本文基于胜任力素质理论,从五洲分别选取一个国家,将加拿大、西班牙、韩国、新西兰与佛得角五国孔院教师胜任力作为研究对象,采用BEI访谈法提取孔院教师胜任力特征要素,最终构建孔院教师胜任力词典模型。 展开更多
关键词 bei访谈法 孔子学院教师 胜任力模型
下载PDF
Learn to Speak 100 Chinese Sentences(Ⅹ):Ⅹ See a performance of Beijing opera:
12
《China & The World Cultural Exchange》 1997年第4期44-47,共4页
关键词 Learn to Speak 100 Chinese sentences See a performance of beijing opera
下载PDF
汉语的“句子”与英语的sentence 被引量:20
13
作者 姜望琪 《解放军外国语学院学报》 北大核心 2005年第1期10-15,共6页
汉语的"句子"不等于英语的sentence,它更像utterance。在以英语研究为代表的西方语言研究中,sentence是一个抽象单位。而汉语研究一向注重实际使用的单位,忽视抽象单位,特别是对相当于sentence这一级的研究开始较晚,以至汉语... 汉语的"句子"不等于英语的sentence,它更像utterance。在以英语研究为代表的西方语言研究中,sentence是一个抽象单位。而汉语研究一向注重实际使用的单位,忽视抽象单位,特别是对相当于sentence这一级的研究开始较晚,以至汉语的"句子"至今仍是一个具体单位,或称"动态单位"。跟sentence相当的汉语单位是"词组",不是"句子"。"词组"或称"短语"是汉语最大的结构单位、语法单位或"静态单位"。 展开更多
关键词 句子 话语 动态单位 静态单位
下载PDF
Product image sentence annotation based on kernel descriptors and tag-rank
14
作者 张红斌 姬东鸿 +2 位作者 尹兰 任亚峰 殷依 《Journal of Southeast University(English Edition)》 EI CAS 2016年第2期170-176,共7页
Dealing with issues such as too simple image features and word noise inference in product image sentence anmotation, a product image sentence annotation model focusing on image feature learning and key words summariza... Dealing with issues such as too simple image features and word noise inference in product image sentence anmotation, a product image sentence annotation model focusing on image feature learning and key words summarization is described. Three kernel descriptors such as gradient, shape, and color are extracted, respectively. Feature late-fusion is executed in turn by the multiple kernel learning model to obtain more discriminant image features. Absolute rank and relative rank of the tag-rank model are used to boost the key words' weights. A new word integration algorithm named word sequence blocks building (WSBB) is designed to create N-gram word sequences. Sentences are generated according to the N-gram word sequences and predefined templates. Experimental results show that both the BLEU-1 scores and BLEU-2 scores of the sentences are superior to those of the state-of-art baselines. 展开更多
关键词 product image sentence annotation kerneldescriptors tag-rank word sequence blocks building(WSBB) N-gram word sequences
下载PDF
基于Sentence-BERT语义表示的咨询问题提示列表自动构建方法研究——以糖尿病咨询为例 被引量:14
15
作者 唐晓波 刘亚岚 《现代情报》 CSSCI 2021年第8期3-15,共13页
[目的/意义]咨询问题提示列表能引导咨询者在智能问答和智能咨询系统进行咨询并为动态咨询引导提供基础。目前,关于问题提示列表构建的研究大多采用专家咨询法、访谈法,这些方法无法满足智能咨询服务要求,本文以有问必答网中糖尿病问答... [目的/意义]咨询问题提示列表能引导咨询者在智能问答和智能咨询系统进行咨询并为动态咨询引导提供基础。目前,关于问题提示列表构建的研究大多采用专家咨询法、访谈法,这些方法无法满足智能咨询服务要求,本文以有问必答网中糖尿病问答为例,提出了基于Sentence-BERT语义表示的咨询问题提示列表自动构建模型。[方法/过程]本文首先在糖尿病相关文献调查和分析的基础上确定糖尿病类目体系,并人工标注咨询问题类别;其次使用LDA模型对每类问题集进行主题聚类;然后各主题下通过Sentence-BERT预训练模型进行问题语义表示,textRank算法计算问题重要性并排序;最终冗余处理后构建出咨询问题提示列表。[结果/结论]实验结果表明,本文提出的模型能有效构建出信息质量较高的、内容丰富的咨询问题提示列表,对咨询引导有促进作用。 展开更多
关键词 问题提示列表 智能问答 智能咨询 问答社区 糖尿病咨询 LDA sentence-BERT
下载PDF
Rethinking English periodic sentence
16
作者 龙艳 《Sino-US English Teaching》 2008年第7期40-43,共4页
Periodic sentence is often identified as sentence with main clause as end-weight. In fact, this definition is so confusing that it causes the same confusion in practice. This paper aims at rethinking periodic sentence... Periodic sentence is often identified as sentence with main clause as end-weight. In fact, this definition is so confusing that it causes the same confusion in practice. This paper aims at rethinking periodic sentence and advocates the adoption of noble styles with periodic sentence as its chief representative. 展开更多
关键词 periodic sentence classical and modem end-weight sentence noble styles
下载PDF
On Sentence Complexity in THE TIMES: A Comparative Study of SentenceLength and Sentence Complexity in the News Section and the Sports Section
17
作者 赤列德吉 《海外英语》 2014年第15期3-4,共2页
My investigation will serve two purposes. First, I shall investigate the function of the subclauses in the corpus in relation to their complexity, and I shall establish whether there is a correlation between sentence ... My investigation will serve two purposes. First, I shall investigate the function of the subclauses in the corpus in relation to their complexity, and I shall establish whether there is a correlation between sentence length and sentence complexity.Second, I shall analyse the complexity of the subclauses collected from the two sections and compare the results from these sections, focusing on finite subclauses and non-finite subclauses. I hope to be able to point out some differences in style between the news and sports sections concerning the use of subordinate clauses in various syntactic functions in order to examine how the choice of linguistic structures differs in different sections of The Times. 展开更多
关键词 sentence length sentence COMPLEXITY style MARKER t
下载PDF
Developing Sentence Sense
18
作者 霍鑫红 商洋 《海外英语》 2018年第6期212-213,共2页
There is a big problem in understanding long sentences which are complex and complicated in English for many people.The desire seems obvious that people have difficulties using a complete sentence. So this paper is to... There is a big problem in understanding long sentences which are complex and complicated in English for many people.The desire seems obvious that people have difficulties using a complete sentence. So this paper is to solve the problems mentionedby developing sentence sense and a chart with much help. 展开更多
关键词 sentence sense finite verbs non-finite verbs CHART
下载PDF
基于Sentence-Rank的图像句子标注
19
作者 徐守坤 徐坚 +2 位作者 李宁 周佳 刘楚秋 《计算机工程与应用》 CSCD 北大核心 2019年第2期121-127,共7页
传统的图像语义句子标注是利用句子模板完成对图像内容描述,但其标注句子很难做到符合语言逻辑。针对这一问题,提出基于统计思想从语料库中选出一条最优的句子来描述图像内容,设计以N-gram算法为主要思想的Sentence-Rank算法生成标注句... 传统的图像语义句子标注是利用句子模板完成对图像内容描述,但其标注句子很难做到符合语言逻辑。针对这一问题,提出基于统计思想从语料库中选出一条最优的句子来描述图像内容,设计以N-gram算法为主要思想的Sentence-Rank算法生成标注句子。首先执行机器视觉特征学习,选择标注性能最好的HSV-LBP-HOG融合特征完成图像分类,获得图像标注关键词。然后,利用字符串匹配算法从语料库中列出包含所有标注关键词的句子,并将得到的句子通过Sentence-Rank算法进行价值排序,选取评分最高的句子描述图像。实验结果表明,该方法得到的标注句子具有较低的困惑度,较好地解决了句子的语言逻辑问题。 展开更多
关键词 机器学习 自然语言处理 特征融合 sentence-Rank N-GRAM
下载PDF
Masked Sentence Model Based on BERT for Move Recognition in Medical Scientific Abstracts 被引量:20
20
作者 Gaihong Yu Zhixiong Zhang +1 位作者 Huan Liu Liangping Ding 《Journal of Data and Information Science》 CSCD 2019年第4期42-55,共14页
Purpose:Mo ve recognition in scientific abstracts is an NLP task of classifying sentences of the abstracts into different types of language units.To improve the performance of move recognition in scientific abstracts,... Purpose:Mo ve recognition in scientific abstracts is an NLP task of classifying sentences of the abstracts into different types of language units.To improve the performance of move recognition in scientific abstracts,a novel model of move recognition is proposed that outperforms the BERT-based method.Design/methodology/approach:Prevalent models based on BERT for sentence classification often classify sentences without considering the context of the sentences.In this paper,inspired by the BERT masked language model(MLM),we propose a novel model called the masked sentence model that integrates the content and contextual information of the sentences in move recognition.Experiments are conducted on the benchmark dataset PubMed 20K RCT in three steps.Then,we compare our model with HSLN-RNN,BERT-based and SciBERT using the same dataset.Findings:Compared with the BERT-based and SciBERT models,the F1 score of our model outperforms them by 4.96%and 4.34%,respectively,which shows the feasibility and effectiveness of the novel model and the result of our model comes closest to the state-of-theart results of HSLN-RNN at present.Research limitations:The sequential features of move labels are not considered,which might be one of the reasons why HSLN-RNN has better performance.Our model is restricted to dealing with biomedical English literature because we use a dataset from PubMed,which is a typical biomedical database,to fine-tune our model.Practical implications:The proposed model is better and simpler in identifying move structures in scientific abstracts and is worthy of text classification experiments for capturing contextual features of sentences.Originality/value:T he study proposes a masked sentence model based on BERT that considers the contextual features of the sentences in abstracts in a new way.The performance of this classification model is significantly improved by rebuilding the input layer without changing the structure of neural networks. 展开更多
关键词 Move recognition BERT Masked sentence model Scientific abstracts
下载PDF
上一页 1 2 250 下一页 到第
使用帮助 返回顶部