期刊文献+
共找到4篇文章
< 1 >
每页显示 20 50 100
Masked Sentence Model Based on BERT for Move Recognition in Medical Scientific Abstracts 被引量:16
1
作者 Gaihong Yu Zhixiong Zhang +1 位作者 Huan Liu Liangping Ding 《Journal of Data and Information Science》 CSCD 2019年第4期42-55,共14页
Purpose:Mo ve recognition in scientific abstracts is an NLP task of classifying sentences of the abstracts into different types of language units.To improve the performance of move recognition in scientific abstracts,... Purpose:Mo ve recognition in scientific abstracts is an NLP task of classifying sentences of the abstracts into different types of language units.To improve the performance of move recognition in scientific abstracts,a novel model of move recognition is proposed that outperforms the BERT-based method.Design/methodology/approach:Prevalent models based on BERT for sentence classification often classify sentences without considering the context of the sentences.In this paper,inspired by the BERT masked language model(MLM),we propose a novel model called the masked sentence model that integrates the content and contextual information of the sentences in move recognition.Experiments are conducted on the benchmark dataset PubMed 20K RCT in three steps.Then,we compare our model with HSLN-RNN,BERT-based and SciBERT using the same dataset.Findings:Compared with the BERT-based and SciBERT models,the F1 score of our model outperforms them by 4.96%and 4.34%,respectively,which shows the feasibility and effectiveness of the novel model and the result of our model comes closest to the state-of-theart results of HSLN-RNN at present.Research limitations:The sequential features of move labels are not considered,which might be one of the reasons why HSLN-RNN has better performance.Our model is restricted to dealing with biomedical English literature because we use a dataset from PubMed,which is a typical biomedical database,to fine-tune our model.Practical implications:The proposed model is better and simpler in identifying move structures in scientific abstracts and is worthy of text classification experiments for capturing contextual features of sentences.Originality/value:T he study proposes a masked sentence model based on BERT that considers the contextual features of the sentences in abstracts in a new way.The performance of this classification model is significantly improved by rebuilding the input layer without changing the structure of neural networks. 展开更多
关键词 Move recognition BERT Masked sentence model scientific abstracts
下载PDF
Cambridge Scientific Abstracts
2
《Meteorological and Environmental Research》 CAS 2012年第12期84-84,共1页
Meteorological and Environmental Research has been included by Cambridge Scientific Abstracts (CSA) since 2011. CSA is a retrieval system published by Cambridge Information Group. CSA was founded in the late 1950’s,a... Meteorological and Environmental Research has been included by Cambridge Scientific Abstracts (CSA) since 2011. CSA is a retrieval system published by Cambridge Information Group. CSA was founded in the late 1950’s,and became part of the CIG family in 1971. CSA’s original mission was publishing secondary source materials relating to the physical sciences. 展开更多
关键词 CSA Cambridge scientific abstracts CIG
下载PDF
Automatic Keyphrase Extraction from Scientific Chinese Medical Abstracts Based on Character-Level Sequence Labeling 被引量:3
3
作者 Liangping Ding Zhixiong Zhang +2 位作者 Huan Liu Jie Li GaihongYu 《Journal of Data and Information Science》 CSCD 2021年第3期35-57,共23页
Purpose:Automatic keyphrase extraction(AKE)is an important task for grasping the main points of the text.In this paper,we aim to combine the benefits of sequence labeling formulation and pretrained language model to p... Purpose:Automatic keyphrase extraction(AKE)is an important task for grasping the main points of the text.In this paper,we aim to combine the benefits of sequence labeling formulation and pretrained language model to propose an automatic keyphrase extraction model for Chinese scientific research.Design/methodology/approach:We regard AKE from Chinese text as a character-level sequence labeling task to avoid segmentation errors of Chinese tokenizer and initialize our model with pretrained language model BERT,which was released by Google in 2018.We collect data from Chinese Science Citation Database and construct a large-scale dataset from medical domain,which contains 100,000 abstracts as training set,6,000 abstracts as development set and 3,094 abstracts as test set.We use unsupervised keyphrase extraction methods including term frequency(TF),TF-IDF,TextRank and supervised machine learning methods including Conditional Random Field(CRF),Bidirectional Long Short Term Memory Network(BiLSTM),and BiLSTM-CRF as baselines.Experiments are designed to compare word-level and character-level sequence labeling approaches on supervised machine learning models and BERT-based models.Findings:Compared with character-level BiLSTM-CRF,the best baseline model with F1 score of 50.16%,our character-level sequence labeling model based on BERT obtains F1 score of 59.80%,getting 9.64%absolute improvement.Research limitations:We just consider automatic keyphrase extraction task rather than keyphrase generation task,so only keyphrases that are occurred in the given text can be extracted.In addition,our proposed dataset is not suitable for dealing with nested keyphrases.Practical implications:We make our character-level IOB format dataset of Chinese Automatic Keyphrase Extraction from scientific Chinese medical abstracts(CAKE)publicly available for the benefits of research community,which is available at:https://github.com/possible1402/Dataset-For-Chinese-Medical-Keyphrase-Extraction.Originality/value:By designing comparative experiments,our study demonstrates that character-level formulation is more suitable for Chinese automatic keyphrase extraction task under the general trend of pretrained language models.And our proposed dataset provides a unified method for model evaluation and can promote the development of Chinese automatic keyphrase extraction to some extent. 展开更多
关键词 Automatic keyphrase extraction Character-level sequence labeling Pretrained language model scientific chinese medical abstracts
下载PDF
Abstracts From the South China Cardiovascular Scientific Sessions
4
作者 黄征 刘伊丽 《South China Journal of Cardiology》 CAS 2000年第1期58-60,共3页
Ⅰ. BASIC RESEARCH 1. Establishment of the Experimental Animal Models To study myocardial hibernating phenomenon, chronic occlusive multi-vessel coronary stenosis were made by placing amiroid constrictors on proximal ... Ⅰ. BASIC RESEARCH 1. Establishment of the Experimental Animal Models To study myocardial hibernating phenomenon, chronic occlusive multi-vessel coronary stenosis were made by placing amiroid constrictors on proximal LAD and LCX in canine models. Rabbit artery restenosis models were created by balloon injury of iliac artery and high lipid diet. Acute coronary artery occlusive models were performed in closed chest canines by putting polyvinyl chloride emboli to LAD or LCX via catheter and external 展开更多
关键词 abstracts From the South China Cardiovascular scientific Sessions
下载PDF
上一页 1 下一页 到第
使用帮助 返回顶部