Purpose:Mo ve recognition in scientific abstracts is an NLP task of classifying sentences of the abstracts into different types of language units.To improve the performance of move recognition in scientific abstracts,...Purpose:Mo ve recognition in scientific abstracts is an NLP task of classifying sentences of the abstracts into different types of language units.To improve the performance of move recognition in scientific abstracts,a novel model of move recognition is proposed that outperforms the BERT-based method.Design/methodology/approach:Prevalent models based on BERT for sentence classification often classify sentences without considering the context of the sentences.In this paper,inspired by the BERT masked language model(MLM),we propose a novel model called the masked sentence model that integrates the content and contextual information of the sentences in move recognition.Experiments are conducted on the benchmark dataset PubMed 20K RCT in three steps.Then,we compare our model with HSLN-RNN,BERT-based and SciBERT using the same dataset.Findings:Compared with the BERT-based and SciBERT models,the F1 score of our model outperforms them by 4.96%and 4.34%,respectively,which shows the feasibility and effectiveness of the novel model and the result of our model comes closest to the state-of-theart results of HSLN-RNN at present.Research limitations:The sequential features of move labels are not considered,which might be one of the reasons why HSLN-RNN has better performance.Our model is restricted to dealing with biomedical English literature because we use a dataset from PubMed,which is a typical biomedical database,to fine-tune our model.Practical implications:The proposed model is better and simpler in identifying move structures in scientific abstracts and is worthy of text classification experiments for capturing contextual features of sentences.Originality/value:T he study proposes a masked sentence model based on BERT that considers the contextual features of the sentences in abstracts in a new way.The performance of this classification model is significantly improved by rebuilding the input layer without changing the structure of neural networks.展开更多
Purpose:Automatic keyphrase extraction(AKE)is an important task for grasping the main points of the text.In this paper,we aim to combine the benefits of sequence labeling formulation and pretrained language model to p...Purpose:Automatic keyphrase extraction(AKE)is an important task for grasping the main points of the text.In this paper,we aim to combine the benefits of sequence labeling formulation and pretrained language model to propose an automatic keyphrase extraction model for Chinese scientific research.Design/methodology/approach:We regard AKE from Chinese text as a character-level sequence labeling task to avoid segmentation errors of Chinese tokenizer and initialize our model with pretrained language model BERT,which was released by Google in 2018.We collect data from Chinese Science Citation Database and construct a large-scale dataset from medical domain,which contains 100,000 abstracts as training set,6,000 abstracts as development set and 3,094 abstracts as test set.We use unsupervised keyphrase extraction methods including term frequency(TF),TF-IDF,TextRank and supervised machine learning methods including Conditional Random Field(CRF),Bidirectional Long Short Term Memory Network(BiLSTM),and BiLSTM-CRF as baselines.Experiments are designed to compare word-level and character-level sequence labeling approaches on supervised machine learning models and BERT-based models.Findings:Compared with character-level BiLSTM-CRF,the best baseline model with F1 score of 50.16%,our character-level sequence labeling model based on BERT obtains F1 score of 59.80%,getting 9.64%absolute improvement.Research limitations:We just consider automatic keyphrase extraction task rather than keyphrase generation task,so only keyphrases that are occurred in the given text can be extracted.In addition,our proposed dataset is not suitable for dealing with nested keyphrases.Practical implications:We make our character-level IOB format dataset of Chinese Automatic Keyphrase Extraction from scientific Chinese medical abstracts(CAKE)publicly available for the benefits of research community,which is available at:https://github.com/possible1402/Dataset-For-Chinese-Medical-Keyphrase-Extraction.Originality/value:By designing comparative experiments,our study demonstrates that character-level formulation is more suitable for Chinese automatic keyphrase extraction task under the general trend of pretrained language models.And our proposed dataset provides a unified method for model evaluation and can promote the development of Chinese automatic keyphrase extraction to some extent.展开更多
Cambridge Scientific Abstracts (CSA) is a retrieval system published by Cambridge Information Group. With a history of more than 30 years, the company is an important publisher of publishing abstracts and indexes wi...Cambridge Scientific Abstracts (CSA) is a retrieval system published by Cambridge Information Group. With a history of more than 30 years, the company is an important publisher of publishing abstracts and indexes with agent centers in Britain, France, Austral- ia, Japan, Netherlands, New Zealand, and Hong Kong China. The products include traditional printed abstracting and indexing jour- nals, a variety of electronic database which are retrieved through remote on-line and CD-ROM. Recently, the CSA launched network based (IDS) services, which can retrieve hundreds of databases provided by CSA and CSA publishing partners.展开更多
Cambridge Scientific Abstracts (CSA) is a retrieval system published by Cambridge Information Group. With a history of more than 30 years, the company is an important publisher of publishing abstracts and indexes wi...Cambridge Scientific Abstracts (CSA) is a retrieval system published by Cambridge Information Group. With a history of more than 30 years, the company is an important publisher of publishing abstracts and indexes with agent centers in Britain, France, Austral- ia, Japan, Netherlands, New Zealand, and Hong Kong China. The products include traditional printed abstracting and indexing jour- nals, a variety of electronic database which are retrieved through remote on-line and CD-ROM. Recently, the CSA launched network based (IDS) services, which can retrieve hundreds of databases provided by CSA and CSA publishing partners.展开更多
Cambridge Scientific Abstracts (CSA) is a retrieval system published by Cambridge Information Group. With a history of more than 30 years, the company is an important publisher of publishing abstracts and indexes wi...Cambridge Scientific Abstracts (CSA) is a retrieval system published by Cambridge Information Group. With a history of more than 30 years, the company is an important publisher of publishing abstracts and indexes with agent centers in Britain, France, Austral- ia, Japan, Netherlands, New Zealand, and Hong Kong China. The products include traditional printed abstracting and indexing jour- nals, a variety of electronic database which are retrieved through remote on-line and CD-ROM. Recently, the CSA launched network based (IDS) services, which can retrieve hundreds of databases provided by CSA and CSA publishing partners.展开更多
Cambridge Scientific Abstracts (CSA) is a retrieval system published by Cambridge Information Group. With a history of more than 30 years, the company is an important publisher of publishing abstracts and indexes wi...Cambridge Scientific Abstracts (CSA) is a retrieval system published by Cambridge Information Group. With a history of more than 30 years, the company is an important publisher of publishing abstracts and indexes with agent centers in Britain, France, Australia, Japan, Netherlands, New Zealand, and Hong Kong China. The products include traditional printed abstracting and indexing journals, a variety of electronic database which are retrieved through remote on-line and CD-ROM. Recently, the CSA launched network based (IDS) services, which can retrieve hundreds of databases provided by CSA and CSA publishing partners.展开更多
Meteorological and Environmental Research has been included by Cambridge Scientific Abstracts (CSA) since 2011. CSA is a retrieval system published by Cambridge Information Group. CSA was founded in the late 1950’s,a...Meteorological and Environmental Research has been included by Cambridge Scientific Abstracts (CSA) since 2011. CSA is a retrieval system published by Cambridge Information Group. CSA was founded in the late 1950’s,and became part of the CIG family in 1971. CSA’s original mission was publishing secondary source materials relating to the physical sciences. Completely展开更多
Cambridge Scientific Abstracts (CSA) is a retrieval system published by Cambridge Information Group. With a history of more than 30 years, the company is an important publisher of publishing abstracts and indexes wi...Cambridge Scientific Abstracts (CSA) is a retrieval system published by Cambridge Information Group. With a history of more than 30 years, the company is an important publisher of publishing abstracts and indexes with agent centers in Britain, France, Australia, Japan, Netherlands, New Zealand, and Hong Kong China.展开更多
Cambridge Scientific Abstracts (CSA) is a retrieval system published by Cambridge Information Group. With a history of more than 30 years, the company is an important publisher of publishing abstracts and indexes wi...Cambridge Scientific Abstracts (CSA) is a retrieval system published by Cambridge Information Group. With a history of more than 30 years, the company is an important publisher of publishing abstracts and indexes with agent centers in Britain, France, Australia, Japan, Netherlands, New Zealand, and Hong Kong China. The products include traditional printed abstracting and indexing journals, a variety of electronic database which are retrieved through remote on-line and CD-ROM. Recently, the CSA launched network based (IDS) services, which can retrieve hundreds of databases provided by CSA and CSA publishing partners.展开更多
Cambridge Scientific Abstracts (CSA) is a retrieval system published by Cambridge Information Group. With a history of morc than 30 years, the company is an important publisher of publishing abstracts and indexes wi...Cambridge Scientific Abstracts (CSA) is a retrieval system published by Cambridge Information Group. With a history of morc than 30 years, the company is an important publisher of publishing abstracts and indexes with agent centers in Britain, France, Austral- ia, Japan, Netherlands, New Zealand, and Hong Kong China. The products include traditional printed abstracting and indexing jour- nals, a variety of electronic database which are retrieved through remote on-line and CD-ROM. Recently, the CSA launched network based (IDS) services, which can retrieve hundreds of databases provided by CSA and CSA publishing partners.展开更多
Cambridge Scientific Abstracts (CSA) is a retrieval system published by Cambridge Information Group. With a history of more than 30 years, the company is an important publisher of publishing abstracts and indexes wi...Cambridge Scientific Abstracts (CSA) is a retrieval system published by Cambridge Information Group. With a history of more than 30 years, the company is an important publisher of publishing abstracts and indexes with agent centers in Britain, France, Austral- ia, Japan, Netherlands, New Zealand, and Hong Kong China. The products include traditional printed abstracting and indexing jour- nals, a variety of electronic database which are retrieved through remote on-line and CD-ROM. Recently, the CSA lannched network based (IDS) services, which can retrieve hundreds of databases provided by CSA and CSA publishing partners.展开更多
On December 22, 2010, an E-mail by Dr. Elena Raevskaya, the manager of Asia and Africa Section, All-Russian Institute of Scientific and Technical Information (V1NITI), was sent to Journal Publishing Center, Xinjiang...On December 22, 2010, an E-mail by Dr. Elena Raevskaya, the manager of Asia and Africa Section, All-Russian Institute of Scientific and Technical Information (V1NITI), was sent to Journal Publishing Center, Xinjiang Insti- tute of Ecology and Geography, CAS and said “Here is some new information about the Chinese journals evaluated by our experts. Ten journals have been found informative and useful in our work and have been included in VINITI database for regular abstraction (some of these journals were reexamined by expertise).”展开更多
Ⅰ. BASIC RESEARCH 1. Establishment of the Experimental Animal Models To study myocardial hibernating phenomenon, chronic occlusive multi-vessel coronary stenosis were made by placing amiroid constrictors on proximal ...Ⅰ. BASIC RESEARCH 1. Establishment of the Experimental Animal Models To study myocardial hibernating phenomenon, chronic occlusive multi-vessel coronary stenosis were made by placing amiroid constrictors on proximal LAD and LCX in canine models. Rabbit artery restenosis models were created by balloon injury of iliac artery and high lipid diet. Acute coronary artery occlusive models were performed in closed chest canines by putting polyvinyl chloride emboli to LAD or LCX via catheter and external counterpul展开更多
基金supported by the project “The demonstration system of rich semantic search application in scientific literature” (Grant No. 1734) from the Chinese Academy of Sciences
文摘Purpose:Mo ve recognition in scientific abstracts is an NLP task of classifying sentences of the abstracts into different types of language units.To improve the performance of move recognition in scientific abstracts,a novel model of move recognition is proposed that outperforms the BERT-based method.Design/methodology/approach:Prevalent models based on BERT for sentence classification often classify sentences without considering the context of the sentences.In this paper,inspired by the BERT masked language model(MLM),we propose a novel model called the masked sentence model that integrates the content and contextual information of the sentences in move recognition.Experiments are conducted on the benchmark dataset PubMed 20K RCT in three steps.Then,we compare our model with HSLN-RNN,BERT-based and SciBERT using the same dataset.Findings:Compared with the BERT-based and SciBERT models,the F1 score of our model outperforms them by 4.96%and 4.34%,respectively,which shows the feasibility and effectiveness of the novel model and the result of our model comes closest to the state-of-theart results of HSLN-RNN at present.Research limitations:The sequential features of move labels are not considered,which might be one of the reasons why HSLN-RNN has better performance.Our model is restricted to dealing with biomedical English literature because we use a dataset from PubMed,which is a typical biomedical database,to fine-tune our model.Practical implications:The proposed model is better and simpler in identifying move structures in scientific abstracts and is worthy of text classification experiments for capturing contextual features of sentences.Originality/value:T he study proposes a masked sentence model based on BERT that considers the contextual features of the sentences in abstracts in a new way.The performance of this classification model is significantly improved by rebuilding the input layer without changing the structure of neural networks.
基金This work is supported by the project“Research on Methods and Technologies of Scientific Researcher Entity Linking and Subject Indexing”(Grant No.G190091)from the National Science Library,Chinese Academy of Sciencesthe project“Design and Research on a Next Generation of Open Knowledge Services System and Key Technologies”(2019XM55).
文摘Purpose:Automatic keyphrase extraction(AKE)is an important task for grasping the main points of the text.In this paper,we aim to combine the benefits of sequence labeling formulation and pretrained language model to propose an automatic keyphrase extraction model for Chinese scientific research.Design/methodology/approach:We regard AKE from Chinese text as a character-level sequence labeling task to avoid segmentation errors of Chinese tokenizer and initialize our model with pretrained language model BERT,which was released by Google in 2018.We collect data from Chinese Science Citation Database and construct a large-scale dataset from medical domain,which contains 100,000 abstracts as training set,6,000 abstracts as development set and 3,094 abstracts as test set.We use unsupervised keyphrase extraction methods including term frequency(TF),TF-IDF,TextRank and supervised machine learning methods including Conditional Random Field(CRF),Bidirectional Long Short Term Memory Network(BiLSTM),and BiLSTM-CRF as baselines.Experiments are designed to compare word-level and character-level sequence labeling approaches on supervised machine learning models and BERT-based models.Findings:Compared with character-level BiLSTM-CRF,the best baseline model with F1 score of 50.16%,our character-level sequence labeling model based on BERT obtains F1 score of 59.80%,getting 9.64%absolute improvement.Research limitations:We just consider automatic keyphrase extraction task rather than keyphrase generation task,so only keyphrases that are occurred in the given text can be extracted.In addition,our proposed dataset is not suitable for dealing with nested keyphrases.Practical implications:We make our character-level IOB format dataset of Chinese Automatic Keyphrase Extraction from scientific Chinese medical abstracts(CAKE)publicly available for the benefits of research community,which is available at:https://github.com/possible1402/Dataset-For-Chinese-Medical-Keyphrase-Extraction.Originality/value:By designing comparative experiments,our study demonstrates that character-level formulation is more suitable for Chinese automatic keyphrase extraction task under the general trend of pretrained language models.And our proposed dataset provides a unified method for model evaluation and can promote the development of Chinese automatic keyphrase extraction to some extent.
文摘Cambridge Scientific Abstracts (CSA) is a retrieval system published by Cambridge Information Group. With a history of more than 30 years, the company is an important publisher of publishing abstracts and indexes with agent centers in Britain, France, Austral- ia, Japan, Netherlands, New Zealand, and Hong Kong China. The products include traditional printed abstracting and indexing jour- nals, a variety of electronic database which are retrieved through remote on-line and CD-ROM. Recently, the CSA launched network based (IDS) services, which can retrieve hundreds of databases provided by CSA and CSA publishing partners.
文摘Cambridge Scientific Abstracts (CSA) is a retrieval system published by Cambridge Information Group. With a history of more than 30 years, the company is an important publisher of publishing abstracts and indexes with agent centers in Britain, France, Austral- ia, Japan, Netherlands, New Zealand, and Hong Kong China. The products include traditional printed abstracting and indexing jour- nals, a variety of electronic database which are retrieved through remote on-line and CD-ROM. Recently, the CSA launched network based (IDS) services, which can retrieve hundreds of databases provided by CSA and CSA publishing partners.
文摘Cambridge Scientific Abstracts (CSA) is a retrieval system published by Cambridge Information Group. With a history of more than 30 years, the company is an important publisher of publishing abstracts and indexes with agent centers in Britain, France, Austral- ia, Japan, Netherlands, New Zealand, and Hong Kong China. The products include traditional printed abstracting and indexing jour- nals, a variety of electronic database which are retrieved through remote on-line and CD-ROM. Recently, the CSA launched network based (IDS) services, which can retrieve hundreds of databases provided by CSA and CSA publishing partners.
文摘Cambridge Scientific Abstracts (CSA) is a retrieval system published by Cambridge Information Group. With a history of more than 30 years, the company is an important publisher of publishing abstracts and indexes with agent centers in Britain, France, Australia, Japan, Netherlands, New Zealand, and Hong Kong China. The products include traditional printed abstracting and indexing journals, a variety of electronic database which are retrieved through remote on-line and CD-ROM. Recently, the CSA launched network based (IDS) services, which can retrieve hundreds of databases provided by CSA and CSA publishing partners.
文摘Meteorological and Environmental Research has been included by Cambridge Scientific Abstracts (CSA) since 2011. CSA is a retrieval system published by Cambridge Information Group. CSA was founded in the late 1950’s,and became part of the CIG family in 1971. CSA’s original mission was publishing secondary source materials relating to the physical sciences. Completely
文摘Cambridge Scientific Abstracts (CSA) is a retrieval system published by Cambridge Information Group. With a history of more than 30 years, the company is an important publisher of publishing abstracts and indexes with agent centers in Britain, France, Australia, Japan, Netherlands, New Zealand, and Hong Kong China.
文摘Cambridge Scientific Abstracts (CSA) is a retrieval system published by Cambridge Information Group. With a history of more than 30 years, the company is an important publisher of publishing abstracts and indexes with agent centers in Britain, France, Australia, Japan, Netherlands, New Zealand, and Hong Kong China. The products include traditional printed abstracting and indexing journals, a variety of electronic database which are retrieved through remote on-line and CD-ROM. Recently, the CSA launched network based (IDS) services, which can retrieve hundreds of databases provided by CSA and CSA publishing partners.
文摘Cambridge Scientific Abstracts (CSA) is a retrieval system published by Cambridge Information Group. With a history of morc than 30 years, the company is an important publisher of publishing abstracts and indexes with agent centers in Britain, France, Austral- ia, Japan, Netherlands, New Zealand, and Hong Kong China. The products include traditional printed abstracting and indexing jour- nals, a variety of electronic database which are retrieved through remote on-line and CD-ROM. Recently, the CSA launched network based (IDS) services, which can retrieve hundreds of databases provided by CSA and CSA publishing partners.
文摘Cambridge Scientific Abstracts (CSA) is a retrieval system published by Cambridge Information Group. With a history of more than 30 years, the company is an important publisher of publishing abstracts and indexes with agent centers in Britain, France, Austral- ia, Japan, Netherlands, New Zealand, and Hong Kong China. The products include traditional printed abstracting and indexing jour- nals, a variety of electronic database which are retrieved through remote on-line and CD-ROM. Recently, the CSA lannched network based (IDS) services, which can retrieve hundreds of databases provided by CSA and CSA publishing partners.
文摘On December 22, 2010, an E-mail by Dr. Elena Raevskaya, the manager of Asia and Africa Section, All-Russian Institute of Scientific and Technical Information (V1NITI), was sent to Journal Publishing Center, Xinjiang Insti- tute of Ecology and Geography, CAS and said “Here is some new information about the Chinese journals evaluated by our experts. Ten journals have been found informative and useful in our work and have been included in VINITI database for regular abstraction (some of these journals were reexamined by expertise).”
文摘Ⅰ. BASIC RESEARCH 1. Establishment of the Experimental Animal Models To study myocardial hibernating phenomenon, chronic occlusive multi-vessel coronary stenosis were made by placing amiroid constrictors on proximal LAD and LCX in canine models. Rabbit artery restenosis models were created by balloon injury of iliac artery and high lipid diet. Acute coronary artery occlusive models were performed in closed chest canines by putting polyvinyl chloride emboli to LAD or LCX via catheter and external counterpul