The rhetorical structure of abstracts has been a widely discussed topic, as it can greatly enhance the abstract writing skills of second-language writers. This study aims to provide guidance on the syntactic features ...The rhetorical structure of abstracts has been a widely discussed topic, as it can greatly enhance the abstract writing skills of second-language writers. This study aims to provide guidance on the syntactic features that L2 learners can employ, as well as suggest which features they should focus on in English academic writing. To achieve this, all samples were analyzed for rhetorical moves using Hyland’s five-rhetorical move model. Additionally, all sentences were evaluated for syntactic complexity, considering measures such as global, clausal and phrasal complexity. The findings reveal that expert writers exhibit a more balanced use of syntactic complexity across moves, effectively fulfilling the rhetorical objectives of abstracts. On the other hand, MA students tend to rely excessively on embedded structures and dependent clauses in an attempt to increase complexity. The implications of these findings for academic writing research, pedagogy, and assessment are thoroughly discussed.展开更多
Purpose:Mo ve recognition in scientific abstracts is an NLP task of classifying sentences of the abstracts into different types of language units.To improve the performance of move recognition in scientific abstracts,...Purpose:Mo ve recognition in scientific abstracts is an NLP task of classifying sentences of the abstracts into different types of language units.To improve the performance of move recognition in scientific abstracts,a novel model of move recognition is proposed that outperforms the BERT-based method.Design/methodology/approach:Prevalent models based on BERT for sentence classification often classify sentences without considering the context of the sentences.In this paper,inspired by the BERT masked language model(MLM),we propose a novel model called the masked sentence model that integrates the content and contextual information of the sentences in move recognition.Experiments are conducted on the benchmark dataset PubMed 20K RCT in three steps.Then,we compare our model with HSLN-RNN,BERT-based and SciBERT using the same dataset.Findings:Compared with the BERT-based and SciBERT models,the F1 score of our model outperforms them by 4.96%and 4.34%,respectively,which shows the feasibility and effectiveness of the novel model and the result of our model comes closest to the state-of-theart results of HSLN-RNN at present.Research limitations:The sequential features of move labels are not considered,which might be one of the reasons why HSLN-RNN has better performance.Our model is restricted to dealing with biomedical English literature because we use a dataset from PubMed,which is a typical biomedical database,to fine-tune our model.Practical implications:The proposed model is better and simpler in identifying move structures in scientific abstracts and is worthy of text classification experiments for capturing contextual features of sentences.Originality/value:T he study proposes a masked sentence model based on BERT that considers the contextual features of the sentences in abstracts in a new way.The performance of this classification model is significantly improved by rebuilding the input layer without changing the structure of neural networks.展开更多
Purpose:Automatic keyphrase extraction(AKE)is an important task for grasping the main points of the text.In this paper,we aim to combine the benefits of sequence labeling formulation and pretrained language model to p...Purpose:Automatic keyphrase extraction(AKE)is an important task for grasping the main points of the text.In this paper,we aim to combine the benefits of sequence labeling formulation and pretrained language model to propose an automatic keyphrase extraction model for Chinese scientific research.Design/methodology/approach:We regard AKE from Chinese text as a character-level sequence labeling task to avoid segmentation errors of Chinese tokenizer and initialize our model with pretrained language model BERT,which was released by Google in 2018.We collect data from Chinese Science Citation Database and construct a large-scale dataset from medical domain,which contains 100,000 abstracts as training set,6,000 abstracts as development set and 3,094 abstracts as test set.We use unsupervised keyphrase extraction methods including term frequency(TF),TF-IDF,TextRank and supervised machine learning methods including Conditional Random Field(CRF),Bidirectional Long Short Term Memory Network(BiLSTM),and BiLSTM-CRF as baselines.Experiments are designed to compare word-level and character-level sequence labeling approaches on supervised machine learning models and BERT-based models.Findings:Compared with character-level BiLSTM-CRF,the best baseline model with F1 score of 50.16%,our character-level sequence labeling model based on BERT obtains F1 score of 59.80%,getting 9.64%absolute improvement.Research limitations:We just consider automatic keyphrase extraction task rather than keyphrase generation task,so only keyphrases that are occurred in the given text can be extracted.In addition,our proposed dataset is not suitable for dealing with nested keyphrases.Practical implications:We make our character-level IOB format dataset of Chinese Automatic Keyphrase Extraction from scientific Chinese medical abstracts(CAKE)publicly available for the benefits of research community,which is available at:https://github.com/possible1402/Dataset-For-Chinese-Medical-Keyphrase-Extraction.Originality/value:By designing comparative experiments,our study demonstrates that character-level formulation is more suitable for Chinese automatic keyphrase extraction task under the general trend of pretrained language models.And our proposed dataset provides a unified method for model evaluation and can promote the development of Chinese automatic keyphrase extraction to some extent.展开更多
The Chinese Optics and Applied Optics Abstracts , sponsored by the Documentation andInformation Center of the Chinese Academy of Sciences, the Optical Information Networkof the Chinese Academy of Sciences and the Chan...The Chinese Optics and Applied Optics Abstracts , sponsored by the Documentation andInformation Center of the Chinese Academy of Sciences, the Optical Information Networkof the Chinese Academy of Sciences and the Changchun Institute of Optics, Fine Mechanicsand Physics of the Chinese Academy of Sciences, is one of the series of science andtechnology indexing periodicals published by the Chinese Academy of Sciences.The Chinese Optics and Applied Optics Abstracts started a quarterly publication in 1985,with the name of Chinese Science and Technology Document Catalogues: Optics andApplied Optics. It changed into a bimonthly publication with the name of Chinese Opticsand Applied Optics Abstracts in 1987. In combination with the Chinese Optics Documen-展开更多
Inclusion variations and calcium treatment optimizationin pipeline steel productionLIU Jianhua, WU Huajie, BAO Yanping, and WANG Min Abstract SiCa line and SiCaBaFe alloy were injected into liquid pipe-line steel at t...Inclusion variations and calcium treatment optimizationin pipeline steel productionLIU Jianhua, WU Huajie, BAO Yanping, and WANG Min Abstract SiCa line and SiCaBaFe alloy were injected into liquid pipe-line steel at the end of LF refining as calcium treatment,展开更多
Genre analysis has become one of the most important approaches to text analysis, especially in the field of English for Specific Purposes. Abstract is the essential part of the paper, which helps the readers get initi...Genre analysis has become one of the most important approaches to text analysis, especially in the field of English for Specific Purposes. Abstract is the essential part of the paper, which helps the readers get initial impressions. Because of its particular the usage of communication, it has its special rules and mode. The purpose of this paper was to present the 5-move model characteristic of research dissertation Abstracts, and explore the linguistic characteristics of each move. The analysis started from the macrostructure, i.e from the text as a whole, towards the microstructure which included linguistic description (syntactic and lexical). The results showed that most abstracts followed 5-move model and the linguistic features of this genre.展开更多
Nov. 1—4, 1989, Beijing, China River water chemistry in India-An overview V. Subramamian School of Environmental Sciences, Jawaharlal Nehru University, New Delhi 110067, India. Based on extensive analyses of a very l...Nov. 1—4, 1989, Beijing, China River water chemistry in India-An overview V. Subramamian School of Environmental Sciences, Jawaharlal Nehru University, New Delhi 110067, India. Based on extensive analyses of a very large number of samples, the average river water in India is more alkaline than the world average river water. The dominance of Na and Cl in Indian river shows their monsoon control. There are spatial and seasonal variations. The northern river are less saline than the southern rivers. The sediments covered by the Ganges-展开更多
The 7th international conference on unsaturated soils(3rd–5th August 2018)is organised by the Hong Kong University of Science and Technology and supported by:TC106 Unsaturated Soils of ISSMGE;Hong Kong Geotechnica...The 7th international conference on unsaturated soils(3rd–5th August 2018)is organised by the Hong Kong University of Science and Technology and supported by:TC106 Unsaturated Soils of ISSMGE;Hong Kong Geotechnical Society;Geotechnical Division of Hong Kong Institution of Engineers;展开更多
Research on Conodont Biostratigraphy near the Bottom Boundary of the Middle Triassic Qingyan Stage in Southern Guizhou ProvinceYao Jianxin, Ji Zhansheng, Wang Liting , Wang Yanbin andWu Guichun(1. Institute of Geology...Research on Conodont Biostratigraphy near the Bottom Boundary of the Middle Triassic Qingyan Stage in Southern Guizhou ProvinceYao Jianxin, Ji Zhansheng, Wang Liting , Wang Yanbin andWu Guichun(1. Institute of Geology, Chinese Academy of GeologicalSciences, Beijing, 100037; 2. Bureau of Geological and MineralResources Survey of Guizhou Province, Guiyang, Guizhou550004)展开更多
Analysis on Therapeutic Effect of 182 Casesof Chronic Prostatitis Treated by UmbilicalTherapyCheng Kejia 程可佳Journal of Chinese Acupuncture &Moxibustion 1992,12(5):5-6The drug used include SemenVaccariae,Rhizoma...Analysis on Therapeutic Effect of 182 Casesof Chronic Prostatitis Treated by UmbilicalTherapyCheng Kejia 程可佳Journal of Chinese Acupuncture &Moxibustion 1992,12(5):5-6The drug used include SemenVaccariae,Rhizoma Acori Graminei。展开更多
09-02-001 海岸湿地针叶林-阔叶林突变临界维持机制研究=Maintenance of an abrupt boundary between needle-leaved and broad-leaved forests in a wetland near coast[刊,英]/Shiro Tsuyuzaki1。
文摘The rhetorical structure of abstracts has been a widely discussed topic, as it can greatly enhance the abstract writing skills of second-language writers. This study aims to provide guidance on the syntactic features that L2 learners can employ, as well as suggest which features they should focus on in English academic writing. To achieve this, all samples were analyzed for rhetorical moves using Hyland’s five-rhetorical move model. Additionally, all sentences were evaluated for syntactic complexity, considering measures such as global, clausal and phrasal complexity. The findings reveal that expert writers exhibit a more balanced use of syntactic complexity across moves, effectively fulfilling the rhetorical objectives of abstracts. On the other hand, MA students tend to rely excessively on embedded structures and dependent clauses in an attempt to increase complexity. The implications of these findings for academic writing research, pedagogy, and assessment are thoroughly discussed.
基金supported by the project “The demonstration system of rich semantic search application in scientific literature” (Grant No. 1734) from the Chinese Academy of Sciences
文摘Purpose:Mo ve recognition in scientific abstracts is an NLP task of classifying sentences of the abstracts into different types of language units.To improve the performance of move recognition in scientific abstracts,a novel model of move recognition is proposed that outperforms the BERT-based method.Design/methodology/approach:Prevalent models based on BERT for sentence classification often classify sentences without considering the context of the sentences.In this paper,inspired by the BERT masked language model(MLM),we propose a novel model called the masked sentence model that integrates the content and contextual information of the sentences in move recognition.Experiments are conducted on the benchmark dataset PubMed 20K RCT in three steps.Then,we compare our model with HSLN-RNN,BERT-based and SciBERT using the same dataset.Findings:Compared with the BERT-based and SciBERT models,the F1 score of our model outperforms them by 4.96%and 4.34%,respectively,which shows the feasibility and effectiveness of the novel model and the result of our model comes closest to the state-of-theart results of HSLN-RNN at present.Research limitations:The sequential features of move labels are not considered,which might be one of the reasons why HSLN-RNN has better performance.Our model is restricted to dealing with biomedical English literature because we use a dataset from PubMed,which is a typical biomedical database,to fine-tune our model.Practical implications:The proposed model is better and simpler in identifying move structures in scientific abstracts and is worthy of text classification experiments for capturing contextual features of sentences.Originality/value:T he study proposes a masked sentence model based on BERT that considers the contextual features of the sentences in abstracts in a new way.The performance of this classification model is significantly improved by rebuilding the input layer without changing the structure of neural networks.
基金This work is supported by the project“Research on Methods and Technologies of Scientific Researcher Entity Linking and Subject Indexing”(Grant No.G190091)from the National Science Library,Chinese Academy of Sciencesthe project“Design and Research on a Next Generation of Open Knowledge Services System and Key Technologies”(2019XM55).
文摘Purpose:Automatic keyphrase extraction(AKE)is an important task for grasping the main points of the text.In this paper,we aim to combine the benefits of sequence labeling formulation and pretrained language model to propose an automatic keyphrase extraction model for Chinese scientific research.Design/methodology/approach:We regard AKE from Chinese text as a character-level sequence labeling task to avoid segmentation errors of Chinese tokenizer and initialize our model with pretrained language model BERT,which was released by Google in 2018.We collect data from Chinese Science Citation Database and construct a large-scale dataset from medical domain,which contains 100,000 abstracts as training set,6,000 abstracts as development set and 3,094 abstracts as test set.We use unsupervised keyphrase extraction methods including term frequency(TF),TF-IDF,TextRank and supervised machine learning methods including Conditional Random Field(CRF),Bidirectional Long Short Term Memory Network(BiLSTM),and BiLSTM-CRF as baselines.Experiments are designed to compare word-level and character-level sequence labeling approaches on supervised machine learning models and BERT-based models.Findings:Compared with character-level BiLSTM-CRF,the best baseline model with F1 score of 50.16%,our character-level sequence labeling model based on BERT obtains F1 score of 59.80%,getting 9.64%absolute improvement.Research limitations:We just consider automatic keyphrase extraction task rather than keyphrase generation task,so only keyphrases that are occurred in the given text can be extracted.In addition,our proposed dataset is not suitable for dealing with nested keyphrases.Practical implications:We make our character-level IOB format dataset of Chinese Automatic Keyphrase Extraction from scientific Chinese medical abstracts(CAKE)publicly available for the benefits of research community,which is available at:https://github.com/possible1402/Dataset-For-Chinese-Medical-Keyphrase-Extraction.Originality/value:By designing comparative experiments,our study demonstrates that character-level formulation is more suitable for Chinese automatic keyphrase extraction task under the general trend of pretrained language models.And our proposed dataset provides a unified method for model evaluation and can promote the development of Chinese automatic keyphrase extraction to some extent.
文摘The Chinese Optics and Applied Optics Abstracts , sponsored by the Documentation andInformation Center of the Chinese Academy of Sciences, the Optical Information Networkof the Chinese Academy of Sciences and the Changchun Institute of Optics, Fine Mechanicsand Physics of the Chinese Academy of Sciences, is one of the series of science andtechnology indexing periodicals published by the Chinese Academy of Sciences.The Chinese Optics and Applied Optics Abstracts started a quarterly publication in 1985,with the name of Chinese Science and Technology Document Catalogues: Optics andApplied Optics. It changed into a bimonthly publication with the name of Chinese Opticsand Applied Optics Abstracts in 1987. In combination with the Chinese Optics Documen-
文摘Inclusion variations and calcium treatment optimizationin pipeline steel productionLIU Jianhua, WU Huajie, BAO Yanping, and WANG Min Abstract SiCa line and SiCaBaFe alloy were injected into liquid pipe-line steel at the end of LF refining as calcium treatment,
文摘Genre analysis has become one of the most important approaches to text analysis, especially in the field of English for Specific Purposes. Abstract is the essential part of the paper, which helps the readers get initial impressions. Because of its particular the usage of communication, it has its special rules and mode. The purpose of this paper was to present the 5-move model characteristic of research dissertation Abstracts, and explore the linguistic characteristics of each move. The analysis started from the macrostructure, i.e from the text as a whole, towards the microstructure which included linguistic description (syntactic and lexical). The results showed that most abstracts followed 5-move model and the linguistic features of this genre.
文摘Nov. 1—4, 1989, Beijing, China River water chemistry in India-An overview V. Subramamian School of Environmental Sciences, Jawaharlal Nehru University, New Delhi 110067, India. Based on extensive analyses of a very large number of samples, the average river water in India is more alkaline than the world average river water. The dominance of Na and Cl in Indian river shows their monsoon control. There are spatial and seasonal variations. The northern river are less saline than the southern rivers. The sediments covered by the Ganges-
文摘The 7th international conference on unsaturated soils(3rd–5th August 2018)is organised by the Hong Kong University of Science and Technology and supported by:TC106 Unsaturated Soils of ISSMGE;Hong Kong Geotechnical Society;Geotechnical Division of Hong Kong Institution of Engineers;
文摘Research on Conodont Biostratigraphy near the Bottom Boundary of the Middle Triassic Qingyan Stage in Southern Guizhou ProvinceYao Jianxin, Ji Zhansheng, Wang Liting , Wang Yanbin andWu Guichun(1. Institute of Geology, Chinese Academy of GeologicalSciences, Beijing, 100037; 2. Bureau of Geological and MineralResources Survey of Guizhou Province, Guiyang, Guizhou550004)
文摘Analysis on Therapeutic Effect of 182 Casesof Chronic Prostatitis Treated by UmbilicalTherapyCheng Kejia 程可佳Journal of Chinese Acupuncture &Moxibustion 1992,12(5):5-6The drug used include SemenVaccariae,Rhizoma Acori Graminei。
文摘09-02-001 海岸湿地针叶林-阔叶林突变临界维持机制研究=Maintenance of an abrupt boundary between needle-leaved and broad-leaved forests in a wetland near coast[刊,英]/Shiro Tsuyuzaki1。