The Chinese Optics and Applied Optics Abstracts, sponsored by the Documentation and Information Center of the Chinese Academy of Sciences, the Optical Information Network of the Chinese Academy of Sciences and the Changchun Institute of Optics, Fine Mechanics and Physics of the Chinese Academy of Sciences, is one of the series of science and technology indexing periodicals published by the Chinese Academy of Sciences. The Chinese Optics and Applied Optics Abstracts started as a quarterly publication in 1985, under the name Chinese Science and Technology Document Catalogues: Optics and Applied Optics. It became a bimonthly publication under the name Chinese Optics and Applied Optics Abstracts in 1987. In combination with the Chinese Optics Documen-
The excited-state intramolecular hydrogen abstraction reactions of butanal have been investigated using the CAS-MP2/6-311+G^*//CASSCF/6-31G^* methods. Calculated results show that hydrogen-transfer-induced fluorescence quenching of the n,π^*-excited state of covalent butanal proceeds along three paths: (1) The first path corresponds to direct S0-react reconstitution, which involves the first S1 decay by partial hydrogen atom transfer. (2) The second, stepwise mechanism can be viewed as a full hydrogen atom transfer followed by a partial hydrogen atom back transfer, electron transfer (near S1/S0 or S0-TS) and finally a proton transfer to S0-react. (3) On the triplet surface, the surface crossing to the singlet state would clearly be much more efficient in the T1/S0 region due to the large SOC value of 8.3 cm^-1. The S0-react decay route from T1/S0 was studied with an intrinsic reaction coordinate (IRC) calculation at the CASSCF level, resulting in the S0-react minimum.
Prognosis of residual coal gas capacity made by the 'Express' method. Pavel Prokop, Pavel Zapletal, and Ivo Pegrimek. Abstract: An easy, reliable, and inexpensive method, called the 'Express' method, was described to determine the residual gas capacity of deep mines using results from an air and gas balance. Air and gas balances are common elements of mine management and must be performed periodically. Using the process described here to obtain balance results,
Kinetics of the electrochemical process of galena electrodes in the diethyldithiocarbamate solution. CHENG Lift, SUN Tichang, LUO Xianping, and WANG Dianzuo. Abstract: The electrochemical process of galena in a pH 12.8 buffer solution was investigated using chronoamperometry and chronopotentiometry.
Sentence similarity computing plays an important role in machine question-answering systems, machine-translation systems, information retrieval and automatic abstracting systems. This article first summarizes several methods for calculating similarity between sentences, and then proposes a new method that takes into account critical words, semantic information, sentential form and sentence length. On this basis, an automatic abstracting system based on the LexRank algorithm is implemented. We made several improvements in both sentence weight computing and redundancy resolution. The system described in this article can handle single-document or multi-document summarization in both English and Chinese. In evaluations on two corpora, our system produced better summaries to a certain degree. We also show that our system is quite insensitive to the noise in the data that may result from an imperfect topical clustering of documents. Finally, existing problems and the developing trend of automatic summarization technology are discussed.
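The abstract above names LexRank but gives no implementation details, so the following is only a minimal sketch of the textbook thresholded variant: sentences become nodes, pairs whose bag-of-words cosine similarity reaches a threshold are linked, and a damped power iteration scores each node. The function names, the `threshold` and `damping` defaults, the whitespace tokenization and the exclusion of self-links are all assumptions for illustration; the article's own similarity measure (critical words, semantics, sentential form, length) is not reproduced here.

```python
import math
from collections import Counter


def cosine(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def lexrank(sentences, threshold=0.1, damping=0.85, iterations=50):
    """Score sentences by damped power iteration over a thresholded similarity graph."""
    n = len(sentences)
    if n == 0:
        return []
    bags = [Counter(s.lower().split()) for s in sentences]
    # Assumption: an unweighted edge links two different sentences whose
    # similarity reaches the threshold (self-links are left out).
    adj = [[1.0 if i != j and cosine(bags[i], bags[j]) >= threshold else 0.0
            for j in range(n)] for i in range(n)]
    degree = [sum(row) for row in adj]
    scores = [1.0 / n] * n
    for _ in range(iterations):
        new_scores = []
        for i in range(n):
            # Each neighbour j passes on an equal share of its current score.
            rank = sum(scores[j] / degree[j] for j in range(n) if adj[j][i] and degree[j])
            new_scores.append((1 - damping) / n + damping * rank)
        scores = new_scores
    return scores


example = [
    "the cat sat on the mat",
    "the cat ate the fish",
    "stock markets fell sharply today",
]
example_scores = lexrank(example)
```

In this toy input the two sentences about the cat link to each other and end up with equal, higher scores, while the unrelated third sentence keeps only the baseline score; a summarizer would then pick the top-scoring sentences and filter redundant ones.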
Abstract: Researchers around the world strive to communicate new knowledge, primarily via publication, with the abstract being crucial in conveying core insights. Previous research has generally analyzed the discourse features of abstracts from a macro perspective and has often used as research material either outdated texts, such as those over a decade old, or papers written by authors with lower English academic writing proficiency. In this study, we analyzed forty abstracts from leading journals in applied linguistics, evenly split between Chinese and international journals. The analysis revealed that the use of nominalization in abstracts by Chinese and international scholars showed similarities due to the universal academic requirement for conciseness. However, due to cultural and educational differences, the two groups differed in their language choices and nominalization usage. By analyzing the application of nominalization in different cultural contexts, our study offers practical suggestions for crafting abstracts that effectively convey information, thereby contributing to the broader academic community.
Abstract: Embodied cognition theories propose that language comprehension triggers a sensorimotor system in the brain. However, most previous research has focused on concrete and factual sentences, and little emphasis has been placed on abstract and counterfactual sentences. The primary challenges for embodied theories lie in elucidating the meanings of abstract and counterfactual sentences. The most prevalent explanation is that abstract and counterfactual sentences are grounded in the activation of a sensorimotor system, in exactly the same way as concrete and factual ones. The present research employed a dual-task experimental paradigm to investigate whether the embodied meaning is activated in comprehending action-related abstract Chinese counterfactual sentences, as indicated by the presence or absence of the action-sentence compatibility effect (ACE). Participants were instructed to read and listen to action-related abstract Chinese factual or counterfactual sentences describing an abstract transfer word towards or away from them, and then move their fingers towards or away from them to press the buttons in the same direction as the motion cue of the transfer verb. The action-sentence compatibility effect was observed in both abstract factual and counterfactual sentences, in line with the embodied cognition theories, which indicated that the embodied meanings were activated in both action-related abstract factuals and counterfactuals.
Abstract: The rhetorical structure of abstracts has been a widely discussed topic, as it can greatly enhance the abstract writing skills of second-language writers. This study aims to provide guidance on the syntactic features that L2 learners can employ, as well as suggest which features they should focus on in English academic writing. To achieve this, all samples were analyzed for rhetorical moves using Hyland’s five-rhetorical move model. Additionally, all sentences were evaluated for syntactic complexity, considering measures such as global, clausal and phrasal complexity. The findings reveal that expert writers exhibit a more balanced use of syntactic complexity across moves, effectively fulfilling the rhetorical objectives of abstracts. On the other hand, MA students tend to rely excessively on embedded structures and dependent clauses in an attempt to increase complexity. The implications of these findings for academic writing research, pedagogy, and assessment are thoroughly discussed.
Funding: supported by the National Social Science Foundation of China (2017CG29), the Science and Technology Research Project of Chongqing Municipal Education Commission (2019CJ50), and the Natural Science Foundation of Chongqing (2017CC29).
Abstract: Existing abstractive text summarisation models consider only the word sequence correlations between the source document and the reference summary, and the summaries they generate lack coverage of the source document's subject because of the models' narrow perspective. To address these shortcomings, a multi‐domain attention pointer (MDA‐Pointer) abstractive summarisation model is proposed in this work. First, the model uses bidirectional long short‐term memory to encode the word sequence and the sentence sequence of the source document separately, obtaining semantic representations at word and sentence level. Furthermore, a multi‐domain attention mechanism between the semantic representations and the summary word is established, so that the proposed model can generate summary words under this attention mechanism based on both the words and the sentences. Then, words are extracted from the vocabulary or from the original word sequences through the pointer network to form the summary, and the coverage mechanism is introduced at both word and sentence level to reduce the redundancy of summary content. Finally, experimental validation is conducted on the CNN/Daily Mail dataset. The ROUGE scores of the model improve both without and with the coverage mechanism, and the results verify the validity of the model proposed in this paper.
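The abstract does not state its equations, so as a point of reference only, the widely used single-level pointer-generator formulation with coverage can be written as below. Every symbol is an assumption introduced here for illustration: $a_i^t$ is the attention weight on source position $i$ at decoding step $t$, $p_{\text{gen}}$ the probability of generating from the vocabulary rather than copying, $c^t$ the coverage vector, and $\lambda$ a weighting hyperparameter. The MDA‐Pointer model applies attention and coverage at both word and sentence level, which this sketch does not capture.

```latex
\begin{align}
  % Final word distribution: mix of vocabulary generation and copying from the source
  P(w) &= p_{\text{gen}}\, P_{\text{vocab}}(w)
        + \bigl(1 - p_{\text{gen}}\bigr) \sum_{i \,:\, w_i = w} a_i^{t} \\
  % Coverage vector: attention accumulated over all previous decoding steps
  c^{t} &= \sum_{t'=0}^{t-1} a^{t'} \\
  % Coverage loss: penalises attending again to positions already covered
  \mathcal{L}_t &= -\log P\bigl(w_t^{*}\bigr)
        + \lambda \sum_{i} \min\bigl(a_i^{t},\, c_i^{t}\bigr)
\end{align}
```

The copy term lets the decoder emit source words that are outside the vocabulary, and the $\min$ term grows when attention revisits the same positions, which is how coverage reduces repeated content.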
Funding: supported by the National Natural Science Foundation of China (62276058, 61902057, 41774063), the Fundamental Research Funds for the Central Universities (N2217003), and the Joint Fund of Science & Technology Department of Liaoning Province and State Key Laboratory of Robotics, China (2020-KF-12-11).
Abstract: A large variety of complaint reports reflect subjective information expressed by citizens. A key challenge of text summarization for complaint reports is to ensure the factual consistency of the generated summary. Therefore, in this paper, a simple, weakly supervised framework that considers factual consistency is proposed to generate summaries of city-based complaint reports without pre-labeled sentences or words. Furthermore, it considers the importance of entities in complaint reports to ensure the factual consistency of the summary. Experimental results on customer review datasets (Yelp and Amazon) and a complaint report dataset (complaint reports of Shenyang in China) show that the proposed framework outperforms state-of-the-art approaches in ROUGE scores and human evaluation. This demonstrates the effectiveness of our approach in handling complaint reports.
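Several abstracts in this listing report ROUGE scores. As a minimal sketch of what the simplest member of that family measures, the function below computes ROUGE-1 (unigram overlap) precision, recall and F1 between a candidate summary and one reference. The function name, the lowercased whitespace tokenization and the example sentences are assumptions for illustration; published results normally come from a standard ROUGE toolkit with its own tokenization and stemming options.

```python
from collections import Counter


def rouge_1(candidate, reference):
    """Return (precision, recall, f1) of unigram overlap between two texts."""
    cand = Counter(candidate.lower().split())
    ref = Counter(reference.lower().split())
    # Counter intersection keeps, for each word, the smaller of the two counts.
    overlap = sum((cand & ref).values())
    precision = overlap / sum(cand.values()) if cand else 0.0
    recall = overlap / sum(ref.values()) if ref else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return precision, recall, f1


precision, recall, f1 = rouge_1(
    "the road is broken",
    "the road near the school is broken",
)
```

Here all four candidate words appear in the seven-word reference, so precision is 4/4 and recall is 4/7. Note that ROUGE measures lexical overlap only, which is why the abstract above pairs it with human evaluation when judging factual consistency.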
Abstract: This article examines the complex interplay between abstraction and representation in the ontology of images. Images inhabit an in-between space as tangible artifacts that also convey intangible ideas and meanings. The analysis synthesizes perspectives from across the history of philosophy to elucidate how images bridge abstraction and representation through their form and function. It engages with ongoing epistemological and aesthetic debates concerning the dual nature of images. Plato’s theory of ideal forms is outlined as an early attempt to define abstraction. Modern semiotic theories are discussed for their insights into how images create meaning through codes and signs. Phenomenology offers an alternative approach by prioritizing the sensorial, affective impact of images. Poststructuralism problematizes representation in the context of mechanical reproduction and simulacra. While diverse, these philosophical frameworks all grapple with the issues images pose between abstract essence and concrete appearance, conceptual ideas and sensory manifestations. The article reveals the richness of images as liminal constructs that collapse dualisms in their creative interfacing of material forms and immaterial meanings. It concludes that this ontological ambiguity empowers images as mediators between imagination and perception, subjectivity and reality.
Funding: funded by Vietnam National Foundation for Science and Technology Development (NAFOSTED) under Grant Number 102.05-2020.26.
Abstract: Text summarization aims to generate a concise version of the original text. The longer the summary is, the more detail it retains from the original text, and the appropriate length depends on the intended use. Therefore, generating summaries with desired lengths is a vital task for putting the research into practice. To solve this problem, in this paper, we propose a new method to integrate the desired length of the summarized text into the encoder-decoder model for the abstractive text summarization problem. This length parameter is integrated into the encoding phase at each self-attention step and into the decoding process by preserving the remaining length for calculating head attention in the generation process and using it as length embeddings added to the word embeddings. We conducted experiments for the proposed model on two data sets, Cable News Network (CNN) Daily and NEWSROOM, with different desired output lengths. The obtained results show the proposed model’s effectiveness compared with related studies.
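The abstract describes tracking the remaining length during decoding and adding a length embedding to the word embeddings, without giving formulas. The sketch below shows one plausible reading under stated assumptions: the remaining budget at step t is the desired length minus t (floored at zero), and that budget indexes a lookup table whose row is added elementwise to the word vector. The function names, the toy table and the clamping to the last table row are hypothetical; the paper's self-attention and head attention integration is not modelled here.

```python
def remaining_lengths(desired_length, steps):
    """Remaining token budget before each decoding step, floored at zero."""
    return [max(desired_length - t, 0) for t in range(steps)]


def add_length_embedding(word_vectors, desired_length, length_table):
    """Add the embedding of the remaining length to each step's word vector.

    length_table[r] is a (hypothetical) embedding for "r tokens left";
    budgets beyond the table size reuse its last row.
    """
    combined = []
    for t, vec in enumerate(word_vectors):
        remaining = max(desired_length - t, 0)
        length_vec = length_table[min(remaining, len(length_table) - 1)]
        combined.append([w + l for w, l in zip(vec, length_vec)])
    return combined


budget = remaining_lengths(3, 5)
mixed = add_length_embedding(
    [[1.0, 1.0], [1.0, 1.0]],   # two decoding steps, toy 2-d word vectors
    1,                          # desired summary length of one token
    [[0.0, 0.0], [0.5, -0.5]],  # toy table: row 0 = nothing left, row 1 = one left
)
```

Because the decoder sees a different length signal at every step, it can learn to wind the summary down as the budget approaches zero rather than being cut off after the fact.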
Funding: Educational Innovation Research Program of CSU (No. 2022jy032).
Abstract: The abstract is the epitome of the core idea of a journal paper. An excellent English abstract plays an important role in ensuring the quality of the paper and promoting its academic value in international exchanges. However, there are still many problems in the English abstracts of papers published in academic journals. This paper analyzes and summarizes grammatical errors involving articles, singular and plural nouns, predicate verbs and conjunctions, as well as Chinglish, found in the English abstracts of some papers in a vehicle engineering academic journal retrieved from CNCN.cn, and then corrects them. It is expected to provide some guidance for editors, academic workers, and engineering students in writing papers.
Funding: supported by the project “The demonstration system of rich semantic search application in scientific literature” (Grant No. 1734) from the Chinese Academy of Sciences.
Abstract: Purpose: Move recognition in scientific abstracts is an NLP task of classifying sentences of the abstracts into different types of language units. To improve the performance of move recognition in scientific abstracts, a novel model of move recognition is proposed that outperforms the BERT-based method. Design/methodology/approach: Prevalent models based on BERT for sentence classification often classify sentences without considering the context of the sentences. In this paper, inspired by the BERT masked language model (MLM), we propose a novel model called the masked sentence model that integrates the content and contextual information of the sentences in move recognition. Experiments are conducted on the benchmark dataset PubMed 20K RCT in three steps. Then, we compare our model with HSLN-RNN, BERT-based and SciBERT using the same dataset. Findings: Compared with the BERT-based and SciBERT models, the F1 score of our model is higher by 4.96% and 4.34%, respectively, which shows the feasibility and effectiveness of the novel model, and the result of our model comes closest to the current state-of-the-art results of HSLN-RNN. Research limitations: The sequential features of move labels are not considered, which might be one of the reasons why HSLN-RNN has better performance. Our model is restricted to dealing with biomedical English literature because we use a dataset from PubMed, which is a typical biomedical database, to fine-tune our model. Practical implications: The proposed model is better and simpler in identifying move structures in scientific abstracts and is worthy of text classification experiments for capturing contextual features of sentences. Originality/value: The study proposes a masked sentence model based on BERT that considers the contextual features of the sentences in abstracts in a new way. The performance of this classification model is significantly improved by rebuilding the input layer without changing the structure of neural networks.
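The abstract says the input layer is rebuilt so that a sentence is classified together with its context, but it does not spell out the input format. The sketch below shows one plausible construction, purely as an assumption: the target sentence is paired with the full abstract in which that sentence has been replaced by a mask token, so a classifier sees both the content and the surrounding context. The function name, the single `[MASK]` placeholder and the `[SEP]` separator layout are hypothetical and may differ from the paper's actual design.

```python
def build_masked_input(sentences, index, mask_token="[MASK]", sep_token="[SEP]"):
    """Pair the target sentence with its abstract, masking the target in the context."""
    target = sentences[index]
    # The context keeps every other sentence and hides the one being classified.
    context = [mask_token if i == index else s for i, s in enumerate(sentences)]
    return f"{target} {sep_token} {' '.join(context)}"


abstract_sentences = [
    "We study move recognition.",
    "A masked sentence model is proposed.",
    "It improves the F1 score.",
]
model_input = build_masked_input(abstract_sentences, 1)
```

One such input is built per sentence, so an abstract with n sentences yields n classification examples, each labelled with the move of its unmasked target sentence.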
Funding: This work is supported by the project “Research on Methods and Technologies of Scientific Researcher Entity Linking and Subject Indexing” (Grant No. G190091) from the National Science Library, Chinese Academy of Sciences, and the project “Design and Research on a Next Generation of Open Knowledge Services System and Key Technologies” (2019XM55).
Abstract: Purpose: Automatic keyphrase extraction (AKE) is an important task for grasping the main points of a text. In this paper, we aim to combine the benefits of the sequence labeling formulation and a pretrained language model to propose an automatic keyphrase extraction model for Chinese scientific research. Design/methodology/approach: We regard AKE from Chinese text as a character-level sequence labeling task to avoid the segmentation errors of Chinese tokenizers, and initialize our model with the pretrained language model BERT, which was released by Google in 2018. We collect data from the Chinese Science Citation Database and construct a large-scale dataset from the medical domain, which contains 100,000 abstracts as the training set, 6,000 abstracts as the development set and 3,094 abstracts as the test set. As baselines, we use unsupervised keyphrase extraction methods, including term frequency (TF), TF-IDF and TextRank, and supervised machine learning methods, including Conditional Random Field (CRF), Bidirectional Long Short Term Memory Network (BiLSTM) and BiLSTM-CRF. Experiments are designed to compare word-level and character-level sequence labeling approaches on supervised machine learning models and BERT-based models. Findings: Compared with character-level BiLSTM-CRF, the best baseline model with an F1 score of 50.16%, our character-level sequence labeling model based on BERT obtains an F1 score of 59.80%, an absolute improvement of 9.64%. Research limitations: We consider only the automatic keyphrase extraction task rather than the keyphrase generation task, so only keyphrases that occur in the given text can be extracted. In addition, our proposed dataset is not suitable for dealing with nested keyphrases. Practical implications: We make our character-level IOB format dataset of Chinese Automatic Keyphrase Extraction from scientific Chinese medical abstracts (CAKE) publicly available for the benefit of the research community, at: https://github.com/possible1402/Dataset-For-Chinese-Medical-Keyphrase-Extraction. Originality/value: By designing comparative experiments, our study demonstrates that the character-level formulation is more suitable for the Chinese automatic keyphrase extraction task under the general trend of pretrained language models. Our proposed dataset also provides a unified method for model evaluation and can promote the development of Chinese automatic keyphrase extraction to some extent.
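To make the character-level IOB formulation concrete, the following minimal sketch tags each character of a text as B (first character of a keyphrase), I (inside a keyphrase) or O (outside). The function name `char_iob_labels` and the longest-first matching rule are assumptions made for illustration; the abstract does not describe how the CAKE dataset was actually annotated.

```python
from typing import List


def char_iob_labels(text: str, keyphrases: List[str]) -> List[str]:
    """Return one IOB label per character of `text`.

    Longer keyphrases are matched first, and a span is labeled only if all of
    its characters are still untagged, so nested or overlapping keyphrases
    are not represented (in line with the limitation stated in the abstract).
    """
    labels = ["O"] * len(text)
    for phrase in sorted(keyphrases, key=len, reverse=True):
        if not phrase:
            continue
        start = text.find(phrase)
        while start != -1:
            end = start + len(phrase)
            if all(label == "O" for label in labels[start:end]):
                labels[start] = "B"
                for k in range(start + 1, end):
                    labels[k] = "I"
            start = text.find(phrase, start + 1)
    return labels


if __name__ == "__main__":
    sample = "肺癌的治疗"
    print(list(zip(sample, char_iob_labels(sample, ["肺癌", "治疗"]))))
```

Because the labels are assigned per character, no word segmentation step is needed, which is the motivation the abstract gives for the character-level formulation.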
Abstract: The Chinese Optics and Applied Optics Abstracts, sponsored by the Documentation and Information Center of the Chinese Academy of Sciences, the Optical Information Network of the Chinese Academy of Sciences and the Changchun Institute of Optics, Fine Mechanics and Physics of the Chinese Academy of Sciences, is one of the series of science and technology indexing periodicals published by the Chinese Academy of Sciences. The Chinese Optics and Applied Optics Abstracts started as a quarterly publication in 1985, under the name Chinese Science and Technology Document Catalogues: Optics and Applied Optics. It changed into a bimonthly publication under the name Chinese Optics and Applied Optics Abstracts in 1987. In combination with the Chinese Optics Documen-
Funding: Supported by the ‘Qinglan’ Talent Engineering Funds and the Key Subject of Inorganic Chemistry of Tianshui Normal University.
Abstract: The excited-state intramolecular hydrogen abstraction reactions of butanal have been investigated using the CAS-MP2/6-311+G*//CASSCF/6-31G* methods. The calculated results show that hydrogen transfer induces fluorescence quenching of the n,π* excited state of covalent butanal along three paths: (1) The first path corresponds to direct S0-react reconstitution, which involves the first S1 decay by partial hydrogen atom transfer. (2) The second, stepwise mechanism can be viewed as a full hydrogen atom transfer followed by a partial hydrogen atom back transfer, electron transfer (near S1/S0 or S0-TS) and finally a proton transfer to S0-react. (3) On the triplet surface, the surface crossing to the singlet state would clearly be much more efficient in the T1/S0 region owing to the large SOC value of 8.3 cm⁻¹. The S0-react decay route from T1/S0 was studied with an intrinsic reaction coordinate (IRC) calculation at the CASSCF level, resulting in the S0-react minimum.
Abstract: Prognosis of residual coal gas capacity made by the 'Express' method (Pavel Prokop, Pavel Zapletal, and Ivo Pegrimek). An easy, reliable, and inexpensive method, called the 'Express' method, was described to determine the residual gas capacity of deep mines using results from an air and gas balance. Air and gas balances are common elements of mine management and must be performed periodically. Using the process described here to obtain balance results,
Abstract: Kinetics of the electrochemical process of galena electrodes in the diethyldithiocarbamate solution (CHENG Lift, SUN Tichang, LUO Xianping, and WANG Dianzuo). The electrochemical process of galena in a pH 12.8 buffer solution was investigated using chronoamperometry and chronopotentiometry.
Abstract: Sentence similarity computing plays an important role in machine question-answering systems, machine-translation systems, information retrieval and automatic abstracting systems. This article first summarizes several methods for calculating the similarity between sentences and then proposes a new method that takes several factors into consideration, including critical words, semantic information, sentential form and sentence length. On this basis, an automatic abstracting system based on the LexRank algorithm is implemented. We made several improvements in both sentence weight computing and redundancy resolution. The system described in this article can handle single-document and multi-document summarization in both English and Chinese. In evaluations on two corpora, our system produced better summaries to a certain degree. We also show that our system is quite insensitive to the noise in the data that may result from an imperfect topical clustering of documents. Finally, existing problems and the development trend of automatic summarization technology are discussed.
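The LexRank step mentioned in the abstract above can be sketched as follows. This is a simplified illustration under stated assumptions, not the authors' system: sentence similarity is a plain cosine over whitespace-split word counts (a stand-in for the article's combined measure of critical words, semantics, sentential form and length), self-links are excluded, and the `threshold`, `damping` and `iters` defaults are arbitrary choices.

```python
import math
from collections import Counter
from typing import List


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two bag-of-words vectors."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def lexrank(sentences: List[str], threshold: float = 0.1,
            damping: float = 0.85, iters: int = 50) -> List[float]:
    """Score sentences by PageRank-style centrality on a similarity graph."""
    n = len(sentences)
    if n == 0:
        return []
    bags = [Counter(s.lower().split()) for s in sentences]
    # Unweighted graph: link two distinct sentences if they are similar enough.
    adj = [[1.0 if i != j and cosine(bags[i], bags[j]) >= threshold else 0.0
            for j in range(n)] for i in range(n)]
    degree = [sum(row) for row in adj]
    scores = [1.0 / n] * n
    for _ in range(iters):
        new_scores = []
        for i in range(n):
            # Each neighbor j passes on an equal share of its current score.
            received = sum(scores[j] / degree[j]
                           for j in range(n) if adj[j][i] and degree[j])
            new_scores.append((1.0 - damping) / n + damping * received)
        scores = new_scores
    return scores


if __name__ == "__main__":
    docs = ["the cat sat on the mat",
            "the cat lay on the mat",
            "stock prices fell sharply today"]
    for sentence, score in zip(docs, lexrank(docs)):
        print(f"{score:.3f}  {sentence}")
```

A summary would then be built from the highest-scoring sentences, with a separate redundancy check to skip near-duplicates; that step is omitted here.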