Purpose:Researchers frequently encounter the following problems when writing scientific articles:(1)Selecting appropriate citations to support the research idea is challenging.(2)The literature review is not conducted...Purpose:Researchers frequently encounter the following problems when writing scientific articles:(1)Selecting appropriate citations to support the research idea is challenging.(2)The literature review is not conducted extensively,which leads to working on a research problem that others have well addressed.The study focuses on citation recommendation in the related studies section by applying the term function of a citation context,potentially improving the efficiency of writing a literature review.Design/methodology/approach:We present nine term functions with three newly created and six identified from existing literature.Using these term functions as labels,we annotate 531 research papers in three topics to evaluate our proposed recommendation strategy.BM25 and Word2vec with VSM are implemented as the baseline models for the recommendation.Then the term function information is applied to enhance the performance.Findings:The experiments show that the term function-based methods outperform the baseline methods regarding the recall,precision,and F1-score measurement,demonstrating that term functions are useful in identifying valuable citations.Research limitations:The dataset is insufficient due to the complexity of annotating citation functions for paragraphs in the related studies section.More recent deep learning models should be performed to future validate the proposed approach.Practical implications:The citation recommendation strategy can be helpful for valuable citation discovery,semantic scientific retrieval,and automatic literature review generation.Originality/value:The proposed citation function-based citation recommendation can generate intuitive explanations of the results for users,improving the transparency,persuasiveness,and effectiveness of recommender systems.展开更多
Knowledge Bases (KBs) are valuable resources of human knowledge which contribute to many applications. However, since they are manually maintained, there is a big lag between their contents and the upto-date informa...Knowledge Bases (KBs) are valuable resources of human knowledge which contribute to many applications. However, since they are manually maintained, there is a big lag between their contents and the upto-date information of entities. Considering a target entity in KBs, this paper investigates how Cumulative Citation Recommendation (CCR) can be used to effectively detect its worthy-citation documents in large volumes of stream data. Most global relevant models only consider semantic and temporat features of entity-document instances, which does not sufficiently exploit prior knowledge underlying entity-document instances. To tackle this problem, we present a Mixture of Experts (ME) model by introducing a latent layer to capture relationships between the entity-document instances and their latent class information. An extensive set of experiments was conducted on TREC-KBA-2013 dataset. The results show that the model can significantly achieve a better performance gain compared to state-of-the-art models in CCR.展开更多
The rapid growth of scientific papers makes it difficult to query related papers efficiently,accurately and with high coverage.Traditional citation recommendation algorithms rely heavily on the metadata of query docum...The rapid growth of scientific papers makes it difficult to query related papers efficiently,accurately and with high coverage.Traditional citation recommendation algorithms rely heavily on the metadata of query documents,which leads to the low quality of recommendation results.In this paper,DeepCite,a content-based hybrid neural network citation recommendation method is proposed.First,the BERT model was used to extract the high-level semantic representation vectors in the text,then the multi-scale CNN model and BiLSTM model were used to obtain the local information and the sequence information of the context in the sentence,and the text vectors were matched in depth to generate candidate sets.Further,the depth neural network was used to rerank the candidate sets by combining the score of candidate sets and multisource features.In the reranking stage,a variety of Metapath features were extracted from the citation network,and added to the deep neural network to learn,and the ranking of recommendation results were optimized.Compared with PWFC,ClusCite,BM25,RW,NNRank models,the results of the Deepcite algorithm presented in the ANN datasets show that the precision(P@20),recall rate(R@20),MRR and MAP indexesrise by 2.3%,3.9%,2.4%and 2.1%respectively.Experimental results on DBLP datasets show that the improvement is 2.4%,4.3%,1.8%and 1.2%respectively.Therefore,the algorithm proposed in this paper effectively improves the quality of citation recommendation.展开更多
基金This work is supported by the National Natural Science Foundation of China(Grant No.7167030644 and 71704137)。
文摘Purpose:Researchers frequently encounter the following problems when writing scientific articles:(1)Selecting appropriate citations to support the research idea is challenging.(2)The literature review is not conducted extensively,which leads to working on a research problem that others have well addressed.The study focuses on citation recommendation in the related studies section by applying the term function of a citation context,potentially improving the efficiency of writing a literature review.Design/methodology/approach:We present nine term functions with three newly created and six identified from existing literature.Using these term functions as labels,we annotate 531 research papers in three topics to evaluate our proposed recommendation strategy.BM25 and Word2vec with VSM are implemented as the baseline models for the recommendation.Then the term function information is applied to enhance the performance.Findings:The experiments show that the term function-based methods outperform the baseline methods regarding the recall,precision,and F1-score measurement,demonstrating that term functions are useful in identifying valuable citations.Research limitations:The dataset is insufficient due to the complexity of annotating citation functions for paragraphs in the related studies section.More recent deep learning models should be performed to future validate the proposed approach.Practical implications:The citation recommendation strategy can be helpful for valuable citation discovery,semantic scientific retrieval,and automatic literature review generation.Originality/value:The proposed citation function-based citation recommendation can generate intuitive explanations of the results for users,improving the transparency,persuasiveness,and effectiveness of recommender systems.
基金supported by the National Key Research and Development Program of China(No.2016YFB1000902)the National Natural Science Foundation of China(Nos.61472040,61751217,and 61866038)+1 种基金Natural Science Basic Research Plan in Shaanxi Province of China(No.2016JM6082)PhD start project of Yan’an University(No.YDBK2018-09)
文摘Knowledge Bases (KBs) are valuable resources of human knowledge which contribute to many applications. However, since they are manually maintained, there is a big lag between their contents and the upto-date information of entities. Considering a target entity in KBs, this paper investigates how Cumulative Citation Recommendation (CCR) can be used to effectively detect its worthy-citation documents in large volumes of stream data. Most global relevant models only consider semantic and temporat features of entity-document instances, which does not sufficiently exploit prior knowledge underlying entity-document instances. To tackle this problem, we present a Mixture of Experts (ME) model by introducing a latent layer to capture relationships between the entity-document instances and their latent class information. An extensive set of experiments was conducted on TREC-KBA-2013 dataset. The results show that the model can significantly achieve a better performance gain compared to state-of-the-art models in CCR.
基金“Shenzhen Science and Technology Project”(JCYJ20180306170836595)“National key research and development program in China”(2019YFB2102300)+4 种基金“the World-Class Universities(Disciplines)and the Characteristic Development Guidance Funds for the Central Universities of China”(PY3A022)“Ministry of Education Fund Projects”(No.18JZD022 and 2017B00030)“Basic Scientific Research Operating Expenses of Central Universities”(No.ZDYF2017006)“Xi’an Navinfo Corp.&Engineering Center of Xi’an Intelligence Spatial-temporal Data Analysis Project”(C2020103)“Beilin District of Xi’an Science&Technology Project”(GX1803).
文摘The rapid growth of scientific papers makes it difficult to query related papers efficiently,accurately and with high coverage.Traditional citation recommendation algorithms rely heavily on the metadata of query documents,which leads to the low quality of recommendation results.In this paper,DeepCite,a content-based hybrid neural network citation recommendation method is proposed.First,the BERT model was used to extract the high-level semantic representation vectors in the text,then the multi-scale CNN model and BiLSTM model were used to obtain the local information and the sequence information of the context in the sentence,and the text vectors were matched in depth to generate candidate sets.Further,the depth neural network was used to rerank the candidate sets by combining the score of candidate sets and multisource features.In the reranking stage,a variety of Metapath features were extracted from the citation network,and added to the deep neural network to learn,and the ranking of recommendation results were optimized.Compared with PWFC,ClusCite,BM25,RW,NNRank models,the results of the Deepcite algorithm presented in the ANN datasets show that the precision(P@20),recall rate(R@20),MRR and MAP indexesrise by 2.3%,3.9%,2.4%and 2.1%respectively.Experimental results on DBLP datasets show that the improvement is 2.4%,4.3%,1.8%and 1.2%respectively.Therefore,the algorithm proposed in this paper effectively improves the quality of citation recommendation.