Purpose:This study attempts to disclose the characteristics of knowledge integration in an interdisciplinary field by looking into the content aspect of knowledge.Design/methodology/approach:The eHealth field was chos...Purpose:This study attempts to disclose the characteristics of knowledge integration in an interdisciplinary field by looking into the content aspect of knowledge.Design/methodology/approach:The eHealth field was chosen in the case study.Associated knowledge phrases(AKPs)that are shared between citing papers and their references were extracted from the citation contexts of the eHealth papers by applying a stem-matching method.A classification schema that considers the functions of knowledge in the domain was proposed to categorize the identified AKPs.The source disciplines of each knowledge type were analyzed.Quantitative indicators and a co-occurrence analysis were applied to disclose the integration patterns of different knowledge types.Findings:The annotated AKPs evidence the major disciplines supplying each type of knowledge.Different knowledge types have remarkably different integration patterns in terms of knowledge amount,the breadth of source disciplines,and the integration time lag.We also find several frequent co-occurrence patterns of different knowledge types.Research limitations:The collected articles of the field are limited to the two leading open access journals.The stem-matching method to extract AKPs could not identify those phrases with the same meaning but expressed in words with different stems.The type of Research Subject dominates the recognized AKPs,which calls on an improvement of the classification schema for better knowledge integration analysis on knowledge units.Practical implications:The methodology proposed in this paper sheds new light on knowledge integration characteristics of an interdisciplinary field from the content perspective.The findings have practical implications on the future development of research strategies in eHealth and the policies about interdisciplinary research.Originality/value:This study proposed a new methodology to explore the content characteristics of knowledge integration in an interdisciplinary field.展开更多
Purpose:Researchers frequently encounter the following problems when writing scientific articles:(1)Selecting appropriate citations to support the research idea is challenging.(2)The literature review is not conducted...Purpose:Researchers frequently encounter the following problems when writing scientific articles:(1)Selecting appropriate citations to support the research idea is challenging.(2)The literature review is not conducted extensively,which leads to working on a research problem that others have well addressed.The study focuses on citation recommendation in the related studies section by applying the term function of a citation context,potentially improving the efficiency of writing a literature review.Design/methodology/approach:We present nine term functions with three newly created and six identified from existing literature.Using these term functions as labels,we annotate 531 research papers in three topics to evaluate our proposed recommendation strategy.BM25 and Word2vec with VSM are implemented as the baseline models for the recommendation.Then the term function information is applied to enhance the performance.Findings:The experiments show that the term function-based methods outperform the baseline methods regarding the recall,precision,and F1-score measurement,demonstrating that term functions are useful in identifying valuable citations.Research limitations:The dataset is insufficient due to the complexity of annotating citation functions for paragraphs in the related studies section.More recent deep learning models should be performed to future validate the proposed approach.Practical implications:The citation recommendation strategy can be helpful for valuable citation discovery,semantic scientific retrieval,and automatic literature review generation.Originality/value:The proposed citation function-based citation recommendation can generate intuitive explanations of the results for users,improving the transparency,persuasiveness,and effectiveness of recommender systems.展开更多
Knowledge base acceleration-cumulative citation recommendation(KBA-CCR)aims to detect citation-worthiness documents from a chronological stream corpus for a set of target entities in a knowledge base.Most previous wor...Knowledge base acceleration-cumulative citation recommendation(KBA-CCR)aims to detect citation-worthiness documents from a chronological stream corpus for a set of target entities in a knowledge base.Most previous works only consider a number of semantic features between documents and target entities in the knowledge base,and then use powerful machine learning approaches such as logistic regression to classify relevant documents and non-relevant documents.However,the burst activities of an entity have been proved to be a significant signal to predict potential citations.In this paper,an entity burst discriminative model(EBDM)is presented to substantially exploit such burst features.The EBDM presents a new temporal representation based on the burst features,which can capture both temporal and semantic correlations between entities and documents.Meanwhile,in contrast to the bag-of-words model,the EBDM can significantly decrease the number of non-zero entries of feature vectors.An extensive set of experiments were conducted on the TREC-KBA-2012 dataset.The results show that the EBDM outperforms the performance of the state-of-the-art models.展开更多
In the tag recommendation task on academic platforms,existing methods disregard users’customized preferences in favor of extracting tags based just on the content of the articles.Besides,it uses co-occurrence techniq...In the tag recommendation task on academic platforms,existing methods disregard users’customized preferences in favor of extracting tags based just on the content of the articles.Besides,it uses co-occurrence techniques and tries to combine nodes’textual content for modelling.They still do not,however,directly simulate many interactions in network learning.In order to address these issues,we present a novel system that more thoroughly integrates user preferences and citation networks into article labelling recommendations.Specifically,we first employ path similarity to quantify the degree of similarity between user labelling preferences and articles in the citation network.Then,the Commuting Matrix for massive node pair paths is used to improve computational performance.Finally,the two commonalities mentioned above are combined with the interaction paper labels based on the additivity of Poisson distribution.In addition,we also consider solving the model’s parameters by applying variational inference.Experimental results demonstrate that our suggested framework agrees and significantly outperforms the state-of-the-art baseline on two real datasets by efficiently merging the three relational data.Based on the Area Under Curve(AUC)and Mean Average Precision(MAP)analysis,the performance of the suggested task is evaluated,and it is demonstrated to have a greater solving efficiency than current techniques.展开更多
Knowledge Bases (KBs) are valuable resources of human knowledge which contribute to many applications. However, since they are manually maintained, there is a big lag between their contents and the upto-date informa...Knowledge Bases (KBs) are valuable resources of human knowledge which contribute to many applications. However, since they are manually maintained, there is a big lag between their contents and the upto-date information of entities. Considering a target entity in KBs, this paper investigates how Cumulative Citation Recommendation (CCR) can be used to effectively detect its worthy-citation documents in large volumes of stream data. Most global relevant models only consider semantic and temporat features of entity-document instances, which does not sufficiently exploit prior knowledge underlying entity-document instances. To tackle this problem, we present a Mixture of Experts (ME) model by introducing a latent layer to capture relationships between the entity-document instances and their latent class information. An extensive set of experiments was conducted on TREC-KBA-2013 dataset. The results show that the model can significantly achieve a better performance gain compared to state-of-the-art models in CCR.展开更多
Citation Context Analysis(CCA)is a typical data-driven research field based on full-text information,which breaks the limitations of traditional citation analysis using only bibliographic data,and benefits further stu...Citation Context Analysis(CCA)is a typical data-driven research field based on full-text information,which breaks the limitations of traditional citation analysis using only bibliographic data,and benefits further studies on various citation behaviors and other core issues behind them,such as citation motivation,citation function and citation sentiment.Corpus for CCA is the most important guarantee and support for these issues.This paper attempts to discuss the corpus construction and mining for CCA in order to comprehensively review the research significance,research status and existing deficiencies in this area.Two main sections in our paper are:1)corpus construction for CCA,its three building tasks,such as citation sentence extraction,citation-reference mapping and citation context extraction,are discussed;2)corpus mining and utilization for CCA,following related topics or situations are explored,including classification of citation motivation(or behavior)and citation sentiment,indexing and retrieval based on citation,citation recommendation and evaluation,citation-based abstracting and review generation automatically,and domains knowledge metrics.Finally,some suggestions and future research directions are briefly listed.展开更多
The rapid growth of scientific papers makes it difficult to query related papers efficiently,accurately and with high coverage.Traditional citation recommendation algorithms rely heavily on the metadata of query docum...The rapid growth of scientific papers makes it difficult to query related papers efficiently,accurately and with high coverage.Traditional citation recommendation algorithms rely heavily on the metadata of query documents,which leads to the low quality of recommendation results.In this paper,DeepCite,a content-based hybrid neural network citation recommendation method is proposed.First,the BERT model was used to extract the high-level semantic representation vectors in the text,then the multi-scale CNN model and BiLSTM model were used to obtain the local information and the sequence information of the context in the sentence,and the text vectors were matched in depth to generate candidate sets.Further,the depth neural network was used to rerank the candidate sets by combining the score of candidate sets and multisource features.In the reranking stage,a variety of Metapath features were extracted from the citation network,and added to the deep neural network to learn,and the ranking of recommendation results were optimized.Compared with PWFC,ClusCite,BM25,RW,NNRank models,the results of the Deepcite algorithm presented in the ANN datasets show that the precision(P@20),recall rate(R@20),MRR and MAP indexesrise by 2.3%,3.9%,2.4%and 2.1%respectively.Experimental results on DBLP datasets show that the improvement is 2.4%,4.3%,1.8%and 1.2%respectively.Therefore,the algorithm proposed in this paper effectively improves the quality of citation recommendation.展开更多
Citation network is often used for academic recommendation. However, it is difficult to achieve high recommendation accuracy and low time complexity because it is often very large and sparse and different citations ha...Citation network is often used for academic recommendation. However, it is difficult to achieve high recommendation accuracy and low time complexity because it is often very large and sparse and different citations have different purposes. What's more, some citations include unreasonable information, such as in case of intentional self-citation. To improve the accuracy of citation network-based academic recommendation and reduce the time complexity, we propose an academic recommendation method for recommending authors and papers. In which, an author-paper bilayer citation network is built, then an enhanced topic model, Author Community Topic Time Model(ACTTM) is proposed to detect high quality author communities in the author layer, and a set of attributes are proposed to comprehensively depict the author/paper nodes in the bilayer citation network. Experimental results prove that the proposed ACTTM can detect high quality author communities and facilitate low time complexity, and the proposed academic recommendation method can effectively improve the recommendation accuracy.展开更多
Today, recommendation systems are everywhere, making a variety of activities considerably more manageable. These systems help users by personalizing their suggestions to their interests and needs. They can propose var...Today, recommendation systems are everywhere, making a variety of activities considerably more manageable. These systems help users by personalizing their suggestions to their interests and needs. They can propose various goods, including music, courses, articles, agricultural products, fertilizers, books, movies, and foods. In the case of research articles, recommendation algorithms play an essential role in minimizing the time required for researchers to find relevant articles. Despite multiple challenges, these systems must solve serious issues such as the cold-start problem, article privacy, and changing user interests. This research addresses these issues through the use of two techniques: hybrid recommendation systems and COOT optimization. To generate article recommendations, a hybrid recommendation system integrates features from content-based and graph-based recommendation systems. COOT optimization is used to optimize the results, inspired by the movement of water birds. The proposed method combines a graph-based recommendation system with COOT optimization to increase accuracy and reduce result inaccuracies. When compared to the baseline approaches described, the model provided in this study improves precision by 2.3%, recall by 1.6%, and mean reciprocal rank (MRR) by 5.7%.展开更多
Discovering publication hierarchically-ordered contexts is the main task in context-based searching paradigm. The proposed techniques to discover publication contexts relies on the availability of domain-specific inpu...Discovering publication hierarchically-ordered contexts is the main task in context-based searching paradigm. The proposed techniques to discover publication contexts relies on the availability of domain-specific inputs, namely a pre-specified ontology terms. A problem with this technique is that the needed domain-specific inputs may not be available in some scientific disciplines. In this paper, we propose utilizing a powerful input that is naturally available in any scientific discipline to discover the hierarchically-ordered contexts of it, namely paper citation and co-authorship graphs. More specifically, we propose a set of domain-specific bibliometry-aware features that are automatically computable instead of domain-specific inputs that need experts’ efforts to prepare. Another benefit behind considering bibliometric-features to adapt to the special characteristics of the literature environment being targeted, which in turn facilitates contexts membership decision making. One key advantage of our proposal is that it considers temporal changes of the targeted publication set.展开更多
Over the years, there has been increasing growth in academic digital libraries. It has therefore become overwhelming for researchers to determine important research materials. In most existing research works that cons...Over the years, there has been increasing growth in academic digital libraries. It has therefore become overwhelming for researchers to determine important research materials. In most existing research works that consider scholarly paper recommendation, the researcher’s preference is left out. In this paper, therefore, Frequent Pattern (FP) Growth Algorithm is employed on potential papers generated from the researcher’s preferences to create a list of ranked papers based on citation features. The purpose is to provide a recommender system that is user oriented. A walk through algorithm is implemented to generate all possible frequent patterns from the FP-tree after which an output of ordered recommended papers combining subjective and objective factors of the researchers is produced. Experimental results with a scholarly paper recommendation dataset show that the proposed method is very promising, as it outperforms recommendation baselines as measured with nDCG and MRR.展开更多
We search a variety of things over the Internet in our daily lives, and numerous search engines are available to get us more relevant results. With the rapid technological advancement, the internet has become a major ...We search a variety of things over the Internet in our daily lives, and numerous search engines are available to get us more relevant results. With the rapid technological advancement, the internet has become a major source of obtaining information. Further, the advent of the Web2.0 era has led to an increased interaction between the user and the website. It has become challenging to provide information to users as per their interests. Because of copyright restrictions, most of existing research studies are confronting the lack of availability of the content of candidates recommending articles. The content of such articles is not always available freely and hence leads to inadequate recommendation results. Moreover, various research studies base recommendation on user profiles. Therefore, their recommendation needs a significant number of registered users in the system. In recent years, research work proves that Knowledge graphs have yielded better in generating quality recommendation results and alleviating sparsity and cold start issues. Network embedding techniques try to learn high quality feature vectors automatically from network structures, enabling vector-based measurers of node relatedness. Keeping the strength of Network embedding techniques, the proposed citation-based recommendation approach makes use of heterogeneous network embedding in generating recommendation results. The novelty of this paper is in exploiting the performance of a network embedding approach i.e., matapath2vec to generate paper recommendations. Unlike existing approaches, the proposed method has the capability of learning low-dimensional latent representation of nodes (i.e., research papers) in a network. We apply metapath2vec on a knowledge network built by the ACL Anthology Network (all about NLP) and use the node relatedness to generate item (research article) recommendations.展开更多
基金This study was funded by the National Social Science Foundation of China with Grant No.20CTQ024.
文摘Purpose:This study attempts to disclose the characteristics of knowledge integration in an interdisciplinary field by looking into the content aspect of knowledge.Design/methodology/approach:The eHealth field was chosen in the case study.Associated knowledge phrases(AKPs)that are shared between citing papers and their references were extracted from the citation contexts of the eHealth papers by applying a stem-matching method.A classification schema that considers the functions of knowledge in the domain was proposed to categorize the identified AKPs.The source disciplines of each knowledge type were analyzed.Quantitative indicators and a co-occurrence analysis were applied to disclose the integration patterns of different knowledge types.Findings:The annotated AKPs evidence the major disciplines supplying each type of knowledge.Different knowledge types have remarkably different integration patterns in terms of knowledge amount,the breadth of source disciplines,and the integration time lag.We also find several frequent co-occurrence patterns of different knowledge types.Research limitations:The collected articles of the field are limited to the two leading open access journals.The stem-matching method to extract AKPs could not identify those phrases with the same meaning but expressed in words with different stems.The type of Research Subject dominates the recognized AKPs,which calls on an improvement of the classification schema for better knowledge integration analysis on knowledge units.Practical implications:The methodology proposed in this paper sheds new light on knowledge integration characteristics of an interdisciplinary field from the content perspective.The findings have practical implications on the future development of research strategies in eHealth and the policies about interdisciplinary research.Originality/value:This study proposed a new methodology to explore the content characteristics of knowledge integration in an interdisciplinary field.
基金This work is supported by the National Natural Science Foundation of China(Grant No.7167030644 and 71704137)。
文摘Purpose:Researchers frequently encounter the following problems when writing scientific articles:(1)Selecting appropriate citations to support the research idea is challenging.(2)The literature review is not conducted extensively,which leads to working on a research problem that others have well addressed.The study focuses on citation recommendation in the related studies section by applying the term function of a citation context,potentially improving the efficiency of writing a literature review.Design/methodology/approach:We present nine term functions with three newly created and six identified from existing literature.Using these term functions as labels,we annotate 531 research papers in three topics to evaluate our proposed recommendation strategy.BM25 and Word2vec with VSM are implemented as the baseline models for the recommendation.Then the term function information is applied to enhance the performance.Findings:The experiments show that the term function-based methods outperform the baseline methods regarding the recall,precision,and F1-score measurement,demonstrating that term functions are useful in identifying valuable citations.Research limitations:The dataset is insufficient due to the complexity of annotating citation functions for paragraphs in the related studies section.More recent deep learning models should be performed to future validate the proposed approach.Practical implications:The citation recommendation strategy can be helpful for valuable citation discovery,semantic scientific retrieval,and automatic literature review generation.Originality/value:The proposed citation function-based citation recommendation can generate intuitive explanations of the results for users,improving the transparency,persuasiveness,and effectiveness of recommender systems.
基金Supported by the National Natural Science Foundation of China(61866038,61751217)Special Research Project of Shaanxi Education Department of China(18JK0876)Ph.D.Start Project of Yan’an University(YDBK2018-09)
文摘Knowledge base acceleration-cumulative citation recommendation(KBA-CCR)aims to detect citation-worthiness documents from a chronological stream corpus for a set of target entities in a knowledge base.Most previous works only consider a number of semantic features between documents and target entities in the knowledge base,and then use powerful machine learning approaches such as logistic regression to classify relevant documents and non-relevant documents.However,the burst activities of an entity have been proved to be a significant signal to predict potential citations.In this paper,an entity burst discriminative model(EBDM)is presented to substantially exploit such burst features.The EBDM presents a new temporal representation based on the burst features,which can capture both temporal and semantic correlations between entities and documents.Meanwhile,in contrast to the bag-of-words model,the EBDM can significantly decrease the number of non-zero entries of feature vectors.An extensive set of experiments were conducted on the TREC-KBA-2012 dataset.The results show that the EBDM outperforms the performance of the state-of-the-art models.
基金supported by the National Natural Science Foundation of China(No.62271274).
文摘In the tag recommendation task on academic platforms,existing methods disregard users’customized preferences in favor of extracting tags based just on the content of the articles.Besides,it uses co-occurrence techniques and tries to combine nodes’textual content for modelling.They still do not,however,directly simulate many interactions in network learning.In order to address these issues,we present a novel system that more thoroughly integrates user preferences and citation networks into article labelling recommendations.Specifically,we first employ path similarity to quantify the degree of similarity between user labelling preferences and articles in the citation network.Then,the Commuting Matrix for massive node pair paths is used to improve computational performance.Finally,the two commonalities mentioned above are combined with the interaction paper labels based on the additivity of Poisson distribution.In addition,we also consider solving the model’s parameters by applying variational inference.Experimental results demonstrate that our suggested framework agrees and significantly outperforms the state-of-the-art baseline on two real datasets by efficiently merging the three relational data.Based on the Area Under Curve(AUC)and Mean Average Precision(MAP)analysis,the performance of the suggested task is evaluated,and it is demonstrated to have a greater solving efficiency than current techniques.
基金supported by the National Key Research and Development Program of China(No.2016YFB1000902)the National Natural Science Foundation of China(Nos.61472040,61751217,and 61866038)+1 种基金Natural Science Basic Research Plan in Shaanxi Province of China(No.2016JM6082)PhD start project of Yan’an University(No.YDBK2018-09)
文摘Knowledge Bases (KBs) are valuable resources of human knowledge which contribute to many applications. However, since they are manually maintained, there is a big lag between their contents and the upto-date information of entities. Considering a target entity in KBs, this paper investigates how Cumulative Citation Recommendation (CCR) can be used to effectively detect its worthy-citation documents in large volumes of stream data. Most global relevant models only consider semantic and temporat features of entity-document instances, which does not sufficiently exploit prior knowledge underlying entity-document instances. To tackle this problem, we present a Mixture of Experts (ME) model by introducing a latent layer to capture relationships between the entity-document instances and their latent class information. An extensive set of experiments was conducted on TREC-KBA-2013 dataset. The results show that the model can significantly achieve a better performance gain compared to state-of-the-art models in CCR.
文摘Citation Context Analysis(CCA)is a typical data-driven research field based on full-text information,which breaks the limitations of traditional citation analysis using only bibliographic data,and benefits further studies on various citation behaviors and other core issues behind them,such as citation motivation,citation function and citation sentiment.Corpus for CCA is the most important guarantee and support for these issues.This paper attempts to discuss the corpus construction and mining for CCA in order to comprehensively review the research significance,research status and existing deficiencies in this area.Two main sections in our paper are:1)corpus construction for CCA,its three building tasks,such as citation sentence extraction,citation-reference mapping and citation context extraction,are discussed;2)corpus mining and utilization for CCA,following related topics or situations are explored,including classification of citation motivation(or behavior)and citation sentiment,indexing and retrieval based on citation,citation recommendation and evaluation,citation-based abstracting and review generation automatically,and domains knowledge metrics.Finally,some suggestions and future research directions are briefly listed.
基金“Shenzhen Science and Technology Project”(JCYJ20180306170836595)“National key research and development program in China”(2019YFB2102300)+4 种基金“the World-Class Universities(Disciplines)and the Characteristic Development Guidance Funds for the Central Universities of China”(PY3A022)“Ministry of Education Fund Projects”(No.18JZD022 and 2017B00030)“Basic Scientific Research Operating Expenses of Central Universities”(No.ZDYF2017006)“Xi’an Navinfo Corp.&Engineering Center of Xi’an Intelligence Spatial-temporal Data Analysis Project”(C2020103)“Beilin District of Xi’an Science&Technology Project”(GX1803).
文摘The rapid growth of scientific papers makes it difficult to query related papers efficiently,accurately and with high coverage.Traditional citation recommendation algorithms rely heavily on the metadata of query documents,which leads to the low quality of recommendation results.In this paper,DeepCite,a content-based hybrid neural network citation recommendation method is proposed.First,the BERT model was used to extract the high-level semantic representation vectors in the text,then the multi-scale CNN model and BiLSTM model were used to obtain the local information and the sequence information of the context in the sentence,and the text vectors were matched in depth to generate candidate sets.Further,the depth neural network was used to rerank the candidate sets by combining the score of candidate sets and multisource features.In the reranking stage,a variety of Metapath features were extracted from the citation network,and added to the deep neural network to learn,and the ranking of recommendation results were optimized.Compared with PWFC,ClusCite,BM25,RW,NNRank models,the results of the Deepcite algorithm presented in the ANN datasets show that the precision(P@20),recall rate(R@20),MRR and MAP indexesrise by 2.3%,3.9%,2.4%and 2.1%respectively.Experimental results on DBLP datasets show that the improvement is 2.4%,4.3%,1.8%and 1.2%respectively.Therefore,the algorithm proposed in this paper effectively improves the quality of citation recommendation.
基金supported by the grants from Natural Science Foundation of China (Project No.61471060)
文摘Citation network is often used for academic recommendation. However, it is difficult to achieve high recommendation accuracy and low time complexity because it is often very large and sparse and different citations have different purposes. What's more, some citations include unreasonable information, such as in case of intentional self-citation. To improve the accuracy of citation network-based academic recommendation and reduce the time complexity, we propose an academic recommendation method for recommending authors and papers. In which, an author-paper bilayer citation network is built, then an enhanced topic model, Author Community Topic Time Model(ACTTM) is proposed to detect high quality author communities in the author layer, and a set of attributes are proposed to comprehensively depict the author/paper nodes in the bilayer citation network. Experimental results prove that the proposed ACTTM can detect high quality author communities and facilitate low time complexity, and the proposed academic recommendation method can effectively improve the recommendation accuracy.
文摘Today, recommendation systems are everywhere, making a variety of activities considerably more manageable. These systems help users by personalizing their suggestions to their interests and needs. They can propose various goods, including music, courses, articles, agricultural products, fertilizers, books, movies, and foods. In the case of research articles, recommendation algorithms play an essential role in minimizing the time required for researchers to find relevant articles. Despite multiple challenges, these systems must solve serious issues such as the cold-start problem, article privacy, and changing user interests. This research addresses these issues through the use of two techniques: hybrid recommendation systems and COOT optimization. To generate article recommendations, a hybrid recommendation system integrates features from content-based and graph-based recommendation systems. COOT optimization is used to optimize the results, inspired by the movement of water birds. The proposed method combines a graph-based recommendation system with COOT optimization to increase accuracy and reduce result inaccuracies. When compared to the baseline approaches described, the model provided in this study improves precision by 2.3%, recall by 1.6%, and mean reciprocal rank (MRR) by 5.7%.
文摘Discovering publication hierarchically-ordered contexts is the main task in context-based searching paradigm. The proposed techniques to discover publication contexts relies on the availability of domain-specific inputs, namely a pre-specified ontology terms. A problem with this technique is that the needed domain-specific inputs may not be available in some scientific disciplines. In this paper, we propose utilizing a powerful input that is naturally available in any scientific discipline to discover the hierarchically-ordered contexts of it, namely paper citation and co-authorship graphs. More specifically, we propose a set of domain-specific bibliometry-aware features that are automatically computable instead of domain-specific inputs that need experts’ efforts to prepare. Another benefit behind considering bibliometric-features to adapt to the special characteristics of the literature environment being targeted, which in turn facilitates contexts membership decision making. One key advantage of our proposal is that it considers temporal changes of the targeted publication set.
文摘Over the years, there has been increasing growth in academic digital libraries. It has therefore become overwhelming for researchers to determine important research materials. In most existing research works that consider scholarly paper recommendation, the researcher’s preference is left out. In this paper, therefore, Frequent Pattern (FP) Growth Algorithm is employed on potential papers generated from the researcher’s preferences to create a list of ranked papers based on citation features. The purpose is to provide a recommender system that is user oriented. A walk through algorithm is implemented to generate all possible frequent patterns from the FP-tree after which an output of ordered recommended papers combining subjective and objective factors of the researchers is produced. Experimental results with a scholarly paper recommendation dataset show that the proposed method is very promising, as it outperforms recommendation baselines as measured with nDCG and MRR.
文摘We search a variety of things over the Internet in our daily lives, and numerous search engines are available to get us more relevant results. With the rapid technological advancement, the internet has become a major source of obtaining information. Further, the advent of the Web2.0 era has led to an increased interaction between the user and the website. It has become challenging to provide information to users as per their interests. Because of copyright restrictions, most of existing research studies are confronting the lack of availability of the content of candidates recommending articles. The content of such articles is not always available freely and hence leads to inadequate recommendation results. Moreover, various research studies base recommendation on user profiles. Therefore, their recommendation needs a significant number of registered users in the system. In recent years, research work proves that Knowledge graphs have yielded better in generating quality recommendation results and alleviating sparsity and cold start issues. Network embedding techniques try to learn high quality feature vectors automatically from network structures, enabling vector-based measurers of node relatedness. Keeping the strength of Network embedding techniques, the proposed citation-based recommendation approach makes use of heterogeneous network embedding in generating recommendation results. The novelty of this paper is in exploiting the performance of a network embedding approach i.e., matapath2vec to generate paper recommendations. Unlike existing approaches, the proposed method has the capability of learning low-dimensional latent representation of nodes (i.e., research papers) in a network. We apply metapath2vec on a knowledge network built by the ACL Anthology Network (all about NLP) and use the node relatedness to generate item (research article) recommendations.