Purpose:The goal of this study is a comparative analysis of the relation between funding(a main driver for scientific research)and citations in papers of Nobel Laureates in physics,chemistry and medicine over 2019-202...Purpose:The goal of this study is a comparative analysis of the relation between funding(a main driver for scientific research)and citations in papers of Nobel Laureates in physics,chemistry and medicine over 2019-2020 and the same relation in these research fields as a whole.Design/methodology/approach:This study utilizes a power law model to explore the relationship between research funding and citations of related papers.The study here analyzes 3,539 recorded documents by Nobel Laureates in physics,chemistry and medicine and a broader dataset of 183,016 documents related to the fields of physics,medicine,and chemistry recorded in the Web of Science database.Findings:Results reveal that in chemistry and medicine,funded researches published in papers of Nobel Laureates have higher citations than unfunded studies published in articles;vice versa high citations of Nobel Laureates in physics are for unfunded studies published in papers.Instead,when overall data of publications and citations in physics,chemistry and medicine are analyzed,all papers based on funded researches show higher citations than unfunded ones.Originality/value:Results clarify the driving role of research funding for science diffusion that are systematized in general properties:a)articles concerning funded researches receive more citations than(un)funded studies published in papers of physics,chemistry and medicine sciences,generating a high Matthew effect(a higher growth of citations with the increase in the number of papers);b)research funding increases the citations of articles in fields oriented to applied research(e.g.,chemistry and medicine)more than fields oriented towards basic research(e.g.,physics).Practical implications:The results here explain some characteristics of scientific development and diffusion,highlighting the critical role of research funding in fostering citations and the expansion of scientific knowledge.This finding can support decision-making of policymakers and R&D managers to improve the effectiveness in allocating financial resources in science policies to generate a higher positive scientific and societal impact.展开更多
Purpose:The goal of this study is to analyze the relationship between funded and unfunded papers and their citations in both basic and applied sciences.Design/methodology/approach:A power law model analyzes the relati...Purpose:The goal of this study is to analyze the relationship between funded and unfunded papers and their citations in both basic and applied sciences.Design/methodology/approach:A power law model analyzes the relationship between research funding and citations of papers using 831,337 documents recorded in the Web of Science database.Findings:The original results reveal general characteristics of the diffusion of science in research fields:a)Funded articles receive higher citations compared to unfunded papers in journals;b)Funded articles exhibit a super-linear growth in citations,surpassing the increase seen in unfunded articles.This finding reveals a higher diffusion of scientific knowledge in funded articles.Moreover,c)funded articles in both basic and applied sciences demonstrate a similar expected change in citations,equivalent to about 1.23%,when the number of funded papers increases by 1%in journals.This result suggests,for the first time,that funding effect of scientific research is an invariant driver,irrespective of the nature of the basic or applied sciences.Originality/value:This evidence suggests empirical laws of funding for scientific citations that explain the importance of robust funding mechanisms for achieving impactful research outcomes in science and society.These findings here also highlight that funding for scientific research is a critical driving force in supporting citations and the dissemination of scientific knowledge in recorded documents in both basic and applied sciences.Practical implications:This comprehensive result provides a holistic view of the relationship between funding and citation performance in science to guide policymakers and R&D managers with science policies by directing funding to research in promoting the scientific development and higher diffusion of results for the progress of human society.展开更多
In the tag recommendation task on academic platforms,existing methods disregard users’customized preferences in favor of extracting tags based just on the content of the articles.Besides,it uses co-occurrence techniq...In the tag recommendation task on academic platforms,existing methods disregard users’customized preferences in favor of extracting tags based just on the content of the articles.Besides,it uses co-occurrence techniques and tries to combine nodes’textual content for modelling.They still do not,however,directly simulate many interactions in network learning.In order to address these issues,we present a novel system that more thoroughly integrates user preferences and citation networks into article labelling recommendations.Specifically,we first employ path similarity to quantify the degree of similarity between user labelling preferences and articles in the citation network.Then,the Commuting Matrix for massive node pair paths is used to improve computational performance.Finally,the two commonalities mentioned above are combined with the interaction paper labels based on the additivity of Poisson distribution.In addition,we also consider solving the model’s parameters by applying variational inference.Experimental results demonstrate that our suggested framework agrees and significantly outperforms the state-of-the-art baseline on two real datasets by efficiently merging the three relational data.Based on the Area Under Curve(AUC)and Mean Average Precision(MAP)analysis,the performance of the suggested task is evaluated,and it is demonstrated to have a greater solving efficiency than current techniques.展开更多
Purpose:To analyze the diversity of citation distributions to publications in different research topics to investigate the accuracy of size-independent,rank-based indicators.The top percentile-based indicators are the...Purpose:To analyze the diversity of citation distributions to publications in different research topics to investigate the accuracy of size-independent,rank-based indicators.The top percentile-based indicators are the most common indicators of this type,and the evaluations of Japan are the most evident misjudgments.Design/methodology/approach:The distributions of citations to publications from countries and journals in several research topics were analyzed along with the corresponding global publications using histograms with logarithmic binning,double rank plots,and normal probability plots of log-transformed numbers of citations.Findings:Size-independent,top percentile-based indicators are accurate when the global ranks of local publications fit a power law,but deviations in the least cited papers are frequent in countries and occur in all journals with high impact factors.In these cases,a single indicator is misleading.Comparisons of the proportions of uncited papers are the best way to predict these deviations.Research limitations:This study is fundamentally analytical,and its results describe mathematical facts that are self-evident.Practical implications:Respectable institutions,such as the OECD,the European Commission,and the U.S.National Science Board,produce research country rankings and individual evaluations using size-independent percentile indicators that are misleading in many countries.These misleading evaluations should be discontinued because they can cause confusion among research policymakers and lead to incorrect research policies.Originality/value:Studies linking the lower tail of citation distribution,including uncited papers,to percentile research indicators have not been performed previously.The present results demonstrate that studies of this type are necessary to find reliable procedures for research assessments.展开更多
Purpose:The disseminating of academic knowledge to nonacademic audiences partly relies on the transition of subsequent citing papers.This study aims to investigate direct and indirect impact on technology and policy o...Purpose:The disseminating of academic knowledge to nonacademic audiences partly relies on the transition of subsequent citing papers.This study aims to investigate direct and indirect impact on technology and policy originating from transformative research based on ego citation network.Design/methodology/approach:Key Nobel Prize-winning publications(NPs)in fields of gene engineering and astrophysics are regarded as a proxy for transformative research.In this contribution,we introduce a network-structural indicator of citing patents to measure technological impact of a target article and use policy citations as a preliminary tool for policy impact.Findings:The results show that the impact on technology and policy of NPs are higher than that of their subsequent citation generations in gene engineering but not in astrophysics.Research limitations:The selection of Nobel Prizes is not balanced and the database used in this study,Dimensions,suffers from incompleteness and inaccuracy of citation links.Practical implications:Our findings provide useful clues to better understand the characteristics of transformative research in technological and policy impact.Originality/value:This study proposes a new framework to explore the direct and indirect impact on technology and policy originating from transformative research.展开更多
Purpose:We analyzed the structure of a community of authors working in the field of social network analysis(SNA)based on citation indicators:direct citation and bibliographic coupling metrics.We observed patterns at t...Purpose:We analyzed the structure of a community of authors working in the field of social network analysis(SNA)based on citation indicators:direct citation and bibliographic coupling metrics.We observed patterns at the micro,meso,and macro levels of analysis.Design/methodology/approach:We used bibliometric network analysis,including the“temporal quantities”approach proposed to study temporal networks.Using a two-mode network linking publications with authors and a one-mode network of citations between the works,we constructed and analyzed the networks of citation and bibliographic coupling among authors.We used an iterated saturation data collection approach.Findings:At the macro-level,we observed the global structural features of citations between authors,showing that 80%of authors have not more than 15 citations from other works.At the meso-level,we extracted the groups of authors citing each other and similar to each other according to their citation patterns.We have seen a division of authors in SNA into groups of social scientists and physicists,as well as into other groups of authors from different disciplines.We found some examples of brokerage between different groups that maintained the common identity of the field.At the micro-level,we extracted authors with extremely high values of received citations,who can be considered as the most prominent authors in the field.We examined the temporal properties of the most popular authors.Research limitations:The main challenge in this approach is the resolution of the author’s name(synonyms and homonyms).We faced the author disambiguation,or“multiple personalities”(Harzing,2015)problem.To remain consistent and comparable with our previously published articles,we used the same SNA data collected up to 2018.The analysis and conclusions on the activity,productivity,and visibility of the authors are relative only to the field of SNA.Practical implications:The proposed approach can be utilized for similar objectives and identifying key structures and characteristics in other disciplines.This may potentially inspire the application of network approaches in other research areas,creating more authors collaborating in the field of SNA.Originality/value:We identified and applied an innovative approach and methods to study the structure of scientific communities,which allowed us to get the findings going beyond those obtained with other methods.We used a new approach to temporal network analysis,which is an important addition to the analysis as it provides detailed information on different measures for the authors and pairs of authors over time.展开更多
A novel indicator called price-citation was proposed.Based on the company integrated patent database of China listed companies of common stocks(A-shares)with the stock price and the stock return rate data,more than tw...A novel indicator called price-citation was proposed.Based on the company integrated patent database of China listed companies of common stocks(A-shares)with the stock price and the stock return rate data,more than two thousand of A-shares from 2017 to 2020 were selected.The effect of the traditional patent forward citation and the price-citation for discriminating the stock return rate was thoroughly analyzed via ANOVA.The A-shares of forward citation counts above the average showed higher stock return rate means than the A-shares having patents but receiving no forward citations.The price-citation,combining both the financial and patent attributes,defined as the multiplication of the current stock price and the currently receiving forward citation count,showed its excellence in discriminating the stock return rate.The A-shares of higher price-citation showed significantly higher stock return rate means while the A-shares of lower price-citation showed significantly lowest stock return rate means.The price-citation effect had not been changed by COVID-19 though COVID-19 affected the social and economic environment to a considerable extent in 2020.展开更多
This study examines how writer stance is projected in citation from a comparative perspective.The dataset consisted of 120 qualitative and quantitative research articles(RAs)that were authored by Anglophone and Chines...This study examines how writer stance is projected in citation from a comparative perspective.The dataset consisted of 120 qualitative and quantitative research articles(RAs)that were authored by Anglophone and Chinese applied linguists.ANO?VA tests revealed marked cross-language and cross-paradigmatic differences in stance-based dialogic engagement.展开更多
Purpose: First, to review the state-of-the-art in patent citation analysis, particularly characteristics of patent citations to scientific literature (scientific non-patent references, SNPRs). Second, to present a ...Purpose: First, to review the state-of-the-art in patent citation analysis, particularly characteristics of patent citations to scientific literature (scientific non-patent references, SNPRs). Second, to present a novel mapping approach to identify technology-relevant research based on the papers cited by and referring to the SNPRs. Design/methodology/approach: In the review part we discuss the context of SNPRs such as the time lags between scientific achievements and inventions. Also patent-to-patent citation is addressed particularly because this type of patent citation analysis is a major element in the assessment of the economic value of patents. We also review the research on the role of universities and researchers in technological development, with important issues such as universities as sources of technological knowledge and inventor-author relations. We conclude the review part of this paper with an overview of recent research on mapping and network analysis of the science and technology interface and of technological progress in interaction with science. In the second part we apply new techniques for the direct visualization of the cited and citing relations of SNPRs, the mapping of the landscape around SNPRs by bibliographic coupling and co-citation analysis, and the mapping of the conceptual environment of SNPRs by keyword co-occurrence analysis. Findings: We discuss several properties of SNPRs. Only a small minority of publications covered by the Web of Science or Scopus are cited by patents, about 3%-4%. However, for publications based on university-industry collaboration the number of SNPRs is considerably higher, around 15%. The proposed mapping methodology based on a "second order SNPR approach" enables a better assessment of the technological relevance of research. Research limitations: The main limitation is that a more advanced merging of patent and publication data, in particular unification of author and inventor names, in still a necessity. Practical implications: The proposed mapping methodology enables the creation of a database of technology-relevant papers (TRPs). In a bibliometric assessment the publications of research groups, research programs or institutes can be matched with the TRPs and thus the extent to which the work of groups, programs or institutes are relevant for technological development can be measured. Originality/value: The review part examines a wide range of findings in the research of patent citation analysis. The mapping approach to identify a broad range of technologyrelevant papers is novel and offers new opportunities in research evaluation practices.展开更多
Purpose: (1) To test basic assumptions underlying frequency-weighted citation analysis: (a) Uni-citations correspond to citations that are nonessential to the citing papers; (b) The influence of a cited paper ...Purpose: (1) To test basic assumptions underlying frequency-weighted citation analysis: (a) Uni-citations correspond to citations that are nonessential to the citing papers; (b) The influence of a cited paper on the citing paper increases with the frequency with which it is cited in the citing paper. (2) To explore the degree to which citation location may be used to help identify nonessential citations. Design/methodology/approach: Each of the in-text citations in all research articles published in Issue 1 of the Journal of the Association for Information Science and Technology (JASIST) 2016 was manually classified into one of these five categories: Applied, Contrastive, Supportive, Reviewed, and Perfunctory. The distributions of citations at different in-text frequencies and in different locations in the text by these functions were analyzed. Findings: Filtering out nonessential citations before assigning weight is important for frequency-weighted citation analysis. For this purpose, removing citations by location is more effective than re-citation analysis that simply removes uni-citations. Removing all citation occurrences in the Background and Literature Review sections and uni-citations in the Introduction section appears to provide a good balance between filtration and error rates. Research limitations: This case study suffers from the limitation of scalability and generalizability. We took careful measures to reduce the impact of other limitations of the data collection approach used. Relying on the researcher's judgment to attribute citation functions, this approach is unobtrusive but speculative, and can suffer from a low degree of confidence, thus creating reliability concerns. Practical implications: Weighted citation analysis promises to improve citation analysis for research evaluation, knowledge network analysis, knowledge representation, and information retrieval. The present study showed the importance of filtering out nonessential citations before assigning weight in a weighted citation analysis, which may be a significant step forward to realizing these promises. Originality/value: Weighted citation analysis has long been proposed as a theoretical solution to the problem of citation analysis that treats all citations equally, and has attracted increasing research interest in recent years. The present study showed, for the first time, the importance of filtering out nonessential citations in weighted citation analysis, pointing research in this area in a new direction.展开更多
AIM: To determine the citation status in 2004 and the citation trend of WJG by analyzing all articles cited by WJG and all WJG articles cited by SCI journals during 1998-2004.METHODS: The total number of published art...AIM: To determine the citation status in 2004 and the citation trend of WJG by analyzing all articles cited by WJG and all WJG articles cited by SCI journals during 1998-2004.METHODS: The total number of published articles and reference citations in WJG, authors' self-citations, WJG's self-citations, citations of WJG articles by SCI journals and inappropriate citations in WJG during 1998-2004 were statistically analyzed. Data on self-citations of the articles published between 1998 and August 2004 (Issues 1-16)were from ISI SCI-E, and data on self-citations of articles published after August 2004 (Issues 17-24) were from the WJG Editorial Office. Data on citations of WJG articles by other journals between 1998 and August 2004 were from ISI SCI-E.RESULTS: Annual number of published articles: WJG published 179, 144, 211, 174, 236, 634 and 830 articles,respectively, in 1998, 1999, 2000, 2001, 2002, 2003 and 2004. The number in 2004 increased by 594, compared to that in 2002, giving an increased rate of 251.7%. Annual references cited by WJG were 2 123, 2 125, 6 244,8 883, 11 442, 23 218 and 25 971, respectively, in 1998-2004.The average number was 31.3 per WJG article in 2004,which was less than that (48.5) in 2002, giving a reduction rate of 35.5%. Authors of WJG cited 125, 126, 343, 210,354, 310 and 470 of their own published articles,respectively, in 1998-2004. The average number of authors'self-citations was 0.57 per WJG article in 2004, which was decreased by 0.93 or 62.0%, compared with that in 2002.Annual numbers of journal's self-citations: Authors of WJG articles cited 5, 7, 373, 733, 1474, 1947 and 1412 of WJG articles, respectively, in 1998-2004. The average number of journal's self-citations was 1.70 per WJG article in 2004,which decreased by 4.55 or 72.8%. No WJG article was cited in 1998 by other SCI journals. However, the number of citations steadily increased afterwards, with 16, 18, 39,85, 372 and 580, respectively, in 1999-2004. The average number of citations by other SCI journals was 0.11, 0.09,0.22, 0.36, 0.59 and 1.06 per WJG article, respectively,1999-2004 (January-August). There was an increase by 582%, when comparing the citation numbers between 2004and 2002. Annual WJG self-citation rates and citation rates of WJG articles by other SCI journals: WJG self-citation rates were 30.43%, 95.40%, 95.07%, 94.55%, 83.96%and 67.47%, respectively, in 1999-2004 (January-August).Compared with 2002, the self-citation rate in 2004decreased by 26.87%. The citation rates of WJG articles by other SCI journals were 69.57%, 4.60%, 4.93%, 5.45%,16.04%, and 32.53%, respectively, in 1999-2004 (January-August). Compared with 2002, the citation rate in 2004decreased by 26.87%. There were 8, 19, 218, 274, 461,698 and 574 inappropriate citations, respectively, in 1998-2004.The average inappropriate citation in 2004 was 0.69 per article, which represents a decrease of 1.26, compared with that in 2002. Inappropriate citations were mostly those with the differences between the two sides of the hyphens of 5-9, and the proportions of inappropriate citations within the three subsections of the differences between the two sides of the hyphens (5-9, 10-19, and >=20) were approximately 7:2:1. In addition, inappropriate citations mostly occurred with frequencies of 1-3 in the articles,and the proportion of inappropriate citations within the two frequency subsections (1-3 and >3) have been approximately 4:1 since 1999.CONCLUSIONS: In 2004, the average number of reference citations, authors' self-citations and journal's self-citations were 31.3, 0.57 and 1.70 per article, respectively, which represents a decrease in the numbers by 35.5%. 62.0%,and 72.8% respectively compared to the corresponding numbers in 2002. WJG self-citation rate was 67.47% in 2004 (January-August), which was a decrease by26.87%,compared with 2002. The citation rate of WJG articles by other SCI journals was 32.53% in 2004 (January-August),an increase of 26.87%, compared to 2002. There were 574inappropriate citations in 2004, with an average of 0.69per article, which represents a decrease of 1.26, compared with that in 2002. These figures demonstrate that the overall citation status of WJG is improving.展开更多
Purpose: Communicating scientific results to the public is essential to inspire future researchers and ensure that discoveries are exploited. News stories about research are a key communication pathway for this and ha...Purpose: Communicating scientific results to the public is essential to inspire future researchers and ensure that discoveries are exploited. News stories about research are a key communication pathway for this and have been manually monitored to assess the extent of press coverage of scholarship.Design/methodology/Approach: To make larger scale studies practical, this paper introduces an automatic method to extract citations from newspaper stories to large sets of academic journals. Curated ProQuest queries were used to search for citations to 9,639 Science and3,412 Social Science Web of Science(WoS) journals from eight UK daily newspapers during2006–2015. False matches were automatically filtered out by a new program, with 94% of the remaining stories meaningfully citing research.Findings: Most Science(95%) and Social Science(94%) journals were never cited by these newspapers. Half of the cited Science journals covered medical or health-related topics,whereas 43% of the Social Sciences journals were related to psychiatry or psychology. From the citing news stories, 60% described research extensively and 53% used multiple sources,but few commented on research quality.Research Limitations: The method has only been tested in English and from the ProQuest Newspapers database.Practical implications: Others can use the new method to systematically harvest press coverage of research.Originality/value: An automatic method was introduced and tested to extract citations from newspaper stories to large sets of academic journals.展开更多
Purpose:We study the proportion of Web of Science(WoS)citation links that are represented in the Crossref Open Citation Index(COCI),with the possible aim of using COCI in research evaluation instead of the WoS,if the ...Purpose:We study the proportion of Web of Science(WoS)citation links that are represented in the Crossref Open Citation Index(COCI),with the possible aim of using COCI in research evaluation instead of the WoS,if the level of coverage was sufficient.Design/methodology/approach:We calculate the proportion on citation links where both publications have a WoS accession number and a DOI simultaneously,and where the cited publications have had at least one author from our institution,the Czech Technical University in Prague.We attempt to look up each such citation link in COCI.Findings:We find that 53.7%of WoS citation links are present in the COCI.The proportion varies largely by discipline.The total figures differ significantly from 40%in the large-scale study by Van Eck,Waltman,Larivière,and Sugimoto(blog 2018,https://www.cwts.nl/blog?article=n-r2s234).Research limitations:The sample does not cover all science areas uniformly;it is heavily focused on Engineering and Technology,and only some disciplines of Natural Sciences are present.However,this reflects the real scientific orientation and publication profile of our institution.Practical implications:The current level of coverage is not sufficient for the WoS to be replaced by COCI for research evaluation.Originality/value:The present study illustrates a COCI vs WoS comparison on the scale of a larger technical university in Central Europe.展开更多
Purpose:Research dynamics have long been a research interest.It is a macro perspective tool for discovering temporal research trends of a certain discipline or subject.A micro perspective of research dynamics,however,...Purpose:Research dynamics have long been a research interest.It is a macro perspective tool for discovering temporal research trends of a certain discipline or subject.A micro perspective of research dynamics,however,concerning a single researcher or a highly cited paper in terms of their citations and“citations of citations”(forward chaining)remains unexplored.Design/methodology/approach:In this paper,we use a cross-collection topic model to reveal the research dynamics of topic disappearance topic inheritance,and topic innovation in each generation of forward chaining.Findings:For highly cited work,scientific influence exists in indirect citations.Topic modeling can reveal how long this influence exists in forward chaining,as well as its influence.Research limitations:This paper measures scientific influence and indirect scientific influence only if the relevant words or phrases are borrowed or used in direct or indirect citations.Paraphrasing or semantically similar concept may be neglected in this research.Practical implications:This paper demonstrates that a scientific influence exists in indirect citations through its analysis of forward chaining.This can serve as an inspiration on how to adequately evaluate research influence.Originality:The main contributions of this paper are the following three aspects.First,besides research dynamics of topic inheritance and topic innovation,we model topic disappearance by using a cross-collection topic model.Second,we explore the length and character of the research impact through“citations of citations”content analysis.Finally,we analyze the research dynamics of artificial intelligence researcher Geoffrey Hinton’s publications and the topic dynamics of forward chaining.展开更多
Purpose: To design and test a method for normalizing book citations in Google Scholar.Design/methodology/approach: A hybrid citing-side, cited-side normalization method was developed and this was tested on a sample of...Purpose: To design and test a method for normalizing book citations in Google Scholar.Design/methodology/approach: A hybrid citing-side, cited-side normalization method was developed and this was tested on a sample of 285 research monographs. The results were analyzed and conclusions drawn.Findings: The method was technically feasible but required extensive manual intervention because of the poor quality of the Google Scholar data. Research limitations: The sample of books was limited and also all were from one discipline —business and management. Also, the method has only been tested on Google Scholar, it would be useful to test it on Web of Science or Scopus.Practical limitations: Google Scholar is a poor source of data although it does cover a much wider range citation sources that other databases. Originality/value: This is the first method that has been developed specifically for normalizing books which have so far not been able to be normalized.展开更多
Citations play an important role in the scientific community by assisting in measuring multifarious policies like the impact of journals,researchers,institutions,and countries.Authors cite papers for different reasons...Citations play an important role in the scientific community by assisting in measuring multifarious policies like the impact of journals,researchers,institutions,and countries.Authors cite papers for different reasons,such as extending previous work,comparing their study with the state-of-the-art,providing background of the field,etc.In recent years,researchers have tried to conceptualize all citations into two broad categories,important and incidental.Such a categorization is very important to enhance scientific output in multiple ways,for instance,(1)Helping a researcher in identifying meaningful citations from a list of 100 to 1000 citations(2)Enhancing the impact factor calculation mechanism by more strongly weighting important citations,and(3)Improving researcher,institutional,and university rankings by only considering important citations.All of these uses depend upon correctly identifying the important citations from the list of all citations in a paper.To date,researchers have utilized many features to classify citations into these broad categories:cue phrases,in-text citation counts,and metadata features,etc.However,contemporary approaches are based on identification of in-text citation counts,mapping sections onto the Introduction,Methods,Results,and Discussion(IMRAD)structure,identifying cue phrases,etc.Identifying such features accurately is a challenging task and is normally conducted manually,with the accuracy of citation classification demonstrated in terms of these manually extracted features.This research proposes to examine the content of the cited and citing pair to identify important citing papers for each cited paper.This content similarity approach was adopted from research paper recommendation approaches.Furthermore,a novel section-based content similarity approach is also proposed.The results show that solely using the abstract of the cited and citing papers can achieve similar accuracy as the stateof-the-art approaches.This makes the proposed approach a viable technique that does not depend on manual identification of complex features.展开更多
Purpose:Providing an overview of types of citation curves.Design/methodology/approach:The terms citation curves or citation graphs are made explicit.Findings:A framework for the study of diachronous(and synchronous)ci...Purpose:Providing an overview of types of citation curves.Design/methodology/approach:The terms citation curves or citation graphs are made explicit.Findings:A framework for the study of diachronous(and synchronous)citation curves is proposed.Research limitations:No new practical applications are given.Practical implications:This short note about citation curves will help readers to make the optimal choice for their applications.Originality/value:A new scheme for the meaning of the term"citation curve"is designed.展开更多
Citations based relevant research paper recommendations can be generated primarily with the assistance of three citation models:(1)Bibliographic Coupling,(2)Co-Citation,and(3)Direct Citations.Millions of new scholarly...Citations based relevant research paper recommendations can be generated primarily with the assistance of three citation models:(1)Bibliographic Coupling,(2)Co-Citation,and(3)Direct Citations.Millions of new scholarly articles are published every year.This flux of scientific information has made it a challenging task to devise techniques that could help researchers to find the most relevant research papers for the paper at hand.In this study,we have deployed an in-text citation analysis that extends the Direct Citation Model to discover the nature of the relationship degree-ofrelevancy among scientific papers.For this purpose,the relationship between citing and cited articles is categorized into three categories:weak,medium,and strong.As an experiment,around 5,000 research papers were crawled from the CiteSeerX.These research papers were parsed for the identification of in-text citation frequencies.Subsequently,0.1 million references of those articles were extracted,and their in-text citation frequencies were computed.A comprehensive benchmark dataset was established based on the user study.Afterwards,the results were validated with the help of Least Square Approximation by Quadratic Polynomial method.It was found that degreeof-relevancy between scientific papers is a quadratic increasing/decreasing polynomial with respect to-increase/decrease in the in-text citation frequencies of a cited article.Furthermore,the results of the proposed model were compared with state-of-the-art techniques by utilizing a well-known measure,known as the normalized Discount Cumulative Gain(nDCG).The proposed method received an nDCG score of 0.89,whereas the state-of-the-art models such as the Content,Bibliographic-coupling,and Metadata-based Models were able to acquire the nDCG values of 0.65,0.54,and 0.51 respectively.These results indicate that the proposed mechanism may be applied in future information retrieval systems for better results.展开更多
文摘Purpose:The goal of this study is a comparative analysis of the relation between funding(a main driver for scientific research)and citations in papers of Nobel Laureates in physics,chemistry and medicine over 2019-2020 and the same relation in these research fields as a whole.Design/methodology/approach:This study utilizes a power law model to explore the relationship between research funding and citations of related papers.The study here analyzes 3,539 recorded documents by Nobel Laureates in physics,chemistry and medicine and a broader dataset of 183,016 documents related to the fields of physics,medicine,and chemistry recorded in the Web of Science database.Findings:Results reveal that in chemistry and medicine,funded researches published in papers of Nobel Laureates have higher citations than unfunded studies published in articles;vice versa high citations of Nobel Laureates in physics are for unfunded studies published in papers.Instead,when overall data of publications and citations in physics,chemistry and medicine are analyzed,all papers based on funded researches show higher citations than unfunded ones.Originality/value:Results clarify the driving role of research funding for science diffusion that are systematized in general properties:a)articles concerning funded researches receive more citations than(un)funded studies published in papers of physics,chemistry and medicine sciences,generating a high Matthew effect(a higher growth of citations with the increase in the number of papers);b)research funding increases the citations of articles in fields oriented to applied research(e.g.,chemistry and medicine)more than fields oriented towards basic research(e.g.,physics).Practical implications:The results here explain some characteristics of scientific development and diffusion,highlighting the critical role of research funding in fostering citations and the expansion of scientific knowledge.This finding can support decision-making of policymakers and R&D managers to improve the effectiveness in allocating financial resources in science policies to generate a higher positive scientific and societal impact.
文摘Purpose:The goal of this study is to analyze the relationship between funded and unfunded papers and their citations in both basic and applied sciences.Design/methodology/approach:A power law model analyzes the relationship between research funding and citations of papers using 831,337 documents recorded in the Web of Science database.Findings:The original results reveal general characteristics of the diffusion of science in research fields:a)Funded articles receive higher citations compared to unfunded papers in journals;b)Funded articles exhibit a super-linear growth in citations,surpassing the increase seen in unfunded articles.This finding reveals a higher diffusion of scientific knowledge in funded articles.Moreover,c)funded articles in both basic and applied sciences demonstrate a similar expected change in citations,equivalent to about 1.23%,when the number of funded papers increases by 1%in journals.This result suggests,for the first time,that funding effect of scientific research is an invariant driver,irrespective of the nature of the basic or applied sciences.Originality/value:This evidence suggests empirical laws of funding for scientific citations that explain the importance of robust funding mechanisms for achieving impactful research outcomes in science and society.These findings here also highlight that funding for scientific research is a critical driving force in supporting citations and the dissemination of scientific knowledge in recorded documents in both basic and applied sciences.Practical implications:This comprehensive result provides a holistic view of the relationship between funding and citation performance in science to guide policymakers and R&D managers with science policies by directing funding to research in promoting the scientific development and higher diffusion of results for the progress of human society.
基金supported by the National Natural Science Foundation of China(No.62271274).
文摘In the tag recommendation task on academic platforms,existing methods disregard users’customized preferences in favor of extracting tags based just on the content of the articles.Besides,it uses co-occurrence techniques and tries to combine nodes’textual content for modelling.They still do not,however,directly simulate many interactions in network learning.In order to address these issues,we present a novel system that more thoroughly integrates user preferences and citation networks into article labelling recommendations.Specifically,we first employ path similarity to quantify the degree of similarity between user labelling preferences and articles in the citation network.Then,the Commuting Matrix for massive node pair paths is used to improve computational performance.Finally,the two commonalities mentioned above are combined with the interaction paper labels based on the additivity of Poisson distribution.In addition,we also consider solving the model’s parameters by applying variational inference.Experimental results demonstrate that our suggested framework agrees and significantly outperforms the state-of-the-art baseline on two real datasets by efficiently merging the three relational data.Based on the Area Under Curve(AUC)and Mean Average Precision(MAP)analysis,the performance of the suggested task is evaluated,and it is demonstrated to have a greater solving efficiency than current techniques.
文摘Purpose:To analyze the diversity of citation distributions to publications in different research topics to investigate the accuracy of size-independent,rank-based indicators.The top percentile-based indicators are the most common indicators of this type,and the evaluations of Japan are the most evident misjudgments.Design/methodology/approach:The distributions of citations to publications from countries and journals in several research topics were analyzed along with the corresponding global publications using histograms with logarithmic binning,double rank plots,and normal probability plots of log-transformed numbers of citations.Findings:Size-independent,top percentile-based indicators are accurate when the global ranks of local publications fit a power law,but deviations in the least cited papers are frequent in countries and occur in all journals with high impact factors.In these cases,a single indicator is misleading.Comparisons of the proportions of uncited papers are the best way to predict these deviations.Research limitations:This study is fundamentally analytical,and its results describe mathematical facts that are self-evident.Practical implications:Respectable institutions,such as the OECD,the European Commission,and the U.S.National Science Board,produce research country rankings and individual evaluations using size-independent percentile indicators that are misleading in many countries.These misleading evaluations should be discontinued because they can cause confusion among research policymakers and lead to incorrect research policies.Originality/value:Studies linking the lower tail of citation distribution,including uncited papers,to percentile research indicators have not been performed previously.The present results demonstrate that studies of this type are necessary to find reliable procedures for research assessments.
基金supported by the National Natural Science Foundation of China(Grant No.71974167).
文摘Purpose:The disseminating of academic knowledge to nonacademic audiences partly relies on the transition of subsequent citing papers.This study aims to investigate direct and indirect impact on technology and policy originating from transformative research based on ego citation network.Design/methodology/approach:Key Nobel Prize-winning publications(NPs)in fields of gene engineering and astrophysics are regarded as a proxy for transformative research.In this contribution,we introduce a network-structural indicator of citing patents to measure technological impact of a target article and use policy citations as a preliminary tool for policy impact.Findings:The results show that the impact on technology and policy of NPs are higher than that of their subsequent citation generations in gene engineering but not in astrophysics.Research limitations:The selection of Nobel Prizes is not balanced and the database used in this study,Dimensions,suffers from incompleteness and inaccuracy of citation links.Practical implications:Our findings provide useful clues to better understand the characteristics of transformative research in technological and policy impact.Originality/value:This study proposes a new framework to explore the direct and indirect impact on technology and policy originating from transformative research.
基金supported in part by the Slovenian Research Agency(VB,research program P1-0294)(VB,research project J5-2557)+2 种基金(VB,research project J5-4596)COST EU(VB,COST action CA21163(HiTEc)is prepared within the framework of the HSE University Basic Research Program.
文摘Purpose:We analyzed the structure of a community of authors working in the field of social network analysis(SNA)based on citation indicators:direct citation and bibliographic coupling metrics.We observed patterns at the micro,meso,and macro levels of analysis.Design/methodology/approach:We used bibliometric network analysis,including the“temporal quantities”approach proposed to study temporal networks.Using a two-mode network linking publications with authors and a one-mode network of citations between the works,we constructed and analyzed the networks of citation and bibliographic coupling among authors.We used an iterated saturation data collection approach.Findings:At the macro-level,we observed the global structural features of citations between authors,showing that 80%of authors have not more than 15 citations from other works.At the meso-level,we extracted the groups of authors citing each other and similar to each other according to their citation patterns.We have seen a division of authors in SNA into groups of social scientists and physicists,as well as into other groups of authors from different disciplines.We found some examples of brokerage between different groups that maintained the common identity of the field.At the micro-level,we extracted authors with extremely high values of received citations,who can be considered as the most prominent authors in the field.We examined the temporal properties of the most popular authors.Research limitations:The main challenge in this approach is the resolution of the author’s name(synonyms and homonyms).We faced the author disambiguation,or“multiple personalities”(Harzing,2015)problem.To remain consistent and comparable with our previously published articles,we used the same SNA data collected up to 2018.The analysis and conclusions on the activity,productivity,and visibility of the authors are relative only to the field of SNA.Practical implications:The proposed approach can be utilized for similar objectives and identifying key structures and characteristics in other disciplines.This may potentially inspire the application of network approaches in other research areas,creating more authors collaborating in the field of SNA.Originality/value:We identified and applied an innovative approach and methods to study the structure of scientific communities,which allowed us to get the findings going beyond those obtained with other methods.We used a new approach to temporal network analysis,which is an important addition to the analysis as it provides detailed information on different measures for the authors and pairs of authors over time.
基金support from Ministry of Science and Technology,Taiwan,R.O.C.under Grant No.MOST 109-2410-H-011-021-MY3.
文摘A novel indicator called price-citation was proposed.Based on the company integrated patent database of China listed companies of common stocks(A-shares)with the stock price and the stock return rate data,more than two thousand of A-shares from 2017 to 2020 were selected.The effect of the traditional patent forward citation and the price-citation for discriminating the stock return rate was thoroughly analyzed via ANOVA.The A-shares of forward citation counts above the average showed higher stock return rate means than the A-shares having patents but receiving no forward citations.The price-citation,combining both the financial and patent attributes,defined as the multiplication of the current stock price and the currently receiving forward citation count,showed its excellence in discriminating the stock return rate.The A-shares of higher price-citation showed significantly higher stock return rate means while the A-shares of lower price-citation showed significantly lowest stock return rate means.The price-citation effect had not been changed by COVID-19 though COVID-19 affected the social and economic environment to a considerable extent in 2020.
文摘This study examines how writer stance is projected in citation from a comparative perspective.The dataset consisted of 120 qualitative and quantitative research articles(RAs)that were authored by Anglophone and Chinese applied linguists.ANO?VA tests revealed marked cross-language and cross-paradigmatic differences in stance-based dialogic engagement.
文摘Purpose: First, to review the state-of-the-art in patent citation analysis, particularly characteristics of patent citations to scientific literature (scientific non-patent references, SNPRs). Second, to present a novel mapping approach to identify technology-relevant research based on the papers cited by and referring to the SNPRs. Design/methodology/approach: In the review part we discuss the context of SNPRs such as the time lags between scientific achievements and inventions. Also patent-to-patent citation is addressed particularly because this type of patent citation analysis is a major element in the assessment of the economic value of patents. We also review the research on the role of universities and researchers in technological development, with important issues such as universities as sources of technological knowledge and inventor-author relations. We conclude the review part of this paper with an overview of recent research on mapping and network analysis of the science and technology interface and of technological progress in interaction with science. In the second part we apply new techniques for the direct visualization of the cited and citing relations of SNPRs, the mapping of the landscape around SNPRs by bibliographic coupling and co-citation analysis, and the mapping of the conceptual environment of SNPRs by keyword co-occurrence analysis. Findings: We discuss several properties of SNPRs. Only a small minority of publications covered by the Web of Science or Scopus are cited by patents, about 3%-4%. However, for publications based on university-industry collaboration the number of SNPRs is considerably higher, around 15%. The proposed mapping methodology based on a "second order SNPR approach" enables a better assessment of the technological relevance of research. Research limitations: The main limitation is that a more advanced merging of patent and publication data, in particular unification of author and inventor names, in still a necessity. Practical implications: The proposed mapping methodology enables the creation of a database of technology-relevant papers (TRPs). In a bibliometric assessment the publications of research groups, research programs or institutes can be matched with the TRPs and thus the extent to which the work of groups, programs or institutes are relevant for technological development can be measured. Originality/value: The review part examines a wide range of findings in the research of patent citation analysis. The mapping approach to identify a broad range of technologyrelevant papers is novel and offers new opportunities in research evaluation practices.
文摘Purpose: (1) To test basic assumptions underlying frequency-weighted citation analysis: (a) Uni-citations correspond to citations that are nonessential to the citing papers; (b) The influence of a cited paper on the citing paper increases with the frequency with which it is cited in the citing paper. (2) To explore the degree to which citation location may be used to help identify nonessential citations. Design/methodology/approach: Each of the in-text citations in all research articles published in Issue 1 of the Journal of the Association for Information Science and Technology (JASIST) 2016 was manually classified into one of these five categories: Applied, Contrastive, Supportive, Reviewed, and Perfunctory. The distributions of citations at different in-text frequencies and in different locations in the text by these functions were analyzed. Findings: Filtering out nonessential citations before assigning weight is important for frequency-weighted citation analysis. For this purpose, removing citations by location is more effective than re-citation analysis that simply removes uni-citations. Removing all citation occurrences in the Background and Literature Review sections and uni-citations in the Introduction section appears to provide a good balance between filtration and error rates. Research limitations: This case study suffers from the limitation of scalability and generalizability. We took careful measures to reduce the impact of other limitations of the data collection approach used. Relying on the researcher's judgment to attribute citation functions, this approach is unobtrusive but speculative, and can suffer from a low degree of confidence, thus creating reliability concerns. Practical implications: Weighted citation analysis promises to improve citation analysis for research evaluation, knowledge network analysis, knowledge representation, and information retrieval. The present study showed the importance of filtering out nonessential citations before assigning weight in a weighted citation analysis, which may be a significant step forward to realizing these promises. Originality/value: Weighted citation analysis has long been proposed as a theoretical solution to the problem of citation analysis that treats all citations equally, and has attracted increasing research interest in recent years. The present study showed, for the first time, the importance of filtering out nonessential citations in weighted citation analysis, pointing research in this area in a new direction.
基金Supported by the National Natural Science Foundation of China,No.30224801
文摘AIM: To determine the citation status in 2004 and the citation trend of WJG by analyzing all articles cited by WJG and all WJG articles cited by SCI journals during 1998-2004.METHODS: The total number of published articles and reference citations in WJG, authors' self-citations, WJG's self-citations, citations of WJG articles by SCI journals and inappropriate citations in WJG during 1998-2004 were statistically analyzed. Data on self-citations of the articles published between 1998 and August 2004 (Issues 1-16)were from ISI SCI-E, and data on self-citations of articles published after August 2004 (Issues 17-24) were from the WJG Editorial Office. Data on citations of WJG articles by other journals between 1998 and August 2004 were from ISI SCI-E.RESULTS: Annual number of published articles: WJG published 179, 144, 211, 174, 236, 634 and 830 articles,respectively, in 1998, 1999, 2000, 2001, 2002, 2003 and 2004. The number in 2004 increased by 594, compared to that in 2002, giving an increased rate of 251.7%. Annual references cited by WJG were 2 123, 2 125, 6 244,8 883, 11 442, 23 218 and 25 971, respectively, in 1998-2004.The average number was 31.3 per WJG article in 2004,which was less than that (48.5) in 2002, giving a reduction rate of 35.5%. Authors of WJG cited 125, 126, 343, 210,354, 310 and 470 of their own published articles,respectively, in 1998-2004. The average number of authors'self-citations was 0.57 per WJG article in 2004, which was decreased by 0.93 or 62.0%, compared with that in 2002.Annual numbers of journal's self-citations: Authors of WJG articles cited 5, 7, 373, 733, 1474, 1947 and 1412 of WJG articles, respectively, in 1998-2004. The average number of journal's self-citations was 1.70 per WJG article in 2004,which decreased by 4.55 or 72.8%. No WJG article was cited in 1998 by other SCI journals. However, the number of citations steadily increased afterwards, with 16, 18, 39,85, 372 and 580, respectively, in 1999-2004. The average number of citations by other SCI journals was 0.11, 0.09,0.22, 0.36, 0.59 and 1.06 per WJG article, respectively,1999-2004 (January-August). There was an increase by 582%, when comparing the citation numbers between 2004and 2002. Annual WJG self-citation rates and citation rates of WJG articles by other SCI journals: WJG self-citation rates were 30.43%, 95.40%, 95.07%, 94.55%, 83.96%and 67.47%, respectively, in 1999-2004 (January-August).Compared with 2002, the self-citation rate in 2004decreased by 26.87%. The citation rates of WJG articles by other SCI journals were 69.57%, 4.60%, 4.93%, 5.45%,16.04%, and 32.53%, respectively, in 1999-2004 (January-August). Compared with 2002, the citation rate in 2004decreased by 26.87%. There were 8, 19, 218, 274, 461,698 and 574 inappropriate citations, respectively, in 1998-2004.The average inappropriate citation in 2004 was 0.69 per article, which represents a decrease of 1.26, compared with that in 2002. Inappropriate citations were mostly those with the differences between the two sides of the hyphens of 5-9, and the proportions of inappropriate citations within the three subsections of the differences between the two sides of the hyphens (5-9, 10-19, and >=20) were approximately 7:2:1. In addition, inappropriate citations mostly occurred with frequencies of 1-3 in the articles,and the proportion of inappropriate citations within the two frequency subsections (1-3 and >3) have been approximately 4:1 since 1999.CONCLUSIONS: In 2004, the average number of reference citations, authors' self-citations and journal's self-citations were 31.3, 0.57 and 1.70 per article, respectively, which represents a decrease in the numbers by 35.5%. 62.0%,and 72.8% respectively compared to the corresponding numbers in 2002. WJG self-citation rate was 67.47% in 2004 (January-August), which was a decrease by26.87%,compared with 2002. The citation rate of WJG articles by other SCI journals was 32.53% in 2004 (January-August),an increase of 26.87%, compared to 2002. There were 574inappropriate citations in 2004, with an average of 0.69per article, which represents a decrease of 1.26, compared with that in 2002. These figures demonstrate that the overall citation status of WJG is improving.
文摘Purpose: Communicating scientific results to the public is essential to inspire future researchers and ensure that discoveries are exploited. News stories about research are a key communication pathway for this and have been manually monitored to assess the extent of press coverage of scholarship.Design/methodology/Approach: To make larger scale studies practical, this paper introduces an automatic method to extract citations from newspaper stories to large sets of academic journals. Curated ProQuest queries were used to search for citations to 9,639 Science and3,412 Social Science Web of Science(WoS) journals from eight UK daily newspapers during2006–2015. False matches were automatically filtered out by a new program, with 94% of the remaining stories meaningfully citing research.Findings: Most Science(95%) and Social Science(94%) journals were never cited by these newspapers. Half of the cited Science journals covered medical or health-related topics,whereas 43% of the Social Sciences journals were related to psychiatry or psychology. From the citing news stories, 60% described research extensively and 53% used multiple sources,but few commented on research quality.Research Limitations: The method has only been tested in English and from the ProQuest Newspapers database.Practical implications: Others can use the new method to systematically harvest press coverage of research.Originality/value: An automatic method was introduced and tested to extract citations from newspaper stories to large sets of academic journals.
文摘Purpose:We study the proportion of Web of Science(WoS)citation links that are represented in the Crossref Open Citation Index(COCI),with the possible aim of using COCI in research evaluation instead of the WoS,if the level of coverage was sufficient.Design/methodology/approach:We calculate the proportion on citation links where both publications have a WoS accession number and a DOI simultaneously,and where the cited publications have had at least one author from our institution,the Czech Technical University in Prague.We attempt to look up each such citation link in COCI.Findings:We find that 53.7%of WoS citation links are present in the COCI.The proportion varies largely by discipline.The total figures differ significantly from 40%in the large-scale study by Van Eck,Waltman,Larivière,and Sugimoto(blog 2018,https://www.cwts.nl/blog?article=n-r2s234).Research limitations:The sample does not cover all science areas uniformly;it is heavily focused on Engineering and Technology,and only some disciplines of Natural Sciences are present.However,this reflects the real scientific orientation and publication profile of our institution.Practical implications:The current level of coverage is not sufficient for the WoS to be replaced by COCI for research evaluation.Originality/value:The present study illustrates a COCI vs WoS comparison on the scale of a larger technical university in Central Europe.
基金This work is supported by the Programs for the Young Talents of National Science Library,Chinese Academy of Sciences(Grant No.2019QNGR003).
文摘Purpose:Research dynamics have long been a research interest.It is a macro perspective tool for discovering temporal research trends of a certain discipline or subject.A micro perspective of research dynamics,however,concerning a single researcher or a highly cited paper in terms of their citations and“citations of citations”(forward chaining)remains unexplored.Design/methodology/approach:In this paper,we use a cross-collection topic model to reveal the research dynamics of topic disappearance topic inheritance,and topic innovation in each generation of forward chaining.Findings:For highly cited work,scientific influence exists in indirect citations.Topic modeling can reveal how long this influence exists in forward chaining,as well as its influence.Research limitations:This paper measures scientific influence and indirect scientific influence only if the relevant words or phrases are borrowed or used in direct or indirect citations.Paraphrasing or semantically similar concept may be neglected in this research.Practical implications:This paper demonstrates that a scientific influence exists in indirect citations through its analysis of forward chaining.This can serve as an inspiration on how to adequately evaluate research influence.Originality:The main contributions of this paper are the following three aspects.First,besides research dynamics of topic inheritance and topic innovation,we model topic disappearance by using a cross-collection topic model.Second,we explore the length and character of the research impact through“citations of citations”content analysis.Finally,we analyze the research dynamics of artificial intelligence researcher Geoffrey Hinton’s publications and the topic dynamics of forward chaining.
文摘Purpose: To design and test a method for normalizing book citations in Google Scholar.Design/methodology/approach: A hybrid citing-side, cited-side normalization method was developed and this was tested on a sample of 285 research monographs. The results were analyzed and conclusions drawn.Findings: The method was technically feasible but required extensive manual intervention because of the poor quality of the Google Scholar data. Research limitations: The sample of books was limited and also all were from one discipline —business and management. Also, the method has only been tested on Google Scholar, it would be useful to test it on Web of Science or Scopus.Practical limitations: Google Scholar is a poor source of data although it does cover a much wider range citation sources that other databases. Originality/value: This is the first method that has been developed specifically for normalizing books which have so far not been able to be normalized.
文摘Citations play an important role in the scientific community by assisting in measuring multifarious policies like the impact of journals,researchers,institutions,and countries.Authors cite papers for different reasons,such as extending previous work,comparing their study with the state-of-the-art,providing background of the field,etc.In recent years,researchers have tried to conceptualize all citations into two broad categories,important and incidental.Such a categorization is very important to enhance scientific output in multiple ways,for instance,(1)Helping a researcher in identifying meaningful citations from a list of 100 to 1000 citations(2)Enhancing the impact factor calculation mechanism by more strongly weighting important citations,and(3)Improving researcher,institutional,and university rankings by only considering important citations.All of these uses depend upon correctly identifying the important citations from the list of all citations in a paper.To date,researchers have utilized many features to classify citations into these broad categories:cue phrases,in-text citation counts,and metadata features,etc.However,contemporary approaches are based on identification of in-text citation counts,mapping sections onto the Introduction,Methods,Results,and Discussion(IMRAD)structure,identifying cue phrases,etc.Identifying such features accurately is a challenging task and is normally conducted manually,with the accuracy of citation classification demonstrated in terms of these manually extracted features.This research proposes to examine the content of the cited and citing pair to identify important citing papers for each cited paper.This content similarity approach was adopted from research paper recommendation approaches.Furthermore,a novel section-based content similarity approach is also proposed.The results show that solely using the abstract of the cited and citing papers can achieve similar accuracy as the stateof-the-art approaches.This makes the proposed approach a viable technique that does not depend on manual identification of complex features.
基金supported by the National Natural Science Foundation of China,Grant numbers 71573225 and 71974167。
文摘Purpose:Providing an overview of types of citation curves.Design/methodology/approach:The terms citation curves or citation graphs are made explicit.Findings:A framework for the study of diachronous(and synchronous)citation curves is proposed.Research limitations:No new practical applications are given.Practical implications:This short note about citation curves will help readers to make the optimal choice for their applications.Originality/value:A new scheme for the meaning of the term"citation curve"is designed.
文摘Citations based relevant research paper recommendations can be generated primarily with the assistance of three citation models:(1)Bibliographic Coupling,(2)Co-Citation,and(3)Direct Citations.Millions of new scholarly articles are published every year.This flux of scientific information has made it a challenging task to devise techniques that could help researchers to find the most relevant research papers for the paper at hand.In this study,we have deployed an in-text citation analysis that extends the Direct Citation Model to discover the nature of the relationship degree-ofrelevancy among scientific papers.For this purpose,the relationship between citing and cited articles is categorized into three categories:weak,medium,and strong.As an experiment,around 5,000 research papers were crawled from the CiteSeerX.These research papers were parsed for the identification of in-text citation frequencies.Subsequently,0.1 million references of those articles were extracted,and their in-text citation frequencies were computed.A comprehensive benchmark dataset was established based on the user study.Afterwards,the results were validated with the help of Least Square Approximation by Quadratic Polynomial method.It was found that degreeof-relevancy between scientific papers is a quadratic increasing/decreasing polynomial with respect to-increase/decrease in the in-text citation frequencies of a cited article.Furthermore,the results of the proposed model were compared with state-of-the-art techniques by utilizing a well-known measure,known as the normalized Discount Cumulative Gain(nDCG).The proposed method received an nDCG score of 0.89,whereas the state-of-the-art models such as the Content,Bibliographic-coupling,and Metadata-based Models were able to acquire the nDCG values of 0.65,0.54,and 0.51 respectively.These results indicate that the proposed mechanism may be applied in future information retrieval systems for better results.