As a major strategic technology for reducing greenhouse gas emissions and ensuring energy security,carbon capture,utilization,and storage(CCUS)is of great significance to large-scale emission reduction.From the perspe...As a major strategic technology for reducing greenhouse gas emissions and ensuring energy security,carbon capture,utilization,and storage(CCUS)is of great significance to large-scale emission reduction.From the perspective of knowledge discovery,it is important to analyse the study progress based on existing study achievements,excavate the evolution characteristics of study topics over time,review stage-specific findings,and construct CCUS domain knowledge map.This will help researchers gain an overall understanding of CCUS studies and promote the industry-college-research cooperation in respect to CCUS.Based on the Web of Science(WOS)database platform and CitNet-Explorer software,the present study explore the international research progress,topic evolution track,research hotspot and research trend of CCUS technology since its birth nearly 30 years ago,using bibliometric method,citation network visualization analysis method and cluster analysis method.Through the analysis of literature citation network,it is found that:16 CCUS topics,6 hotspots have been studied in the last three decades.The topics of CCUS studies present an evolution path from CCUS technology security and economicfeasibility analysis to CCUS technological popularization,and then CCUS technological improvement and development.Cutting-edge CCUS looks at the process and infrastructure construction,cost effectiveness and development prospect analysis.CCUS focuses on improvement of process technologies and related infrastructure.展开更多
Internet has become a major medium for infomation transmission, how to detect hot topic on web, track the event development and forecast emergency is important to many fields, particularly to some government departmen...Internet has become a major medium for infomation transmission, how to detect hot topic on web, track the event development and forecast emergency is important to many fields, particularly to some government departments. On the basis of the researches in the field of topic detection and tracking, we propose a model for hot topic discovery that will pick out hot topics by automatically detecting, clustering and weighting topics on the websites within a time period. Based on the idea of stock index, we also introduce a topic index approach in following the growth of topics, which is useful to analyze and forecast the development of topics on web.展开更多
Purpose:We present an analytical,open source and flexible natural language processing and text mining method for topic evolution,emerging topic detection and research trend forecasting for all kinds of data-tagged tex...Purpose:We present an analytical,open source and flexible natural language processing and text mining method for topic evolution,emerging topic detection and research trend forecasting for all kinds of data-tagged text.Design/methodology/approach:We make full use of the functions provided by the open source VOSviewer and Microsoft Office,including a thesaurus for data clean-up and a LOOKUP function for comparative analysis.Findings:Through application and verification in the domain of perovskite solar cells research,this method proves to be effective.Research limitations:A certain amount of manual data processing and a specific research domain background are required for better,more illustrative analysis results.Adequate time for analysis is also necessary.Practical implications:We try to set up an easy,useful,and flexible interdisciplinary text analyzing procedure for researchers,especially those without solid computer programming skills or who cannot easily access complex software.This procedure can also serve as a wonderful example for teaching information literacy.Originality/value:This text analysis approach has not been reported before.展开更多
Purpose: In this paper, we combined the method of co-word analysis and alluvial diagram to detect hot topics and illustrate their dynamics. Design/methodology/approach: Articles in the field of scientometrics were c...Purpose: In this paper, we combined the method of co-word analysis and alluvial diagram to detect hot topics and illustrate their dynamics. Design/methodology/approach: Articles in the field of scientometrics were chosen as research cases in this study. A time-sliced co-word network was generated and then clustered. Afterwards, we generated an alluvial diagram to show dynamic changes of hot topics, including their merges and splits over time. Findings: After analyzing the dynamic changes in the field of scientometrics from 2011 to 2015, we found that two clusters being merged did not mean that the old topics had disappeared and a totally new one had emerged. The topics were possibly still active the following year, but the newer topics had drawn more attention. The changes of hot topics reflected the shift in researchers' interests. subdivided and re-merged. For example, several topics as research progressed. Research topics in scientometrics were constantly a cluster involving "industry" was divided into Research limitations: When examining longer time periods, we encounter the problem of dealing with bigger data sets. Analyzing data year by year would be tedious, but if we combine, e.g. two years into one time slice, important details would be missed. Practical implications: This method can be applied to any research field to illustrate the dynamics of hot topics. It can indicate the promising directions for researchers and provide guidance to decision makers. Originality/value: The use of alluvial diagrams is a distinctive and meaningful approach to detecting hot topics and especially to illustrating their dynamics.展开更多
Purpose:To reveal the research hotpots and relationship among three research hot topics in b iomedicine,namely CRISPR,iPS(induced Pluripotent Stem)cell and Synthetic biology.Design/methodology/approach:We set up their...Purpose:To reveal the research hotpots and relationship among three research hot topics in b iomedicine,namely CRISPR,iPS(induced Pluripotent Stem)cell and Synthetic biology.Design/methodology/approach:We set up their keyword co-occurrence networks with using three indicators and information visualization for metric analysis.Findings:The results reveal the main research hotspots in the three topics are different,but the overlapping keywords in the three topics indicate that they are mutually integrated and interacted each other.Research limitations:All analyses use keywords,without any other forms.Practical implications:We try to find the information distribution and structure of these three hot topics for revealing their research status and interactions,and for promoting biomedical developments.Originality/value:We chose the core keywords in three research hot topics in biomedicine by using h-index.展开更多
In social networks,many complex factors affect the prediction of user forwarding behavior.This paper proposes an improved SVM prediction method for user forwarding behavior of hot topics to improve prediction accuracy...In social networks,many complex factors affect the prediction of user forwarding behavior.This paper proposes an improved SVM prediction method for user forwarding behavior of hot topics to improve prediction accuracy.Firstly,we consider that the improved Cuckoo Search algorithm can select the optimal penalty parameters and kernel function parameters to optimize the SVM and thus predict the user's forwarding behavior.Secondly,this paper considers the factors that affect the user forwarding behavior comprehensively from the user's own factors and external factors.Finally,based on the characteristics of the user's forwarding behavior changing over time,the time-slicing method is used to predict the trend of hot topics.Experiments show that the method can accurately predict the user's forwarding behavior and can sense the trend of hot topics.展开更多
Social applications such as Weibo have provided a quick platform for information propagation, which have led to an explosive propagation for hot topic. User sentiments about propagation information play an important r...Social applications such as Weibo have provided a quick platform for information propagation, which have led to an explosive propagation for hot topic. User sentiments about propagation information play an important role in propagation speed, which receive more and more attention from data mining field. In this paper, we propose an sentiment-based hot topics prediction model called PHT-US. PHT-US firstly classifies a large amount of text data in Weibo into different topics, then converts user sentiments and time factors into embedding vectors that are input into recurrent neural networks (both LSTM and GRU), and predicts whether the target topic could be a hot spot. Experiments on Sina Weibo show that PHT-US can effectively predict the hot topics in the future. Social applications such as Weibo provide a platform for quick information propagation, which leads to an explosive propagation for hot topics. User sentiments about propagation information play an important role in propagation speed, and thus receive more attention from data mining field. In this paper, a sentiment-based hot topics prediction model called PHT-US is proposed. Firstly a large amount of text data in Weibo was classified into different topics, and then user sentiments and time factors were converted into embedding vectors that are input into recurrent neural networks (both LSTM and GRU), and future hotspots were predicted. Experiments on Sina Weibo show that PHT-US can effectively predict hot topics in the future.展开更多
Revealing and comparing the evolution process of hot topics in the field of Digital Library in China and abroad.[Methods]:Taking data in the field of Digital Library from core journals in CKNI and Web of Science from ...Revealing and comparing the evolution process of hot topics in the field of Digital Library in China and abroad.[Methods]:Taking data in the field of Digital Library from core journals in CKNI and Web of Science from 1990 s to 2020,topics are extracted by LDA model and hot topics are selected based on life cycle theory.Topic evolution paths are generated to contrast evolution of hot topics between home and abroad which are grouped into dimensions of technology and application.It fails to analyze the lagging performance and reasons of research hot topics in the field of Digital Library at home and abroad.In technological dimension of Digital Library,the research content in China lags behind that at abroad.In terms of application dimension,Chinese application tends to focus on social sciences,while application at abroad tends to focus on natural sciences.The evolution of overall research focus is U-shaped,which gradually shifted from technological research to application research,and now turn back to technological dimension.Nowadays,there are also many emerging topics combined with big data technology.展开更多
Accurately representing the quantity and characteristics of users' interest in certain topics is an important problem facing topic evolution researchers, particularly as it applies to modem online environments. Searc...Accurately representing the quantity and characteristics of users' interest in certain topics is an important problem facing topic evolution researchers, particularly as it applies to modem online environments. Search engines can provide information retrieval for a specified topic from archived data, but fail to reflect changes in interest toward the topic over time in a structured way. This paper reviews notable research on topic evolution based on the probabilistic topic model from multiple aspects over the past decade. First, we introduce notations, terminology, and the basic topic model explored in the survey, then we summarize three categories of topic evolution based on the probabilistic topic model: the discrete time topic evolution model, the continuous time topic evolution model, and the online topic evolution model. Next, we describe applications of the topic evolution model and attempt to summarize model generalization performance evaluation and topic evolution evaluation methods, as well as providing comparative experimental results for different models. To conclude the review, we pose some open questions and discuss possible future research directions.展开更多
Purpose:This paper introduces an analysis framework for tracking the evolution of research topics at the selected topics level,covering a research topic’s evolution trend,evolution path and its content changes over t...Purpose:This paper introduces an analysis framework for tracking the evolution of research topics at the selected topics level,covering a research topic’s evolution trend,evolution path and its content changes over time.Design/methodology/approach:After the topics were recovered by the author-topic model,we first built the keyword-topic co-occurrence network to track the dynamics of topic trends.Then a single-mode network was constructed with each node representing a topic and edge indicating the relationship between topics.It was used to illustrate the evolution path and content changes of research topics.A case study was conducted on the digital library research in China to verify the effectiveness of the analysis framework.Findings:The experimental results show that this analysis framework can be used to track evolution of research topics at a micro level and using social network analysis method can help understand research topics’evolution paths and content changes with the passage of time.Research limitations:Using the analysis framework will produce limited results when examining unstructured data such as social media data.In addition,the effectiveness of the framework introduced in this paper needs to be verified with more research topics in information science and in more scientific fields.Practical implications:This analysis framework can help scholars and researchers map research topics’evolution process and gain insights into how a field’s topics have evolved over time.Originality/value:Tbe analysis framework used in this study can help reveal more micro evolution details.The index to measure topic association strength defined in this paper reflects both similarity and dissimilarity between topics,which belps better understand research topics’evolution paths and content changes.展开更多
This article presents the formal definition and description of popular topics on the Internet,analyzes the relationship between popular words and topics,and finally introduces a method that uses statistics and correla...This article presents the formal definition and description of popular topics on the Internet,analyzes the relationship between popular words and topics,and finally introduces a method that uses statistics and correlation of the popular words in traffic content and network flow characteristics as input for extracting popular topics on the Internet.Based on this,this article adapts a clustering algorithm to extract popular topics and gives formalized results.The test results show that this method has an accuracy of 16.7%in extracting popular topics on the Internet.Compared with web mining and topic detection and tracking(TDT),it can provide a more suitable data source for effective recovery of Internet public opinions.展开更多
The technology innovation management(TIM)field attracts an increasing amount of attention.This paper takes a retrospective look at high-quality publication output in the TIM field over the 55 years from 1968 to 2022,r...The technology innovation management(TIM)field attracts an increasing amount of attention.This paper takes a retrospective look at high-quality publication output in the TIM field over the 55 years from 1968 to 2022,revealing topics,their evolutions,and research trends.A total of 31,498 articles and proceeding papers published during this period are analyzed.The paper first extracts the fine-grained topic words using the tool ITGInsight.Then Linlog algorithm is used to cluster topics based on the cooccurrence of the topic words.Time is integrated within the topic cluster results so that topic evolutions and research trends are analyzed.The TIM field has four main topic clusters:technology research,product research,firm research,and future research.In every topic cluster,there are many fine-sorted macro-topics and micro-topics.There is an obvious increase in diversity in the topic clusters of technology research and firm research.Especially,the evolution of technology research has been closely connected with society.In contrast,product research has declined in its topic size.At the same time,future research maintains a certain stability of its scientific publications.The research predicts that all the four topics will retain their popularity,and play an important role in the TIM field.Among them,technology research will continue to expand and enrich the TIM field.The other three topics will deepen their research for a better development of the TIM field.The paper also proposes some advice for industry professionals,policymakers,and researchers.展开更多
INTRODUCTION Within the last few years,a new group of diseases featured with cognitive impairment, seizures and behavior disorders was reported. And these diseases were commonly diagnosed as 'viral encephalitis...INTRODUCTION Within the last few years,a new group of diseases featured with cognitive impairment, seizures and behavior disorders was reported. And these diseases were commonly diagnosed as 'viral encephalitis' and'sporadic encephalitis' than autoimmune encephalitis (AE) before AE had confirmed etiological.展开更多
The problem of "rich topics get richer"(RTGR) is popular to the topic models,which will bring the wrong topic distribution if the distributing process has not been intervened.In standard LDA(Latent Dirichlet...The problem of "rich topics get richer"(RTGR) is popular to the topic models,which will bring the wrong topic distribution if the distributing process has not been intervened.In standard LDA(Latent Dirichlet Allocation) model,each word in all the documents has the same statistical ability.In fact,the words have different impact towards different topics.Under the guidance of this thought,we extend ILDA(Infinite LDA) by considering the bias role of words to divide the topics.We propose a self-adaptive topic model to overcome the RTGR problem specifically.The model proposed in this paper is adapted to three questions:(1) the topic number is changeable with the collection of the documents,which is suitable for the dynamic data;(2) the words have discriminating attributes to topic distribution;(3) a selfadaptive method is used to realize the automatic re-sampling.To verify our model,we design a topic evolution analysis system which can realize the following functions:the topic classification in each cycle,the topic correlation in the adjacent cycles and the strength calculation of the sub topics in the order.The experiment both on NIPS corpus and our self-built news collections showed that the system could meet the given demand,the result was feasible.展开更多
By using CiteSpace software to create a knowledge map of authors,institutions and keywords,the literature on the spatio-temporal behavior of Chinese residents based on big data in the architectural planning discipline...By using CiteSpace software to create a knowledge map of authors,institutions and keywords,the literature on the spatio-temporal behavior of Chinese residents based on big data in the architectural planning discipline published in the China Academic Network Publishing Database(CNKI)was analyzed and discussed.It is found that there was a lack of communication and cooperation among research institutions and scholars;the research hotspots involved four main areas,including“application in tourism research”,“application in traffic travel research”,“application in work-housing relationship research”,and“application in personal family life research”.展开更多
基金Supported by the Fundamental Research funds for the China Central Universities“CCUS topic detection and evolution analysis based on CitNetExplorer”[Grant number.JBK2002042].
文摘As a major strategic technology for reducing greenhouse gas emissions and ensuring energy security,carbon capture,utilization,and storage(CCUS)is of great significance to large-scale emission reduction.From the perspective of knowledge discovery,it is important to analyse the study progress based on existing study achievements,excavate the evolution characteristics of study topics over time,review stage-specific findings,and construct CCUS domain knowledge map.This will help researchers gain an overall understanding of CCUS studies and promote the industry-college-research cooperation in respect to CCUS.Based on the Web of Science(WOS)database platform and CitNet-Explorer software,the present study explore the international research progress,topic evolution track,research hotspot and research trend of CCUS technology since its birth nearly 30 years ago,using bibliometric method,citation network visualization analysis method and cluster analysis method.Through the analysis of literature citation network,it is found that:16 CCUS topics,6 hotspots have been studied in the last three decades.The topics of CCUS studies present an evolution path from CCUS technology security and economicfeasibility analysis to CCUS technological popularization,and then CCUS technological improvement and development.Cutting-edge CCUS looks at the process and infrastructure construction,cost effectiveness and development prospect analysis.CCUS focuses on improvement of process technologies and related infrastructure.
文摘Internet has become a major medium for infomation transmission, how to detect hot topic on web, track the event development and forecast emergency is important to many fields, particularly to some government departments. On the basis of the researches in the field of topic detection and tracking, we propose a model for hot topic discovery that will pick out hot topics by automatically detecting, clustering and weighting topics on the websites within a time period. Based on the idea of stock index, we also introduce a topic index approach in following the growth of topics, which is useful to analyze and forecast the development of topics on web.
文摘Purpose:We present an analytical,open source and flexible natural language processing and text mining method for topic evolution,emerging topic detection and research trend forecasting for all kinds of data-tagged text.Design/methodology/approach:We make full use of the functions provided by the open source VOSviewer and Microsoft Office,including a thesaurus for data clean-up and a LOOKUP function for comparative analysis.Findings:Through application and verification in the domain of perovskite solar cells research,this method proves to be effective.Research limitations:A certain amount of manual data processing and a specific research domain background are required for better,more illustrative analysis results.Adequate time for analysis is also necessary.Practical implications:We try to set up an easy,useful,and flexible interdisciplinary text analyzing procedure for researchers,especially those without solid computer programming skills or who cannot easily access complex software.This procedure can also serve as a wonderful example for teaching information literacy.Originality/value:This text analysis approach has not been reported before.
基金supported by the National Social Science Foundation of China (Grant No.: 14BTQ030)
文摘Purpose: In this paper, we combined the method of co-word analysis and alluvial diagram to detect hot topics and illustrate their dynamics. Design/methodology/approach: Articles in the field of scientometrics were chosen as research cases in this study. A time-sliced co-word network was generated and then clustered. Afterwards, we generated an alluvial diagram to show dynamic changes of hot topics, including their merges and splits over time. Findings: After analyzing the dynamic changes in the field of scientometrics from 2011 to 2015, we found that two clusters being merged did not mean that the old topics had disappeared and a totally new one had emerged. The topics were possibly still active the following year, but the newer topics had drawn more attention. The changes of hot topics reflected the shift in researchers' interests. subdivided and re-merged. For example, several topics as research progressed. Research topics in scientometrics were constantly a cluster involving "industry" was divided into Research limitations: When examining longer time periods, we encounter the problem of dealing with bigger data sets. Analyzing data year by year would be tedious, but if we combine, e.g. two years into one time slice, important details would be missed. Practical implications: This method can be applied to any research field to illustrate the dynamics of hot topics. It can indicate the promising directions for researchers and provide guidance to decision makers. Originality/value: The use of alluvial diagrams is a distinctive and meaningful approach to detecting hot topics and especially to illustrating their dynamics.
基金the National Natural Science Foundation of China Grant 71673131 for financial support
文摘Purpose:To reveal the research hotpots and relationship among three research hot topics in b iomedicine,namely CRISPR,iPS(induced Pluripotent Stem)cell and Synthetic biology.Design/methodology/approach:We set up their keyword co-occurrence networks with using three indicators and information visualization for metric analysis.Findings:The results reveal the main research hotspots in the three topics are different,but the overlapping keywords in the three topics indicate that they are mutually integrated and interacted each other.Research limitations:All analyses use keywords,without any other forms.Practical implications:We try to find the information distribution and structure of these three hot topics for revealing their research status and interactions,and for promoting biomedical developments.Originality/value:We chose the core keywords in three research hot topics in biomedicine by using h-index.
基金This paper is partially supported by the National Natural Science Foundation of China(Grant No.62006032,62072066)Science and Technology Research Program of Chongqing Municipal Education Commission(Grant No.KJZD-K201900603,KJQN201900629)Chongqing Technology Innovation and Application Development Project(Grant No.cstc2020jscx-msxmX0150).
文摘In social networks,many complex factors affect the prediction of user forwarding behavior.This paper proposes an improved SVM prediction method for user forwarding behavior of hot topics to improve prediction accuracy.Firstly,we consider that the improved Cuckoo Search algorithm can select the optimal penalty parameters and kernel function parameters to optimize the SVM and thus predict the user's forwarding behavior.Secondly,this paper considers the factors that affect the user forwarding behavior comprehensively from the user's own factors and external factors.Finally,based on the characteristics of the user's forwarding behavior changing over time,the time-slicing method is used to predict the trend of hot topics.Experiments show that the method can accurately predict the user's forwarding behavior and can sense the trend of hot topics.
基金the National Natural Science Foundation of China (No. 61602159)the Natural Science Foundation of Heilongjiang Province (No. F201430)+1 种基金the Innovation Talents Project of Science and Technology Bureau of Harbin (No. 2017RAQXJ094)the fundamental research funds of universities in Heilongjiang Province, special fund of Heilongjiang University (No. HDJCCX-201608).
文摘Social applications such as Weibo have provided a quick platform for information propagation, which have led to an explosive propagation for hot topic. User sentiments about propagation information play an important role in propagation speed, which receive more and more attention from data mining field. In this paper, we propose an sentiment-based hot topics prediction model called PHT-US. PHT-US firstly classifies a large amount of text data in Weibo into different topics, then converts user sentiments and time factors into embedding vectors that are input into recurrent neural networks (both LSTM and GRU), and predicts whether the target topic could be a hot spot. Experiments on Sina Weibo show that PHT-US can effectively predict the hot topics in the future. Social applications such as Weibo provide a platform for quick information propagation, which leads to an explosive propagation for hot topics. User sentiments about propagation information play an important role in propagation speed, and thus receive more attention from data mining field. In this paper, a sentiment-based hot topics prediction model called PHT-US is proposed. Firstly a large amount of text data in Weibo was classified into different topics, and then user sentiments and time factors were converted into embedding vectors that are input into recurrent neural networks (both LSTM and GRU), and future hotspots were predicted. Experiments on Sina Weibo show that PHT-US can effectively predict hot topics in the future.
文摘Revealing and comparing the evolution process of hot topics in the field of Digital Library in China and abroad.[Methods]:Taking data in the field of Digital Library from core journals in CKNI and Web of Science from 1990 s to 2020,topics are extracted by LDA model and hot topics are selected based on life cycle theory.Topic evolution paths are generated to contrast evolution of hot topics between home and abroad which are grouped into dimensions of technology and application.It fails to analyze the lagging performance and reasons of research hot topics in the field of Digital Library at home and abroad.In technological dimension of Digital Library,the research content in China lags behind that at abroad.In terms of application dimension,Chinese application tends to focus on social sciences,while application at abroad tends to focus on natural sciences.The evolution of overall research focus is U-shaped,which gradually shifted from technological research to application research,and now turn back to technological dimension.Nowadays,there are also many emerging topics combined with big data technology.
基金Acknowledgements The authors would like to thank the anonymous reviewers for their constructive comments and suggestions, which significantly contributed to improving the manuscript. This work was supported by the National Key Basic Research Project of China (973 Program) (2012CB316400), the National Natural Science Foundation of China (Grant Nos. 61471321, 61202400, 31300539, and 31570629), the Zhejiang Provincial Natural Science Foundation of China (LY15C140005, LY16F010004), Science and Technology Department of Zhejiang Province Public Welfare Project (2016C31G2010057, 2015C31004), Fundamental Research Funds for the Central Universities (172210261) and the Zhejiang Provincial Key Laboratory of Forestry Intelligent Monitoring and Information Technology Research.
文摘Accurately representing the quantity and characteristics of users' interest in certain topics is an important problem facing topic evolution researchers, particularly as it applies to modem online environments. Search engines can provide information retrieval for a specified topic from archived data, but fail to reflect changes in interest toward the topic over time in a structured way. This paper reviews notable research on topic evolution based on the probabilistic topic model from multiple aspects over the past decade. First, we introduce notations, terminology, and the basic topic model explored in the survey, then we summarize three categories of topic evolution based on the probabilistic topic model: the discrete time topic evolution model, the continuous time topic evolution model, and the online topic evolution model. Next, we describe applications of the topic evolution model and attempt to summarize model generalization performance evaluation and topic evolution evaluation methods, as well as providing comparative experimental results for different models. To conclude the review, we pose some open questions and discuss possible future research directions.
文摘Purpose:This paper introduces an analysis framework for tracking the evolution of research topics at the selected topics level,covering a research topic’s evolution trend,evolution path and its content changes over time.Design/methodology/approach:After the topics were recovered by the author-topic model,we first built the keyword-topic co-occurrence network to track the dynamics of topic trends.Then a single-mode network was constructed with each node representing a topic and edge indicating the relationship between topics.It was used to illustrate the evolution path and content changes of research topics.A case study was conducted on the digital library research in China to verify the effectiveness of the analysis framework.Findings:The experimental results show that this analysis framework can be used to track evolution of research topics at a micro level and using social network analysis method can help understand research topics’evolution paths and content changes with the passage of time.Research limitations:Using the analysis framework will produce limited results when examining unstructured data such as social media data.In addition,the effectiveness of the framework introduced in this paper needs to be verified with more research topics in information science and in more scientific fields.Practical implications:This analysis framework can help scholars and researchers map research topics’evolution process and gain insights into how a field’s topics have evolved over time.Originality/value:Tbe analysis framework used in this study can help reveal more micro evolution details.The index to measure topic association strength defined in this paper reflects both similarity and dissimilarity between topics,which belps better understand research topics’evolution paths and content changes.
基金was supported by the National Natural Science Foundation of China (Grant No.60574087)the Hi-Tech Research and Development Program of China (2007AA01Z475,2007AA01Z480,2007A-A01Z464)the 111 International Collaboration Program of China.
文摘This article presents the formal definition and description of popular topics on the Internet,analyzes the relationship between popular words and topics,and finally introduces a method that uses statistics and correlation of the popular words in traffic content and network flow characteristics as input for extracting popular topics on the Internet.Based on this,this article adapts a clustering algorithm to extract popular topics and gives formalized results.The test results show that this method has an accuracy of 16.7%in extracting popular topics on the Internet.Compared with web mining and topic detection and tracking(TDT),it can provide a more suitable data source for effective recovery of Internet public opinions.
基金supported by the General Program of National Natural Science Foundation of China under(Grant No.72074020)the Young Scientists Fund of National Natural Science Foundation of China under(Grant No.72004009,72304074)
文摘The technology innovation management(TIM)field attracts an increasing amount of attention.This paper takes a retrospective look at high-quality publication output in the TIM field over the 55 years from 1968 to 2022,revealing topics,their evolutions,and research trends.A total of 31,498 articles and proceeding papers published during this period are analyzed.The paper first extracts the fine-grained topic words using the tool ITGInsight.Then Linlog algorithm is used to cluster topics based on the cooccurrence of the topic words.Time is integrated within the topic cluster results so that topic evolutions and research trends are analyzed.The TIM field has four main topic clusters:technology research,product research,firm research,and future research.In every topic cluster,there are many fine-sorted macro-topics and micro-topics.There is an obvious increase in diversity in the topic clusters of technology research and firm research.Especially,the evolution of technology research has been closely connected with society.In contrast,product research has declined in its topic size.At the same time,future research maintains a certain stability of its scientific publications.The research predicts that all the four topics will retain their popularity,and play an important role in the TIM field.Among them,technology research will continue to expand and enrich the TIM field.The other three topics will deepen their research for a better development of the TIM field.The paper also proposes some advice for industry professionals,policymakers,and researchers.
文摘INTRODUCTION Within the last few years,a new group of diseases featured with cognitive impairment, seizures and behavior disorders was reported. And these diseases were commonly diagnosed as 'viral encephalitis' and'sporadic encephalitis' than autoimmune encephalitis (AE) before AE had confirmed etiological.
基金ACKNOWLEDGMENTS This work is supported by grants National 973 project (No.2013CB29606), Natural Science Foundation of China (No.61202244), research fund of ShangQiu Normal Colledge (No. 2013GGJS013). N1PS corpus is supported by SourceForge. We thank the anonymous reviewers for their helpful comments.
文摘The problem of "rich topics get richer"(RTGR) is popular to the topic models,which will bring the wrong topic distribution if the distributing process has not been intervened.In standard LDA(Latent Dirichlet Allocation) model,each word in all the documents has the same statistical ability.In fact,the words have different impact towards different topics.Under the guidance of this thought,we extend ILDA(Infinite LDA) by considering the bias role of words to divide the topics.We propose a self-adaptive topic model to overcome the RTGR problem specifically.The model proposed in this paper is adapted to three questions:(1) the topic number is changeable with the collection of the documents,which is suitable for the dynamic data;(2) the words have discriminating attributes to topic distribution;(3) a selfadaptive method is used to realize the automatic re-sampling.To verify our model,we design a topic evolution analysis system which can realize the following functions:the topic classification in each cycle,the topic correlation in the adjacent cycles and the strength calculation of the sub topics in the order.The experiment both on NIPS corpus and our self-built news collections showed that the system could meet the given demand,the result was feasible.
文摘By using CiteSpace software to create a knowledge map of authors,institutions and keywords,the literature on the spatio-temporal behavior of Chinese residents based on big data in the architectural planning discipline published in the China Academic Network Publishing Database(CNKI)was analyzed and discussed.It is found that there was a lack of communication and cooperation among research institutions and scholars;the research hotspots involved four main areas,including“application in tourism research”,“application in traffic travel research”,“application in work-housing relationship research”,and“application in personal family life research”.