13 articles found
Topic Evolution and Emerging Topic Analysis Based on Open Source Software (cited by: 4)
1
Authors: Xiang Shen, Li Wang. Journal of Data and Information Science (CSCD), 2020, No. 4, pp. 126-136, 11 pages
Purpose: We present an analytical, open source and flexible natural language processing and text mining method for topic evolution, emerging topic detection and research trend forecasting for all kinds of data-tagged text. Design/methodology/approach: We make full use of the functions provided by the open source VOSviewer and Microsoft Office, including a thesaurus for data clean-up and a LOOKUP function for comparative analysis. Findings: Through application and verification in the domain of perovskite solar cells research, this method proves to be effective. Research limitations: A certain amount of manual data processing and a specific research domain background are required for better, more illustrative analysis results. Adequate time for analysis is also necessary. Practical implications: We try to set up an easy, useful, and flexible interdisciplinary text analyzing procedure for researchers, especially those without solid computer programming skills or who cannot easily access complex software. This procedure can also serve as a wonderful example for teaching information literacy. Originality/value: This text analysis approach has not been reported before.
Keywords: topic evolution; Emerging topics; Text mining; THESAURUS; VOSviewer
Three decades of topic evolution,hot spot mining and prospect in CCUS Studies based on CitNetExplorer
2
Authors: Huajing Zhang, Ding Li, Xuan Gu, Nan Chen. Chinese Journal of Population, Resources and Environment, 2022, No. 1, pp. 91-104, 14 pages
As a major strategic technology for reducing greenhouse gas emissions and ensuring energy security, carbon capture, utilization, and storage (CCUS) is of great significance to large-scale emission reduction. From the perspective of knowledge discovery, it is important to analyse study progress based on existing achievements, trace how study topics evolve over time, review stage-specific findings, and construct a CCUS domain knowledge map. This will help researchers gain an overall understanding of CCUS studies and promote industry-college-research cooperation on CCUS. Based on the Web of Science (WOS) database platform and the CitNetExplorer software, the present study explores the international research progress, topic evolution track, research hotspots and research trends of CCUS technology since its birth nearly 30 years ago, using bibliometric, citation network visualization and cluster analysis methods. Analysis of the literature citation network shows that 16 CCUS topics and 6 hotspots have been studied in the last three decades. The topics of CCUS studies follow an evolution path from CCUS technology security and economic feasibility analysis to CCUS technological popularization, and then to CCUS technological improvement and development. Cutting-edge CCUS research looks at process and infrastructure construction, cost effectiveness and development prospect analysis. CCUS focuses on improvement of process technologies and related infrastructure.
Keywords: CCUS; CitNetExplorer; BIBLIOMETRIC; Citation network; topic evolution
Technology Innovation Management:Topic Evolutions and Research Trends from 1968 to 2022
3
Authors: Xinhang Zhao, Xuefeng Wang, Hongshu Chen, Yuqin Liu, Zhinan Wang. Innovation and Development Policy, 2023, No. 2, pp. 100-122, 23 pages
The technology innovation management (TIM) field attracts an increasing amount of attention. This paper takes a retrospective look at high-quality publication output in the TIM field over the 55 years from 1968 to 2022, revealing topics, their evolutions, and research trends. A total of 31,498 articles and proceedings papers published during this period are analyzed. The paper first extracts fine-grained topic words using the tool ITGInsight. Then the LinLog algorithm is used to cluster topics based on the co-occurrence of the topic words. Time is integrated into the topic cluster results so that topic evolutions and research trends can be analyzed. The TIM field has four main topic clusters: technology research, product research, firm research, and future research. Every topic cluster contains many fine-sorted macro-topics and micro-topics. There is an obvious increase in diversity in the topic clusters of technology research and firm research. In particular, the evolution of technology research has been closely connected with society. In contrast, product research has declined in topic size. At the same time, future research maintains a certain stability in its scientific publications. The research predicts that all four topics will retain their popularity and play an important role in the TIM field. Among them, technology research will continue to expand and enrich the TIM field. The other three topics will deepen their research for a better development of the TIM field. The paper also offers some advice for industry professionals, policymakers, and researchers.
Keywords: technology innovation management; word co-occurrence; topic cluster; topic evolution; research trend; ITGInsight
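The co-occurrence step mentioned in this abstract can be illustrated with a minimal sketch. It only counts how often two topic words appear in the same document; it is not ITGInsight or the LinLog clustering used by the authors, and the function name `cooccurrence_counts` and the toy documents are hypothetical.

```python
from collections import Counter
from itertools import combinations


def cooccurrence_counts(docs):
    """Count how often each unordered pair of topic words shares a document."""
    counts = Counter()
    for words in docs:
        # De-duplicate within a document and sort so each pair has one canonical order.
        for a, b in combinations(sorted(set(words)), 2):
            counts[(a, b)] += 1
    return counts


# Hypothetical toy corpus: each inner list holds the topic words of one document.
docs = [
    ["innovation", "patent", "firm"],
    ["innovation", "firm"],
    ["patent", "product"],
]
counts = cooccurrence_counts(docs)
```

A pair-count table of this kind is the usual input to a clustering or graph-layout step; which algorithm consumes it is left open here.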
Topic evolution based on the probabilistic topic model: a review (cited by: 5)
4
Authors: Houkui ZHOU, Huimin YU, Roland HU. Frontiers of Computer Science (SCIE, EI, CSCD), 2017, No. 5, pp. 786-802, 17 pages
Accurately representing the quantity and characteristics of users' interest in certain topics is an important problem facing topic evolution researchers, particularly as it applies to modern online environments. Search engines can provide information retrieval for a specified topic from archived data, but fail to reflect changes in interest toward the topic over time in a structured way. This paper reviews notable research on topic evolution based on the probabilistic topic model from multiple aspects over the past decade. First, we introduce notations, terminology, and the basic topic model explored in the survey, then we summarize three categories of topic evolution based on the probabilistic topic model: the discrete time topic evolution model, the continuous time topic evolution model, and the online topic evolution model. Next, we describe applications of the topic evolution model and attempt to summarize model generalization performance evaluation and topic evolution evaluation methods, as well as providing comparative experimental results for different models. To conclude the review, we pose some open questions and discuss possible future research directions.
Keywords: topic evolution; probabilistic topic models; text corpora; evaluation method
Mapping the evolution of research topics using ATM and SNA (cited by: 1)
5
Author: Chunlei YE. Chinese Journal of Library and Information Science, 2014, No. 4, pp. 46-62, 17 pages
Purpose: This paper introduces an analysis framework for tracking the evolution of research topics at the selected topics level, covering a research topic's evolution trend, evolution path and its content changes over time. Design/methodology/approach: After the topics were recovered by the author-topic model, we first built the keyword-topic co-occurrence network to track the dynamics of topic trends. Then a single-mode network was constructed with each node representing a topic and each edge indicating the relationship between topics. It was used to illustrate the evolution path and content changes of research topics. A case study was conducted on digital library research in China to verify the effectiveness of the analysis framework. Findings: The experimental results show that this analysis framework can be used to track the evolution of research topics at a micro level, and that the social network analysis method can help in understanding research topics' evolution paths and content changes over time. Research limitations: Using the analysis framework will produce limited results when examining unstructured data such as social media data. In addition, the effectiveness of the framework introduced in this paper needs to be verified with more research topics in information science and in more scientific fields. Practical implications: This analysis framework can help scholars and researchers map research topics' evolution process and gain insights into how a field's topics have evolved over time. Originality/value: The analysis framework used in this study can help reveal more micro evolution details. The index to measure topic association strength defined in this paper reflects both similarity and dissimilarity between topics, which helps better understand research topics' evolution paths and content changes.
Keywords: topic evolution; Social network analysis (SNA); Author-topic model (ATM); Digital library; topic network
Self-Adaptive Topic Model: A Solution to the Problem of "Rich Topics Get Richer" (cited by: 1)
6
Author: FANG Ying. China Communications (SCIE, CSCD), 2014, No. 12, pp. 35-43, 9 pages
The problem of "rich topics get richer" (RTGR) is common in topic models and yields a wrong topic distribution if the distributing process is not intervened in. In the standard LDA (Latent Dirichlet Allocation) model, each word in all the documents has the same statistical ability. In fact, words have different impacts on different topics. Guided by this idea, we extend ILDA (Infinite LDA) by considering the bias role of words in dividing the topics. We propose a self-adaptive topic model specifically to overcome the RTGR problem. The model proposed in this paper addresses three questions: (1) the topic number changes with the collection of documents, which suits dynamic data; (2) words have discriminating attributes for topic distribution; (3) a self-adaptive method is used to realize automatic re-sampling. To verify our model, we design a topic evolution analysis system that realizes the following functions: topic classification in each cycle, topic correlation in adjacent cycles, and strength calculation of the sub-topics in order. Experiments on both the NIPS corpus and our self-built news collections showed that the system could meet the given demand and that the result was feasible.
Keywords: topic model; infinite Latent Dirichlet Allocation; Dirichlet process; topic evolution
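The "topic correlation in adjacent cycles" function named in this abstract can be sketched as follows. The abstract does not say which similarity measure the system uses, so cosine similarity over topic-word distributions is an assumption here, and `cosine`, `link_topics`, the threshold value and the toy distributions are all hypothetical.

```python
import math


def cosine(p, q):
    """Cosine similarity between two sparse word->weight topic distributions."""
    vocab = set(p) | set(q)
    dot = sum(p.get(w, 0.0) * q.get(w, 0.0) for w in vocab)
    norm_p = math.sqrt(sum(v * v for v in p.values()))
    norm_q = math.sqrt(sum(v * v for v in q.values()))
    if norm_p == 0.0 or norm_q == 0.0:
        return 0.0
    return dot / (norm_p * norm_q)


def link_topics(prev_topics, next_topics, threshold=0.5):
    """Return (prev_index, next_index, similarity) for topic pairs at or above the threshold."""
    links = []
    for i, p in enumerate(prev_topics):
        for j, q in enumerate(next_topics):
            s = cosine(p, q)
            if s >= threshold:
                links.append((i, j, s))
    return links


# Hypothetical topic-word distributions for two adjacent time cycles.
prev_topics = [{"bus": 0.6, "school": 0.4}]
next_topics = [{"bus": 0.6, "school": 0.4}, {"air": 1.0}]
links = link_topics(prev_topics, next_topics)
```

Linked pairs can then be read as a topic continuing from one cycle into the next, while unlinked topics are candidates for newly emerging or fading topics.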
Topic discovery and evolution in scientific literature based on content and citations (cited by: 5)
7
Authors: Hou-kui ZHOU, Hui-min YU, Roland HU. Frontiers of Information Technology & Electronic Engineering (SCIE, EI, CSCD), 2017, No. 10, pp. 1511-1524, 14 pages
Researchers across the globe have been increasingly interested in the manner in which important research topics evolve over time within the corpus of scientific literature. In a dataset of scientific articles, each document can be considered to comprise both the words of the document itself and its citations of other documents. In this paper, we propose a citation-content-latent Dirichlet allocation (LDA) topic discovery method that accounts for both document citation relations and the content of the document itself via a probabilistic generative model. The citation-content-LDA topic model exploits a two-level topic model that includes the citation information for 'father' topics and text information for sub-topics. The model parameters are estimated by a collapsed Gibbs sampling algorithm. We also propose a topic evolution algorithm that runs in two steps: topic segmentation and topic dependency relation calculation. We have tested the proposed citation-content-LDA model and topic evolution algorithm on two online datasets, IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI) and IEEE Computer Society (CS), to demonstrate that our algorithm effectively discovers important topics and reflects the topic evolution of important research themes. According to our evaluation metrics, citation-content-LDA outperforms both content-LDA and citation-LDA.
Keywords: topic extraction; topic evolution; Evaluation method
Overview of Trends in Global Single Cell Research Based on Bibliometric Analysis and LDA Model (2009–2019) (cited by: 2)
8
Authors: Tian Jiang, Xiaoping Liu, Chao Zhang, Chuanhao Yin, Huizhou Liu. Journal of Data and Information Science (CSCD), 2021, No. 2, pp. 163-178, 16 pages
Purpose: This article aims to describe the global research profile and the development trends of single cell research from the perspective of bibliometric analysis and semantic mining. Design/methodology/approach: The literature on single cell research was extracted from Clarivate Analytics' Web of Science Core Collection between 2009 and 2019. Firstly, bibliometric analyses were performed with Thomson Data Analyzer (TDA). Secondly, topic identification and evolution trend analysis of single cell research were conducted with the LDA topic model. Thirdly, taking as a reference the post-discretized method used for topic evolution analysis, the topics were also dispersed to countries to detect their spatial distribution. Findings: The publication of single cell research shows a significantly increasing tendency in the last decade. The topics of the single cell research field can be divided into three categories, which respectively refer to single cell research methods, mechanisms of biological processes, and clinical application of single cell technologies. The different trends of these categories indicate that technological innovation drives the development of applied research. The continuous and rapid growth of topic strength in the field of cancer diagnosis and treatment indicates that this research topic has received extensive attention in recent years. The topic distributions of some countries are relatively balanced, while for the other countries several topics show significant superiority. Research limitations: The analyzed data of this study only contain records included in the Web of Science Core Collection. Practical implications: This study provides insights into research progress in the single cell field and identifies the topics of greatest concern, which reflect potential opportunities and challenges. The national topic distribution analysis based on the post-discretized analysis method extends topic analysis from the time dimension to the space dimension. Originality/value: This paper combines bibliometric analysis and the LDA model to analyze the evolution trends of the single cell research field. The method of extending post-discretized analysis from the time dimension to the space dimension is distinctive and insightful.
Keywords: LDA model; topic evolution; Bibliometric analysis; Post-discretized; Single cell
Contrastive analysis in China and abroad on the Evolution of hot topics in the field of digital library based on LDA model (cited by: 1)
9
Authors: Chunhui Tan, Mengyuan Xiong. Data Science and Informetrics, 2021, No. 2, pp. 110-130, 21 pages
The study reveals and compares the evolution process of hot topics in the field of Digital Library in China and abroad. [Methods]: Taking data in the field of Digital Library from core journals in CNKI and Web of Science from the 1990s to 2020, topics are extracted by the LDA model and hot topics are selected based on life cycle theory. Topic evolution paths are generated to contrast the evolution of hot topics at home and abroad, which are grouped into the dimensions of technology and application. It fails to analyze the lagging performance and reasons of research hot topics in the field of Digital Library at home and abroad. In the technological dimension of Digital Library, the research content in China lags behind that abroad. In the application dimension, Chinese applications tend to focus on social sciences, while applications abroad tend to focus on natural sciences. The evolution of the overall research focus is U-shaped: it gradually shifted from technological research to application research and has now turned back to the technological dimension. Nowadays, there are also many emerging topics combined with big data technology.
Keywords: LDA Model; topic Life cycle; topic evolution; Digital Library; Hot topic
Trends Analysis of Graphene Research and Development
10
Authors: Lixue Zou, Li Wang, Yingqi Wu, Caroline Ma, Sunny Yu, Xiwen Liu. Journal of Data and Information Science (CSCD), 2018, No. 1, pp. 82-100, 19 pages
Purpose: This study aims to reveal the landscape and trends of graphene research in the world by using data from Chemical Abstracts Service (CAS). Design/methodology/approach: Index data from CAS were retrieved on 78,756 papers and 23,057 patents on graphene from 1985 to March 2016, and scientometric methods were used to analyze the growth and distribution of R&D output, topic distribution and evolution, and distribution and evolution of substance properties and roles. Findings: In recent years R&D in graphene has kept growing rapidly; China, South Korea and the United States are the largest producers in research, but China is relatively weak in patent applications in other countries. Research topics in graphene are continuously expanding from mechanical, material, and electrical properties to a diverse range of application areas such as batteries, capacitors, semiconductors, and sensor devices. The roles of emerging substances are increasing in Preparation and Biological Study. More techniques have been included to improve the preparation processes and applications of graphene in various fields. Research limitations: Only data from CAS are used, and some R&D activities reported solely through other channels may be missed. More detailed analysis also needs to be done to reveal the impact of research on development or vice versa, development dynamics among the players, and the impact of emerging terms or substance roles on research and technology development. Practical implications: This will provide a valuable reference for scientists and developers, R&D managers, R&D policy makers, and industrial and business investors to understand the landscape and trends of graphene research. Its methodologies can be applied to other fields or with data from other similar sources. Originality/value: The integrative use of CAS indexing data on papers and patents and the systematic exploration of the distribution trends in output, topics and substance roles are distinctive and insightful.
Keywords: GRAPHENE; R&D distribution; topic distribution and evolution; Substance roles distribution and evolution; Text mining
TOPICS AND TRENDS OF THE ON-LINE PUBLIC CONCERNS BASED ON TIANYA FORUM (cited by: 11)
11
Authors: Lina Cao, Xijin Tang. Journal of Systems Science and Systems Engineering (SCIE, EI, CSCD), 2014, No. 2, pp. 212-230, 19 pages
Many social events spread fast through the Internet and arouse wide community discussions. Those on-line public opinions develop into diverse topics over time. Moreover, the strength of the topics fluctuates. How to catch both the primary topics and the trend of topics in shifting on-line discussions is not only of theoretical importance for scientific research, but also of practical importance for societal management, especially in current China. Trying cutting-edge text analytic technologies to deal with unstructured on-line public opinions and provide support for social problem-solving in the big data era is worth the endeavour. This paper applies the dynamic topic model (DTM) to explore the changing topics of new posts collected from the Tianya Zatan Board of Tianya Club, the most influential Chinese BBS in China's Mainland. By analysis of the hot and cold term trends, we catch the topic shifts of main on-line concerns, illustrated with the topics of school bus and environment in December 2011. An algorithm is proposed to compute the strength fluctuation of each topic. With visualized analysis of the respective main topics in several months of 2012, some patterns of topic fluctuation on the board are summarized.
Keywords: topic models; dynamic topic model; on-line topics evolution; Tianya Club; societal management
A bibliometric analysis of worldwide cancer research using machine learning methods
12
Authors: Lianghong Lin, Likeng Liang, Maojie Wang, Runyue Huang, Mengchun Gong, Guangjun Song, Tianyong Hao. Cancer Innovation, 2023, No. 3, pp. 219-232, 14 pages
With the progress and development of computer technology, applying machine learning methods to cancer research has become an important research field. To analyze the most recent research status and trends, main research topics, topic evolutions, research collaborations, and potential directions of this research field, this study conducts a bibliometric analysis on 6206 research articles worldwide collected from PubMed between 2011 and 2021 concerning cancer research using machine learning methods. Python is used as a tool for bibliometric analysis, Gephi is used for social network analysis, and the Latent Dirichlet Allocation model is used for topic modeling. The trend analysis of articles not only reflects the innovative research at the intersection of machine learning and cancer but also demonstrates its vigorous development and increasing impacts. In terms of journals, Nature Communications is the most influential journal and Scientific Reports is the most prolific one. The United States and Harvard University have contributed the most to cancer research using machine learning methods. As for the research topic, “Support Vector Machine,” “classification,” and “deep learning” have been the core focuses of the research field. Findings are helpful for scholars and related practitioners to better understand the development status and trends of cancer research using machine learning methods, as well as to have a deeper understanding of research hotspots.
Keywords: bibliometric analysis; CANCER; Latent Dirichlet Allocation; machine learning; research topic; topic evolution
Online Latent Dirichlet Allocation Model Based on Sentiment Polarity Time Series
13
Authors: HUANG Bo, JU Jiaji, CHEN Huan, ZHU Yimin, LIU Jin, SHI Zhicai. Wuhan University Journal of Natural Sciences (CAS, CSCD), 2021, No. 6, pp. 464-472, 9 pages
The Product Sensitive Online Dirichlet Allocation model (PSOLDA) proposed in this paper mainly uses the sentiment polarity of topic words in the review text to improve the accuracy of topic evolution. First, we use Latent Dirichlet Allocation (LDA) to obtain the distribution of topic words in the current time window. Second, the word2vec word vector is used as auxiliary information to determine the sentiment polarity and obtain the sentiment polarity distribution of the current topic. Finally, the sentiment polarity changes of the topics in the previous and next time window are mapped to the sentiment factors, and the distribution of topic words in the next time window is controlled through them. The experimental results show that the PSOLDA model decreases the probability distribution by 0.1601, while Online Twitter LDA only increases by 0.0699. The topic evolution method that integrates the sentimental information of topic words proposed in this paper is better than the traditional model.
Keywords: topic evolution; sentiment factors; word vector; Latent Dirichlet Allocation (LDA)