With the number of social media users ramping up,microblogs are generated and shared at record levels.The high momentum and large volumes of short texts bring redundancies and noises,in which the users and analysts of...With the number of social media users ramping up,microblogs are generated and shared at record levels.The high momentum and large volumes of short texts bring redundancies and noises,in which the users and analysts often find it problematic to elicit useful information of interest.In this paper,we study a query-focused summarization as a solution to address this issue and propose a novel summarization framework to generate personalized online summaries and historical summaries of arbitrary time durations.Our framework can deal with dynamic,perpetual,and large-scale microblogging streams.Specifically,we propose an online microblogging stream clustering algorithm to cluster microblogs and maintain distilled statistics called Microblog Cluster Vectors(MCV).Then we develop a ranking method to extract the most representative sentences relative to the query from the MCVs and generate a query-focused summary of arbitrary time durations.Our experiments on large-scale real microblogs demonstrate the efficiency and effectiveness of our approach.展开更多
Microblog is a new Internet featured product, which has seen a rapid development in recent years. Researchers from different countries are making various technical analyses on microblogging applications. In this study...Microblog is a new Internet featured product, which has seen a rapid development in recent years. Researchers from different countries are making various technical analyses on microblogging applications. In this study, through using the natural language processing(NLP) and data mining, we analyzed the information content transmitted via a microblog, users' social networks and their interactions, and carried out an empirical analysis on the dissemination process of one particular piece of information via Sina Weibo.Based on the result of these analyses, we attempt to develop a better understanding about the rule and mechanism of the informal information flow in microblogging.展开更多
This study uses the Elaboration Likelihood Model (ELM) and social presence theory to examine the microblogging reposting mechanism. Subjective and objective data were collected from 216 respondents in a field experi...This study uses the Elaboration Likelihood Model (ELM) and social presence theory to examine the microblogging reposting mechanism. Subjective and objective data were collected from 216 respondents in a field experiment. The results indicate that information quality and source credibility of microblogging messages affect users' reposting intention by affecting their perceptions of the usefulness and enjoyment of the information. Perceived enjoyment has a greater impact on reposting intention than perceived usefulness. Furthermore, users are able to perceive social presence when interacting with microblogging messages. Social presence plays a full mediating role between information quality and perceived enjoyment, and a partial mediating role between information quality and perceived usefulness.展开更多
Microblogging services nformation and express opinions pro by vide a novel and popular communication scheme for Web users to share publishing short posts, which usually reflect the users' daily life. We can thus mode...Microblogging services nformation and express opinions pro by vide a novel and popular communication scheme for Web users to share publishing short posts, which usually reflect the users' daily life. We can thus model the users' daily status and interests according to their posts. Because of the high complexity and the large amount of the content of the microblog users' posts, it is necessary to provide a quick summary of the users' life status, both for personal users and commercial services. It is non-trivial to summarize the life status of microblog users, particularly when the summary is conducted over a long period. In this paper, we present a compact interactive visualization prototype, LifeCircle, as an efficient summary for exploring the long-term life status of microblog users. The radial visualization provides multiple views for a given microblog user, including annual topics, monthly keywords, monthly sentiments, and temporal trends of posts. We tightly integrate interactive visualization with novel and state-of-the-art microblogging analytics to maximize their advantages. We implement LifeCircle on Sina Weibo, the most popular microblogging service in China, and illustrate the effectiveness of our prototype with various case studies. Results show that our prototype makes users nostalgic and makes them reminiscent about past events, which helps them to better understand themselves and others展开更多
Traditional anomaly detection on microblogging mostly focuses on individual anomalous users or messages. Since anomalous users employ advanced intelligent means, the anomaly detection is greatly poor in performance. I...Traditional anomaly detection on microblogging mostly focuses on individual anomalous users or messages. Since anomalous users employ advanced intelligent means, the anomaly detection is greatly poor in performance. In this paper, we propose an innovative framework of anomaly detection based on bipartite graph and co-clustering. A bipartite graph between users and messages is built to model the homogeneous and heterogeneous interactions. The proposed co- clustering algorithm based on nonnegative matrix tri-factorization can detect anomalous users and messages simultaneously. The homogeneous relations modeled by the bipartite graph are used as constraints to improve the accuracy of the co- clustering algorithm. Experimental results show that the proposed scheme can detect individual and group anomalies with high accuracy on a Sina Weibo dataset.展开更多
With the development of online social networks,a special group of online users named organized posters(or Internet water army,Internet paid posters in some literatures) have fl ooded the social network communities. Th...With the development of online social networks,a special group of online users named organized posters(or Internet water army,Internet paid posters in some literatures) have fl ooded the social network communities. They are organized in groups to post with specific purposes and sometimes even confuse or mislead normal users.In this paper,we study the individual and group characteristics of organized posters. A classifier is constructed based on the individual and group characteristics to detect them. Extensive experimental results on three real datasets demonstrate that our method based on individual and group characteristics using SVM model(IGCSVM) is effective in detecting organized posters and better than existing methods. We take a first look at finding the promoters based on the detected organized posters of our IGCSVM method. Our experiments show that it is effective in detecting promoters.展开更多
The emergence of government microblog not only widens the channels for the people to participate in politics,but also is of great significance for the government to understand public opinion and promote the process of...The emergence of government microblog not only widens the channels for the people to participate in politics,but also is of great significance for the government to understand public opinion and promote the process of political democratization.In recent years,the frequency of network language in Chinese government microblog has gradually increased.Taking the“Beijing release”official microblog as an example,this paper discusses the characteristics of network language,and puts forward some suggestions on the use of network language in government microblog:in terms of syllables,network language below three syllables should be selected to meet the requirements of short length of government microblog;In terms of part of speech,network terms such as nouns,verbs and adjectives can express emotional attitude and value judgment;In terms of emotion and style,emotional attitude forms such as intimacy,love and respect are selected.In addition,the norms of network language in government microblog can not be generalized.We should not only pay attention to the expression effect,but also set foot in multiple disciplines and fields.展开更多
Microblogs have become an important platform for people to publish,transform information and acquire knowledge.This paper focuses on the problem of discovering user interest in microblogs.In this paper,we propose a to...Microblogs have become an important platform for people to publish,transform information and acquire knowledge.This paper focuses on the problem of discovering user interest in microblogs.In this paper,we propose a topic mining model based on Latent Dirichlet Allocation(LDA) named user-topic model.For each user,the interests are divided into two parts by different ways to generate the microblogs:original interest and retweet interest.We represent a Gibbs sampling implementation for inference the parameters of our model,and discover not only user's original interest,but also retweet interest.Then we combine original interest and retweet interest to compute interest words for users.Experiments on a dataset of Sina microblogs demonstrate that our model is able to discover user interest effectively and outperforms existing topic models in this task.And we find that original interest and retweet interest are similar and the topics of interest contain user labels.The interest words discovered by our model reflect user labels,but range is much broader.展开更多
Based on user's in-degree distribution, traditional ranking algorithms of user's weight usually neglect the considerations of the differences among user's followers and the features of user's tweets. In order to a...Based on user's in-degree distribution, traditional ranking algorithms of user's weight usually neglect the considerations of the differences among user's followers and the features of user's tweets. In order to analyze the factors which impact on user's weight, under the analysis of the data collected from SINA Microblog network, this paper discovers that user influence and active degrees are the dominant factors for this issue. The proposed algorithm evaluates user influence by user's follower number, the influence of user's followers and the reciprocity between users. User's active degree is modeled by user's participation and the quality of user's tweets. The models are tested by different data groups to confirm the parameters for the final calculation. Eventually, this paper compares the computational results with the user's ranking order given by the SINA official application. The performance of this algorithm presents a stronger stability on the fluctuant range of the value of user's weight.展开更多
This paper explores the uses’ influences on microblog. At first, according to the social network theory, we present an analysis of information transmitting network structure based on the relationship of following and...This paper explores the uses’ influences on microblog. At first, according to the social network theory, we present an analysis of information transmitting network structure based on the relationship of following and followed phenomenon of microblog users. Informed by the microblog user behavior analysis, the paper also addresses a model for calculating weights of users’ influence. It proposes a U-R model, using which we can evaluate users’ influence based on PageRank algorithms and analyzes user behaviors. In the U-R model, the effect of user behaviors is explored and PageRank is applied to evaluate the importance and the influence of every user in a microblog network by repeatedly iterating their own U-R value. The users’ influences in a microblog network can be ranked by the U-R value. Finally, the validity of U-R model is proved with a real-life numerical example.展开更多
The emergence of big data leads to an increasing demand for data processing methods.As the most influential media for Chinese domestic movie ratings,Douban contains a huge amount of data and one can understand users...The emergence of big data leads to an increasing demand for data processing methods.As the most influential media for Chinese domestic movie ratings,Douban contains a huge amount of data and one can understand users'perspectives towards these movies by analyzing these data.In this article,we study movie's critics from the Douban website,perform sentiment analysis on the data obtained by crawling,and visualize the results with a word cloud.We propose a lightweight sentiment analysis method which is free from heavy training and visualize the results in a more conceivable way.展开更多
A novel microblog summarization approach via enriching contextual features on sentencelevel semantic analysis is proposed in this paper. At first,a Chinese sentential semantic model( CSM) is employed to analyze the ...A novel microblog summarization approach via enriching contextual features on sentencelevel semantic analysis is proposed in this paper. At first,a Chinese sentential semantic model( CSM) is employed to analyze the semantic structure of each microblog sentence. Then,a combination of sentence-level semantic analysis and latent dirichlet allocation is utilized to acquire extra features and related words to enrich the collection of microblog messages. The simlilarites between the two sentences are calculated based on the enriched features. Finally,the semantic weight and relation weight are calculated to select the most informative sentences,which form the final summary for microblog messages. Experimental results demonstrate the advantages of our proposed approach.The results indicate that introducing sentence-level semantic analysis for context enrichment can better represent sentential semantic. The proposed criteria,namely,semantic weight and relation weight enhance summary result. Furthermore,CSM is a useful framework for sentence-level semantic analysis.展开更多
At present there are many socialized microblog platforms.With powerful mobility,real-time information,fragment of information dissemination,and innovation of interaction,the microblog has become a socialized interacti...At present there are many socialized microblog platforms.With powerful mobility,real-time information,fragment of information dissemination,and innovation of interaction,the microblog has become a socialized interaction mode in recent years.Since microblog is very popular with students of agricultural and forestry higher vocational schools,with the rising and development of network education,the microblog as a new information platform will be used by more and more teachers in education.From the perspective of microblog,this paper studied educational reform in management courses of agricultural and forestry higher vocational schools,in the hope of providing certain reference and help for current education practice of agricultural and forestry management courses.展开更多
Purpose: This paper intends to explore a quantitative method for investigating the characteristics of information diffusion through social media like weblogs and microblogs.By using the social network analysis methods...Purpose: This paper intends to explore a quantitative method for investigating the characteristics of information diffusion through social media like weblogs and microblogs.By using the social network analysis methods,we attempt to analyze the different characteristics of information diffusion in weblogs and microblogs as well as the possible reasons of these differences.Design/methodology/approach: Using the social network analysis methods,this paper carries out an empirical study by taking the Chinese weblogs and microblogs in the field of Library and Information Science(LIS) as the research sample and employing measures such as network density,core/peripheral structure and centrality.Findings: Firstly,both bloggers and microbloggers maintain weak ties,and both of their social networks display a small-world effect. Secondly,compared with weblog users,microblog users are more interconnected,more equal and more capable of developing relationships with people outside their own social networks. Thirdly,the microblogging social network is more conducive to information diffusion than the blogging network,because of their differences in functions and the information flow mechanism. Finally,the communication mode emerged with microblogging,with the characteristics of micro-content,multi-channel information dissemination,dense and decentralized social network and content aggregation,will be one of the trends in the development of the information exchange platform in the future.Research limitations: The sample size needs to be increased so that samples are more representative. Errors may exist during the data collection. Moreover,the individual-level characteristics of the samples as well as the types of information exchanged need to be further studied.Practical implications: This preliminary study explores the characteristics of information diffusion in the network environment and verifies the feasibility of conducting a quantitative analysis of information diffusion through social media. In addition,it provides insight into the characteristics of information diffusion in weblogs and microblogs and the possible reasons of these differences.Originality/value: We have analyzed the characteristics of information diffusion in weblogs and microblogs by using the social network analysis methods. This research will be useful for a quantitative analysis of the underlying mechanisms of information flow through social media in the network environment.展开更多
Forwarding is a major means of information dissemination on the Microblog platform.The article,combining static analysis and dynamic analysis,takes Microblog forwarding as the object of study,and studies the network t...Forwarding is a major means of information dissemination on the Microblog platform.The article,combining static analysis and dynamic analysis,takes Microblog forwarding as the object of study,and studies the network topology of grass-roots Microblog forwarding users.It also studies the correlation between characteristic quantity and forwarding times of Microblog network topology.Furthermore,it conducts modification on virus transmission model,builds and verifies the Microblog forwarding dynamical model.The study finds out that Microblog postings present qute strong dissemination capacity on the initial stage,and some Microblog postings with many forwarding times and long duration of forwarding process due to the dynamic growth of the forwarding user network and the joining of strong nodes make network infection density decrease in some phases.展开更多
Some research work has showed that public mood and stock market price have some relations in some degree. Although it is difficult to clear the relation, the research about the relation between stock market price and ...Some research work has showed that public mood and stock market price have some relations in some degree. Although it is difficult to clear the relation, the research about the relation between stock market price and public mood is interested by some scientists. This paper tries to find the relationship between Chinese stock market and Chinese local Microblog. First, C-POMS(Chinese Profile of Mood States) was proposed to analyze sentiment of Microblog feeds. Then Granger causality test confirmed the relation between C-POMS analysis and price series. SVM and Probabilistic Neural Network were used to make prediction, and experiments show that SVM is better to predict stock market movements than Probabilistic Neural Network. Experiments also indicate that adding certain dimension of C-POMS as the input data will improve the prediction accuracy to 66.667%. Two dimensions to input data leads to the highest accuracy of 71.429%, which is about 20% higher than using only history stock data as the input data. This paper also compared the proposed method with the ROSTEA scores, and concluded that only the proposed method brings more accurate predicts.展开更多
Starting from late 2019,the new coronavirus disease(COVID-19)has become a global crisis.With the development of online social media,people prefer to express their opinions and discuss the latest news online.We have wi...Starting from late 2019,the new coronavirus disease(COVID-19)has become a global crisis.With the development of online social media,people prefer to express their opinions and discuss the latest news online.We have witnessed the positive influence of online social media,which helped citizens and governments track the development of this pandemic in time.It is necessary to apply artificial intelligence(AI)techniques to online social media and automatically discover and track public opinions posted online.In this paper,we take Sina Weibo,the most widely used online social media in China,for analysis and experiments.We collect multi-modal microblogs about COVID-19 from 2020/1/1 to 2020/3/31 with a web crawler,including texts and images posted by users.In order to effectively discover what is being discussed about COVID-19 without human labeling,we propose a unified multi-modal framework,including an unsupervised short-text topic model to discover and track bursty topics,and a self-supervised model to learn image features so that we can retrieve related images about COVID-19.Experimental results have shown the effectiveness and superiority of the proposed models,and also have shown the considerable application prospects for analyzing and tracking public opinions about COVID-19.展开更多
Considering that there exists a strong similarity between behaviors of users and intelligence of swarm of agents,in this paper we propose a novel user recommendation strategy based on particle swarm optimization(PSO)f...Considering that there exists a strong similarity between behaviors of users and intelligence of swarm of agents,in this paper we propose a novel user recommendation strategy based on particle swarm optimization(PSO)for Microblog network. Specifically,a PSO-based algorithm is developed to learn the user influence,where not only the number of followers is incorporated,but also the interactions among users(e.g.,forwarding and commenting on other users' tweets). Three social factors,the influence and the activity of the target user,together with the coherence between users,are fused to improve the performance of proposed recommendation strategy. Experimental results show that,compared to the well-known Page Rank-based algorithm,the proposed strategy performs much better in terms of precision and recall and it can effectively avoid a biased result caused by celebrity effect and zombie fans effect.展开更多
The development of microblog services has a considerable etfect on the patterns oI wed access and Internet resources discovery. Understanding the interrelation between information diffusion in online social media and ...The development of microblog services has a considerable etfect on the patterns oI wed access and Internet resources discovery. Understanding the interrelation between information diffusion in online social media and user web interests can help the web ecosystem stakeholders in developing new services and designing efficient systems with optimized resources. This paper explores whether or not one can infer the trends of topics in the web by observing the Twitter microcosm. Using data- sets collected from Twitter and two representative web services (Google and Alexa), this work con- ducts a comparative analysis between trending patterns of topics in Twitter and in the web by consid- ering both the temporal and spatial perspectives, and finds that individual topics in Twitter and in the web share similar trending patterns both from the temporal and spatial aspects. Nevertheless, the tren- diness in Twitter can precede for a few hours and is highly unstable compared to the one in web. The application of these findings is also discussed on ad keywords planning in Search Engine Marketing.展开更多
The emergence and application of new media are both opportunities and challenges for the development of libraries.In light of statistical analysis of development and use of new media platform tools such as the library...The emergence and application of new media are both opportunities and challenges for the development of libraries.In light of statistical analysis of development and use of new media platform tools such as the library portal websites of independent colleges in Jiangsu Province,WeChat,MicroBlog,and mobile libraries,etc.,the author in this paper is aimed at studying independent college libraries’self-development level and innovation of service models under the new information network environment as well as how to win favorable time and space so as to easily face the advent of cloud service era.展开更多
基金This work was supported by Chongqing Research Program of Basic Research and Frontier Technology(cstc2017jcyjAX0071)Basic and Advanced Research Projects of CSTC(cstc2019jcyjzdxm0102)+1 种基金Chongqing Science and Technology Innovation Leading Talent Support Program(CSTCCXLJRC201908)Science and Technology Research Program of Chongqing Municipal Education Commission(KJZD-K201900605).
文摘With the number of social media users ramping up,microblogs are generated and shared at record levels.The high momentum and large volumes of short texts bring redundancies and noises,in which the users and analysts often find it problematic to elicit useful information of interest.In this paper,we study a query-focused summarization as a solution to address this issue and propose a novel summarization framework to generate personalized online summaries and historical summaries of arbitrary time durations.Our framework can deal with dynamic,perpetual,and large-scale microblogging streams.Specifically,we propose an online microblogging stream clustering algorithm to cluster microblogs and maintain distilled statistics called Microblog Cluster Vectors(MCV).Then we develop a ranking method to extract the most representative sentences relative to the query from the MCVs and generate a query-focused summary of arbitrary time durations.Our experiments on large-scale real microblogs demonstrate the efficiency and effectiveness of our approach.
文摘Microblog is a new Internet featured product, which has seen a rapid development in recent years. Researchers from different countries are making various technical analyses on microblogging applications. In this study, through using the natural language processing(NLP) and data mining, we analyzed the information content transmitted via a microblog, users' social networks and their interactions, and carried out an empirical analysis on the dissemination process of one particular piece of information via Sina Weibo.Based on the result of these analyses, we attempt to develop a better understanding about the rule and mechanism of the informal information flow in microblogging.
基金supported by the National Natural Science Foundation of China (No. 71272028)the MOE Project of Key Research Institute of Humanity and Social Sciences at Universities (No. 13JJD630008)
文摘This study uses the Elaboration Likelihood Model (ELM) and social presence theory to examine the microblogging reposting mechanism. Subjective and objective data were collected from 216 respondents in a field experiment. The results indicate that information quality and source credibility of microblogging messages affect users' reposting intention by affecting their perceptions of the usefulness and enjoyment of the information. Perceived enjoyment has a greater impact on reposting intention than perceived usefulness. Furthermore, users are able to perceive social presence when interacting with microblogging messages. Social presence plays a full mediating role between information quality and perceived enjoyment, and a partial mediating role between information quality and perceived usefulness.
基金supported by the National Natural Science Foundation of China (Nos. 61170196 and 61202140)by the Singapore National Research Foundation under its International Research Centre @ Singapore Funding Initiative and administered by the IDM Programme Office
文摘Microblogging services nformation and express opinions pro by vide a novel and popular communication scheme for Web users to share publishing short posts, which usually reflect the users' daily life. We can thus model the users' daily status and interests according to their posts. Because of the high complexity and the large amount of the content of the microblog users' posts, it is necessary to provide a quick summary of the users' life status, both for personal users and commercial services. It is non-trivial to summarize the life status of microblog users, particularly when the summary is conducted over a long period. In this paper, we present a compact interactive visualization prototype, LifeCircle, as an efficient summary for exploring the long-term life status of microblog users. The radial visualization provides multiple views for a given microblog user, including annual topics, monthly keywords, monthly sentiments, and temporal trends of posts. We tightly integrate interactive visualization with novel and state-of-the-art microblogging analytics to maximize their advantages. We implement LifeCircle on Sina Weibo, the most popular microblogging service in China, and illustrate the effectiveness of our prototype with various case studies. Results show that our prototype makes users nostalgic and makes them reminiscent about past events, which helps them to better understand themselves and others
基金the National Natural Science Foundation of China under Grant No. 61170242, the National High Technology Research and Development 863 Program of China under Grant No. 2012AA012802, and the Fundamental Research Fhnds for the Central Universities of China under Grant No. HEUCF100605.
文摘Traditional anomaly detection on microblogging mostly focuses on individual anomalous users or messages. Since anomalous users employ advanced intelligent means, the anomaly detection is greatly poor in performance. In this paper, we propose an innovative framework of anomaly detection based on bipartite graph and co-clustering. A bipartite graph between users and messages is built to model the homogeneous and heterogeneous interactions. The proposed co- clustering algorithm based on nonnegative matrix tri-factorization can detect anomalous users and messages simultaneously. The homogeneous relations modeled by the bipartite graph are used as constraints to improve the accuracy of the co- clustering algorithm. Experimental results show that the proposed scheme can detect individual and group anomalies with high accuracy on a Sina Weibo dataset.
基金supported by 973 Program of China(Grant No.2013CB329601, 2013CB329602,2013CB329604)NSFC of China(Grant No.60933005,91124002)+1 种基金863 Program of China(Grant No.2012AA01A401, 2012AA01A402)National Key Technology RD Program of China(Grant No.2012BAH38B04, 2012BAH38B06)
文摘With the development of online social networks,a special group of online users named organized posters(or Internet water army,Internet paid posters in some literatures) have fl ooded the social network communities. They are organized in groups to post with specific purposes and sometimes even confuse or mislead normal users.In this paper,we study the individual and group characteristics of organized posters. A classifier is constructed based on the individual and group characteristics to detect them. Extensive experimental results on three real datasets demonstrate that our method based on individual and group characteristics using SVM model(IGCSVM) is effective in detecting organized posters and better than existing methods. We take a first look at finding the promoters based on the detected organized posters of our IGCSVM method. Our experiments show that it is effective in detecting promoters.
文摘The emergence of government microblog not only widens the channels for the people to participate in politics,but also is of great significance for the government to understand public opinion and promote the process of political democratization.In recent years,the frequency of network language in Chinese government microblog has gradually increased.Taking the“Beijing release”official microblog as an example,this paper discusses the characteristics of network language,and puts forward some suggestions on the use of network language in government microblog:in terms of syllables,network language below three syllables should be selected to meet the requirements of short length of government microblog;In terms of part of speech,network terms such as nouns,verbs and adjectives can express emotional attitude and value judgment;In terms of emotion and style,emotional attitude forms such as intimacy,love and respect are selected.In addition,the norms of network language in government microblog can not be generalized.We should not only pay attention to the expression effect,but also set foot in multiple disciplines and fields.
基金This work was supported by the National High Technology Research and Development Program of China(No. 2010AA012505, 2011AA010702, 2012AA01A401 and 2012AA01A402), Chinese National Science Foundation (No. 60933005, 91124002,61303265), National Technology Support Foundation (No. 2012BAH38B04) and National 242 Foundation (No. 2011A010)
文摘Microblogs have become an important platform for people to publish,transform information and acquire knowledge.This paper focuses on the problem of discovering user interest in microblogs.In this paper,we propose a topic mining model based on Latent Dirichlet Allocation(LDA) named user-topic model.For each user,the interests are divided into two parts by different ways to generate the microblogs:original interest and retweet interest.We represent a Gibbs sampling implementation for inference the parameters of our model,and discover not only user's original interest,but also retweet interest.Then we combine original interest and retweet interest to compute interest words for users.Experiments on a dataset of Sina microblogs demonstrate that our model is able to discover user interest effectively and outperforms existing topic models in this task.And we find that original interest and retweet interest are similar and the topics of interest contain user labels.The interest words discovered by our model reflect user labels,but range is much broader.
基金supported by the National Natural Sciences Foundation of China under Grant No. 61172072the Beijing Natural Science Foundation under Grant No. 4112045the Fundamental Research Funds for the Central Universities under Grant No. 2011YJS215
文摘Based on user's in-degree distribution, traditional ranking algorithms of user's weight usually neglect the considerations of the differences among user's followers and the features of user's tweets. In order to analyze the factors which impact on user's weight, under the analysis of the data collected from SINA Microblog network, this paper discovers that user influence and active degrees are the dominant factors for this issue. The proposed algorithm evaluates user influence by user's follower number, the influence of user's followers and the reciprocity between users. User's active degree is modeled by user's participation and the quality of user's tweets. The models are tested by different data groups to confirm the parameters for the final calculation. Eventually, this paper compares the computational results with the user's ranking order given by the SINA official application. The performance of this algorithm presents a stronger stability on the fluctuant range of the value of user's weight.
文摘This paper explores the uses’ influences on microblog. At first, according to the social network theory, we present an analysis of information transmitting network structure based on the relationship of following and followed phenomenon of microblog users. Informed by the microblog user behavior analysis, the paper also addresses a model for calculating weights of users’ influence. It proposes a U-R model, using which we can evaluate users’ influence based on PageRank algorithms and analyzes user behaviors. In the U-R model, the effect of user behaviors is explored and PageRank is applied to evaluate the importance and the influence of every user in a microblog network by repeatedly iterating their own U-R value. The users’ influences in a microblog network can be ranked by the U-R value. Finally, the validity of U-R model is proved with a real-life numerical example.
文摘The emergence of big data leads to an increasing demand for data processing methods.As the most influential media for Chinese domestic movie ratings,Douban contains a huge amount of data and one can understand users'perspectives towards these movies by analyzing these data.In this article,we study movie's critics from the Douban website,perform sentiment analysis on the data obtained by crawling,and visualize the results with a word cloud.We propose a lightweight sentiment analysis method which is free from heavy training and visualize the results in a more conceivable way.
基金Supported by 242 National Information Security Projects(2017A149)
文摘A novel microblog summarization approach via enriching contextual features on sentencelevel semantic analysis is proposed in this paper. At first,a Chinese sentential semantic model( CSM) is employed to analyze the semantic structure of each microblog sentence. Then,a combination of sentence-level semantic analysis and latent dirichlet allocation is utilized to acquire extra features and related words to enrich the collection of microblog messages. The simlilarites between the two sentences are calculated based on the enriched features. Finally,the semantic weight and relation weight are calculated to select the most informative sentences,which form the final summary for microblog messages. Experimental results demonstrate the advantages of our proposed approach.The results indicate that introducing sentence-level semantic analysis for context enrichment can better represent sentential semantic. The proposed criteria,namely,semantic weight and relation weight enhance summary result. Furthermore,CSM is a useful framework for sentence-level semantic analysis.
文摘At present there are many socialized microblog platforms.With powerful mobility,real-time information,fragment of information dissemination,and innovation of interaction,the microblog has become a socialized interaction mode in recent years.Since microblog is very popular with students of agricultural and forestry higher vocational schools,with the rising and development of network education,the microblog as a new information platform will be used by more and more teachers in education.From the perspective of microblog,this paper studied educational reform in management courses of agricultural and forestry higher vocational schools,in the hope of providing certain reference and help for current education practice of agricultural and forestry management courses.
基金supported by Sun Yat-sen University Cultivation Fund for Young Teachers(Grant No.:20000-3161102)the National Social Science Fundation of China(Grant No.:08CTQ015)
文摘Purpose: This paper intends to explore a quantitative method for investigating the characteristics of information diffusion through social media like weblogs and microblogs.By using the social network analysis methods,we attempt to analyze the different characteristics of information diffusion in weblogs and microblogs as well as the possible reasons of these differences.Design/methodology/approach: Using the social network analysis methods,this paper carries out an empirical study by taking the Chinese weblogs and microblogs in the field of Library and Information Science(LIS) as the research sample and employing measures such as network density,core/peripheral structure and centrality.Findings: Firstly,both bloggers and microbloggers maintain weak ties,and both of their social networks display a small-world effect. Secondly,compared with weblog users,microblog users are more interconnected,more equal and more capable of developing relationships with people outside their own social networks. Thirdly,the microblogging social network is more conducive to information diffusion than the blogging network,because of their differences in functions and the information flow mechanism. Finally,the communication mode emerged with microblogging,with the characteristics of micro-content,multi-channel information dissemination,dense and decentralized social network and content aggregation,will be one of the trends in the development of the information exchange platform in the future.Research limitations: The sample size needs to be increased so that samples are more representative. Errors may exist during the data collection. Moreover,the individual-level characteristics of the samples as well as the types of information exchanged need to be further studied.Practical implications: This preliminary study explores the characteristics of information diffusion in the network environment and verifies the feasibility of conducting a quantitative analysis of information diffusion through social media. In addition,it provides insight into the characteristics of information diffusion in weblogs and microblogs and the possible reasons of these differences.Originality/value: We have analyzed the characteristics of information diffusion in weblogs and microblogs by using the social network analysis methods. This research will be useful for a quantitative analysis of the underlying mechanisms of information flow through social media in the network environment.
基金The research is supported by National Basic Research Program of China (973 Program),Project of National Natural Science Foundation of China,the Fundamental Research Funds for the Central Universities (2013RC0603)."
文摘Forwarding is a major means of information dissemination on the Microblog platform.The article,combining static analysis and dynamic analysis,takes Microblog forwarding as the object of study,and studies the network topology of grass-roots Microblog forwarding users.It also studies the correlation between characteristic quantity and forwarding times of Microblog network topology.Furthermore,it conducts modification on virus transmission model,builds and verifies the Microblog forwarding dynamical model.The study finds out that Microblog postings present qute strong dissemination capacity on the initial stage,and some Microblog postings with many forwarding times and long duration of forwarding process due to the dynamic growth of the forwarding user network and the joining of strong nodes make network infection density decrease in some phases.
基金supported by the National High Technology Research and Development Program of China(863 Program)(No.2015AA050204)
文摘Some research work has showed that public mood and stock market price have some relations in some degree. Although it is difficult to clear the relation, the research about the relation between stock market price and public mood is interested by some scientists. This paper tries to find the relationship between Chinese stock market and Chinese local Microblog. First, C-POMS(Chinese Profile of Mood States) was proposed to analyze sentiment of Microblog feeds. Then Granger causality test confirmed the relation between C-POMS analysis and price series. SVM and Probabilistic Neural Network were used to make prediction, and experiments show that SVM is better to predict stock market movements than Probabilistic Neural Network. Experiments also indicate that adding certain dimension of C-POMS as the input data will improve the prediction accuracy to 66.667%. Two dimensions to input data leads to the highest accuracy of 71.429%, which is about 20% higher than using only history stock data as the input data. This paper also compared the proposed method with the ROSTEA scores, and concluded that only the proposed method brings more accurate predicts.
基金This paper is supported by the Fundamental Research Funds for the Central Universities[No.JUSRP12021].
文摘Starting from late 2019,the new coronavirus disease(COVID-19)has become a global crisis.With the development of online social media,people prefer to express their opinions and discuss the latest news online.We have witnessed the positive influence of online social media,which helped citizens and governments track the development of this pandemic in time.It is necessary to apply artificial intelligence(AI)techniques to online social media and automatically discover and track public opinions posted online.In this paper,we take Sina Weibo,the most widely used online social media in China,for analysis and experiments.We collect multi-modal microblogs about COVID-19 from 2020/1/1 to 2020/3/31 with a web crawler,including texts and images posted by users.In order to effectively discover what is being discussed about COVID-19 without human labeling,we propose a unified multi-modal framework,including an unsupervised short-text topic model to discover and track bursty topics,and a self-supervised model to learn image features so that we can retrieve related images about COVID-19.Experimental results have shown the effectiveness and superiority of the proposed models,and also have shown the considerable application prospects for analyzing and tracking public opinions about COVID-19.
基金supported by National Natural Science Foundation of China(No.61171109)Applied Basic Research Programs of Sichuan Science and Technology Department(No.2014JY0215)Basic Research Plan in SWUST(No.13zx9101)
文摘Considering that there exists a strong similarity between behaviors of users and intelligence of swarm of agents,in this paper we propose a novel user recommendation strategy based on particle swarm optimization(PSO)for Microblog network. Specifically,a PSO-based algorithm is developed to learn the user influence,where not only the number of followers is incorporated,but also the interactions among users(e.g.,forwarding and commenting on other users' tweets). Three social factors,the influence and the activity of the target user,together with the coherence between users,are fused to improve the performance of proposed recommendation strategy. Experimental results show that,compared to the well-known Page Rank-based algorithm,the proposed strategy performs much better in terms of precision and recall and it can effectively avoid a biased result caused by celebrity effect and zombie fans effect.
基金Supported by the Beijing Municipal Natural Science Foundation(No.2015AA010201)
文摘The development of microblog services has a considerable etfect on the patterns oI wed access and Internet resources discovery. Understanding the interrelation between information diffusion in online social media and user web interests can help the web ecosystem stakeholders in developing new services and designing efficient systems with optimized resources. This paper explores whether or not one can infer the trends of topics in the web by observing the Twitter microcosm. Using data- sets collected from Twitter and two representative web services (Google and Alexa), this work con- ducts a comparative analysis between trending patterns of topics in Twitter and in the web by consid- ering both the temporal and spatial perspectives, and finds that individual topics in Twitter and in the web share similar trending patterns both from the temporal and spatial aspects. Nevertheless, the tren- diness in Twitter can precede for a few hours and is highly unstable compared to the one in web. The application of these findings is also discussed on ad keywords planning in Search Engine Marketing.
文摘The emergence and application of new media are both opportunities and challenges for the development of libraries.In light of statistical analysis of development and use of new media platform tools such as the library portal websites of independent colleges in Jiangsu Province,WeChat,MicroBlog,and mobile libraries,etc.,the author in this paper is aimed at studying independent college libraries’self-development level and innovation of service models under the new information network environment as well as how to win favorable time and space so as to easily face the advent of cloud service era.