In order to provide predictable runtime performante for text categorization (TC) systems, an innovative system design method is proposed for soft real time TC systems. An analyzable mathematical model is established...In order to provide predictable runtime performante for text categorization (TC) systems, an innovative system design method is proposed for soft real time TC systems. An analyzable mathematical model is established to approximately describe the nonlinear and time-varying TC systems. According to this mathematical model, the feedback control theory is adopted to prove the system's stableness and zero steady state error. The experiments result shows that the error of deadline satisfied ratio in the system is kept within 4 of the desired value. And the number of classifiers can be dynamically adjusted by the system itself to save the computa tion resources. The proposed methodology enables the theo retical analysis and evaluation to the TC systems, leading to a high-quality and low cost implementation approach.展开更多
With the emergence and further development of the digital library, the approaches of information acquisition correspondingly change a lot. This paper makes a statistical analysis on the journal downloading and citatio...With the emergence and further development of the digital library, the approaches of information acquisition correspondingly change a lot. This paper makes a statistical analysis on the journal downloading and citation behaviors under the digital environment conceived by the National Science Library(NSL), Chinese Academy of Sciences(CAS). It can be seen that the development of digital resources has influenced scientific research behaviors. For example, the large quantity of full-text downloading will maintain; the trend of journal downloading behaviors is basically same as the journal citation behavior; journals with large quantity of full-text downloading also boast the high cited times, and vice versa. Furthermore, authors make a linear regression analysis, with the journal downloading amount as the independent variable and journal cited times as dependent variable. Then they also prove the positive correlation between the journal downloading and citation behaviors by means of Pearson's correlation coefficient formula.展开更多
In order to ensure the safety,quality and efficiency of computer numerical control(CNC)machine tool processing,a real-time monitoring and visible solution for CNC machine tools based on hyper text markup language(HTML...In order to ensure the safety,quality and efficiency of computer numerical control(CNC)machine tool processing,a real-time monitoring and visible solution for CNC machine tools based on hyper text markup language(HTML)5 is proposed.The characteristics of the real-time monitoring technology of CNC machine tools under the traditional Client/Server(C/S)structure are compared and analyzed,and the technical drawbacks are proposed.Web real-time communication technology and browser drawing technology are deeply studied.A real-time monitoring and visible system for CNC machine tool data is developed based on Metro platform,combining WebSocket real-time communication technology and Canvas drawing technology.The system architecture is given,and the functions and implementation methods of the system are described in detail.The practical application results show that the WebSocket real-time communication technology can effectively reduce the bandwidth and network delay and save server resources.The numerical control machine data monitoring system can intuitively reflect the machine data,and the visible effect is good.It realizes timely monitoring of equipment alarms and prompts maintenance and management personnel.展开更多
Short text, based on the platform of web2.0, gained rapid development in a relatively short time. Recommendation systems analyzing user’s interest by short texts becomes more and more important. Collaborative filteri...Short text, based on the platform of web2.0, gained rapid development in a relatively short time. Recommendation systems analyzing user’s interest by short texts becomes more and more important. Collaborative filtering is one of the most promising recommendation technologies. However, the existing collaborative filtering methods don’t consider the drifting of user’s interest. This often leads to a big difference between the result of recommendation and user’s real demands. In this paper, according to the traditional collaborative filtering algorithm, a new personalized recommendation algorithm is proposed. It traced user’s interest by using Ebbinghaus Forgetting Curve. Some experiments have been done. The results demonstrated that the new algorithm could indeed make a contribution to getting rid of user’s overdue interests and discovering their real-time interests for more accurate recommendation.展开更多
Opinion (sentiment) analysis on big data streams from the constantly generated text streams on social media networks to hundreds of millions of online consumer reviews provides many organizations in every field with o...Opinion (sentiment) analysis on big data streams from the constantly generated text streams on social media networks to hundreds of millions of online consumer reviews provides many organizations in every field with opportunities to discover valuable intelligence from the massive user generated text streams. However, the traditional content analysis frameworks are inefficient to handle the unprecedentedly big volume of unstructured text streams and the complexity of text analysis tasks for the real time opinion analysis on the big data streams. In this paper, we propose a parallel real time sentiment analysis system: Social Media Data Stream Sentiment Analysis Service (SMDSSAS) that performs multiple phases of sentiment analysis of social media text streams effectively in real time with two fully analytic opinion mining models to combat the scale of text data streams and the complexity of sentiment analysis processing on unstructured text streams. We propose two aspect based opinion mining models: Deterministic and Probabilistic sentiment models for a real time sentiment analysis on the user given topic related data streams. Experiments on the social media Twitter stream traffic captured during the pre-election weeks of the 2016 Presidential election for real-time analysis of public opinions toward two presidential candidates showed that the proposed system was able to predict correctly Donald Trump as the winner of the 2016 Presidential election. The cross validation results showed that the proposed sentiment models with the real-time streaming components in our proposed framework delivered effectively the analysis of the opinions on two presidential candidates with average 81% accuracy for the Deterministic model and 80% for the Probabilistic model, which are 1% - 22% improvements from the results of the existing literature.展开更多
基金Supported by the National Natural Science Foun-dation of China (90104032) ,the National High-Tech Research andDevelopment Plan of China (2003AA1Z2090)
文摘In order to provide predictable runtime performante for text categorization (TC) systems, an innovative system design method is proposed for soft real time TC systems. An analyzable mathematical model is established to approximately describe the nonlinear and time-varying TC systems. According to this mathematical model, the feedback control theory is adopted to prove the system's stableness and zero steady state error. The experiments result shows that the error of deadline satisfied ratio in the system is kept within 4 of the desired value. And the number of classifiers can be dynamically adjusted by the system itself to save the computa tion resources. The proposed methodology enables the theo retical analysis and evaluation to the TC systems, leading to a high-quality and low cost implementation approach.
文摘With the emergence and further development of the digital library, the approaches of information acquisition correspondingly change a lot. This paper makes a statistical analysis on the journal downloading and citation behaviors under the digital environment conceived by the National Science Library(NSL), Chinese Academy of Sciences(CAS). It can be seen that the development of digital resources has influenced scientific research behaviors. For example, the large quantity of full-text downloading will maintain; the trend of journal downloading behaviors is basically same as the journal citation behavior; journals with large quantity of full-text downloading also boast the high cited times, and vice versa. Furthermore, authors make a linear regression analysis, with the journal downloading amount as the independent variable and journal cited times as dependent variable. Then they also prove the positive correlation between the journal downloading and citation behaviors by means of Pearson's correlation coefficient formula.
文摘In order to ensure the safety,quality and efficiency of computer numerical control(CNC)machine tool processing,a real-time monitoring and visible solution for CNC machine tools based on hyper text markup language(HTML)5 is proposed.The characteristics of the real-time monitoring technology of CNC machine tools under the traditional Client/Server(C/S)structure are compared and analyzed,and the technical drawbacks are proposed.Web real-time communication technology and browser drawing technology are deeply studied.A real-time monitoring and visible system for CNC machine tool data is developed based on Metro platform,combining WebSocket real-time communication technology and Canvas drawing technology.The system architecture is given,and the functions and implementation methods of the system are described in detail.The practical application results show that the WebSocket real-time communication technology can effectively reduce the bandwidth and network delay and save server resources.The numerical control machine data monitoring system can intuitively reflect the machine data,and the visible effect is good.It realizes timely monitoring of equipment alarms and prompts maintenance and management personnel.
文摘Short text, based on the platform of web2.0, gained rapid development in a relatively short time. Recommendation systems analyzing user’s interest by short texts becomes more and more important. Collaborative filtering is one of the most promising recommendation technologies. However, the existing collaborative filtering methods don’t consider the drifting of user’s interest. This often leads to a big difference between the result of recommendation and user’s real demands. In this paper, according to the traditional collaborative filtering algorithm, a new personalized recommendation algorithm is proposed. It traced user’s interest by using Ebbinghaus Forgetting Curve. Some experiments have been done. The results demonstrated that the new algorithm could indeed make a contribution to getting rid of user’s overdue interests and discovering their real-time interests for more accurate recommendation.
文摘Opinion (sentiment) analysis on big data streams from the constantly generated text streams on social media networks to hundreds of millions of online consumer reviews provides many organizations in every field with opportunities to discover valuable intelligence from the massive user generated text streams. However, the traditional content analysis frameworks are inefficient to handle the unprecedentedly big volume of unstructured text streams and the complexity of text analysis tasks for the real time opinion analysis on the big data streams. In this paper, we propose a parallel real time sentiment analysis system: Social Media Data Stream Sentiment Analysis Service (SMDSSAS) that performs multiple phases of sentiment analysis of social media text streams effectively in real time with two fully analytic opinion mining models to combat the scale of text data streams and the complexity of sentiment analysis processing on unstructured text streams. We propose two aspect based opinion mining models: Deterministic and Probabilistic sentiment models for a real time sentiment analysis on the user given topic related data streams. Experiments on the social media Twitter stream traffic captured during the pre-election weeks of the 2016 Presidential election for real-time analysis of public opinions toward two presidential candidates showed that the proposed system was able to predict correctly Donald Trump as the winner of the 2016 Presidential election. The cross validation results showed that the proposed sentiment models with the real-time streaming components in our proposed framework delivered effectively the analysis of the opinions on two presidential candidates with average 81% accuracy for the Deterministic model and 80% for the Probabilistic model, which are 1% - 22% improvements from the results of the existing literature.