With the spread and development of new epidemics,it is of great reference value to identify the changing trends of epidemics in public emotions.We designed and implemented the COVID-19 public opinion monitoring system...With the spread and development of new epidemics,it is of great reference value to identify the changing trends of epidemics in public emotions.We designed and implemented the COVID-19 public opinion monitoring system based on time series thermal new word mining.A new word structure discovery scheme based on the timing explosion of network topics and a Chinese sentiment analysis method for the COVID-19 public opinion environment are proposed.Establish a“Scrapy-Redis-Bloomfilter”distributed crawler framework to collect data.The system can judge the positive and negative emotions of the reviewer based on the comments,and can also reflect the depth of the seven emotions such as Hopeful,Happy,and Depressed.Finally,we improved the sentiment discriminant model of this system and compared the sentiment discriminant error of COVID-19 related comments with the Jiagu deep learning model.The results show that our model has better generalization ability and smaller discriminant error.We designed a large data visualization screen,which can clearly show the trend of public emotions,the proportion of various emotion categories,keywords,hot topics,etc.,and fully and intuitively reflect the development of public opinion.展开更多
Sentiment analysis or opinion mining(OM)concepts become familiar due to advances in networking technologies and social media.Recently,massive amount of text has been generated over Internet daily which makes the patte...Sentiment analysis or opinion mining(OM)concepts become familiar due to advances in networking technologies and social media.Recently,massive amount of text has been generated over Internet daily which makes the pattern recognition and decision making process difficult.Since OM find useful in business sectors to improve the quality of the product as well as services,machine learning(ML)and deep learning(DL)models can be considered into account.Besides,the hyperparameters involved in the DL models necessitate proper adjustment process to boost the classification process.Therefore,in this paper,a new Artificial Fish Swarm Optimization with Bidirectional Long Short Term Memory(AFSO-BLSTM)model has been developed for OM process.The major intention of the AFSO-BLSTM model is to effectively mine the opinions present in the textual data.In addition,the AFSO-BLSTM model undergoes pre-processing and TF-IFD based feature extraction process.Besides,BLSTM model is employed for the effectual detection and classification of opinions.Finally,the AFSO algorithm is utilized for effective hyperparameter adjustment process of the BLSTM model,shows the novelty of the work.A complete simulation study of the AFSO-BLSTM model is validated using benchmark dataset and the obtained experimental values revealed the high potential of the AFSO-BLSTM model on mining opinions.展开更多
Sentiment analysis or Opinion Mining (OM) has gained significant interest among research communities and entrepreneurs in the recentyears. Likewise, Machine Learning (ML) approaches is one of the interestingresearch d...Sentiment analysis or Opinion Mining (OM) has gained significant interest among research communities and entrepreneurs in the recentyears. Likewise, Machine Learning (ML) approaches is one of the interestingresearch domains that are highly helpful and are increasingly applied in severalbusiness domains. In this background, the current research paper focuses onthe design of automated opinion mining model using Deer Hunting Optimization Algorithm (DHOA) with Fuzzy Neural Network (FNN) abbreviatedas DHOA-FNN model. The proposed DHOA-FNN technique involves fourdifferent stages namely, preprocessing, feature extraction, classification, andparameter tuning. In addition to the above, the proposed DHOA-FNN modelhas two stages of feature extraction namely, Glove and N-gram approach.Moreover, FNN model is utilized as a classification model whereas GTOA isused for the optimization of parameters. The novelty of current work is thatthe GTOA is designed to tune the parameters of FNN model. An extensiverange of simulations was carried out on the benchmark dataset and the resultswere examined under diverse measures. The experimental results highlightedthe promising performance of DHOA-FNN model over recent state-of-the-arttechniques with a maximum accuracy of 0.9928.展开更多
Opinion summarization recapitulates the opinions about a common topic automatically.The primary motive of summarization is to preserve the properties of the text and is shortened in a way with no loss in the semantics...Opinion summarization recapitulates the opinions about a common topic automatically.The primary motive of summarization is to preserve the properties of the text and is shortened in a way with no loss in the semantics of the text.The need of automatic summarization efficiently resulted in increased interest among communities of Natural Language Processing and Text Mining.This paper emphasis on building an extractive summarization system combining the features of principal component analysis for dimensionality reduction and bidirectional Recurrent Neural Networks and Long Short-Term Memory(RNN-LSTM)deep learning model for short and exact synopsis using seq2seq model.It presents a paradigm shift with regard to the way extractive summaries are generated.Novel algorithms for word extraction using assertions are proposed.The semantic framework is well-grounded in this research facilitating the correct decision making process after reviewing huge amount of online reviews,considering all its important features into account.The advantages of the proposed solution provides greater computational efficiency,better inferences from social media,data understanding,robustness and handling sparse data.Experiments on the different datasets also outperforms the previous researches and the accuracy is claimed to achieve more than the baselines,showing the efficiency and the novelty in the research paper.The comparisons are done by calculating accuracy with different baselines using Rouge tool.展开更多
At present,the immense development of social networks allows generating a significant amount of textual data,which has facilitated researchers to explore the field of opinion mining.In addition,the processing of textu...At present,the immense development of social networks allows generating a significant amount of textual data,which has facilitated researchers to explore the field of opinion mining.In addition,the processing of textual opinions based on the term frequency-inverse document frequency method gives rise to a dimensionality problem.This study aims to detect the nature of opinions in the Arabic language employing a swarm intelligence(SI)-based algorithm,Harris hawks algorithm,to select the most relevant terms.The experimental study has been tested on two datasets:Arabic Jordanian General Tweets and Opinion Corpus for Arabic.In terms of accuracy and number of features,the results are better than those of other SI based algorithms,such as grey wolf optimizer and grasshopper optimization algorithm,and other algorithms in the literature,such as differential evolution,genetic algorithm,particle swarm optimization,basic and enhanced whale optimizer algorithm,slap swarm algorithm,and ant–lion optimizer.展开更多
This paper focuses on how to improve aspect-level opinion mining for online customer reviews. We first propose a novel generative topic model, the Joint Aspect/Sentiment (JAS) model, to jointly extract aspects and asp...This paper focuses on how to improve aspect-level opinion mining for online customer reviews. We first propose a novel generative topic model, the Joint Aspect/Sentiment (JAS) model, to jointly extract aspects and aspect-dependent sentiment lexicons from online customer reviews. An aspect-dependent sentiment lexicon refers to the aspect-specific opinion words along with their aspect-aware sentiment polarities with respect to a specific aspect. We then apply the extracted aspectdependent sentiment lexicons to a series of aspect-level opinion mining tasks, including implicit aspect identification, aspect-based extractive opinion summarization, and aspect-level sentiment classification. Experimental results demonstrate the effectiveness of the JAS model in learning aspectdependent sentiment lexicons and the practical values of the extracted lexicons when applied to these practical tasks.展开更多
Opinion Mining(OM)studies in Arabic are limited though it is one of the most extensively-spoken languages worldwide.Though the interest in OM studies in the Arabic language is growing among researchers,it needs a vast...Opinion Mining(OM)studies in Arabic are limited though it is one of the most extensively-spoken languages worldwide.Though the interest in OM studies in the Arabic language is growing among researchers,it needs a vast number of investigations due to the unique morphological principles of the language.Arabic OM studies experience multiple challenges owing to the poor existence of language sources and Arabic-specific linguistic features.The comparative OM studies in the English language are wide and novel.But,comparative OM studies in the Arabic language are yet to be established and are still in a nascent stage.The unique features of the Arabic language make it essential to expand the studies regarding the Arabic text.It contains unique featuressuchasdiacritics,elongation,inflectionandwordlength.Thecurrent study proposes a Political Optimizer with Probabilistic Neural Network-based Comparative Opinion Mining(POPNN-COM)model for the Arabic text.The proposed POPNN-COM model aims to recognize comparative and non-comparative texts in Arabic in the context of social media.Initially,the POPNN-COM model involves different levels of data pre-processing to transform the input data into a useful format.Then,the pre-processed data is fed into the PNN model for classification and recognition of the data under different class labels.At last,the PO algorithm is employed for fine-tuning the parameters involved in this model to achieve enhanced results.The proposed POPNN-COM model was experimentally validated using two standard datasets,and the outcomes established the promising performance of the proposed POPNN-COM method over other recent approaches.展开更多
The Internet provides a large number of tools and resources, such as social media sites, online newsgroups, blogs, electronic forums, virtual communities, and online travel sites, for consumers to express their views ...The Internet provides a large number of tools and resources, such as social media sites, online newsgroups, blogs, electronic forums, virtual communities, and online travel sites, for consumers to express their views or opinions regarding various issues. These opinions can help organizations like tourism to improve their products and services for their consumers. Opinion mining refers to a process of identifying emotions by applying Natural Language Processing (NLP) techniques to a pool of texts. This paper mainly focuses on mining public opinion from the hotel reviews domain. To do so, we proposed a novel technique called the Attention-Based Long Short Term Memory (Attention-LSTM) Network using a transfer learning approach. We empirically analyzed several machine learning and deep learning methods and observed our proposed technique provided an adequate performance for mining public opinion in the hotel reviews domain.展开更多
Global changes took place at a neck-breaking speed in lots of fields along with the Web 2.0 era, which can be stated as the new Internet trend. Web pages which once were a statical structure that can be said to become...Global changes took place at a neck-breaking speed in lots of fields along with the Web 2.0 era, which can be stated as the new Internet trend. Web pages which once were a statical structure that can be said to become dynamic pages created by users, and in this regard they can be said to have been democratized by evolving. Social media, which were structured alongside with this era, by providing a large data flow for businesses, present new and improvable opportunities in the field of creating effective strategies. There are lots of blogs in today's Internet environment which includes customer ideas regarding the products/services that they possess. This environment, which in a way globalizes the customer ideas, is a new medium suitable for examination in terms of its increasing the business-customer interaction and due to its transporter nature; it provides the text data that may be analyzed in the field of Customer Relationship Management to businesses. Thus, businesses should follow blog environments to see how the product/service they provide is greeted in terms of the customer focus and it should be seen as an important job on which they can conduct effective analyses. For this purpose, a model proposal that will assign the ideas to the Turkish blogs was given in the study. Opinion mining methods were used in the model, and so to perceive a general look-on about products/services, a methodology was devised, which will assign the text based opinion data on the Turkish blogs to the poles. Success of the pole assignment of the model is evaluated with the precision measure.展开更多
Blog opinion retrieval aims to find blogs with opinionated information related to a given topic.Its main problem is to compute the opinion score,which balances topic relevance and opinion relevance.To deal with this p...Blog opinion retrieval aims to find blogs with opinionated information related to a given topic.Its main problem is to compute the opinion score,which balances topic relevance and opinion relevance.To deal with this problem a generative model deduced by a Bayesian approach is pro-posed,and an improved mixture model is proposed to estimate the opinion relevance between a blog and a given topic in our retrieval framework.Moreover,pointwise mutual information is used to expand sentiment words for different topics based on a general sentimental lexicon.The correlation between topic and candidate words is applied in the process of both expanding sentiment words and estimating sentence opinion scores.Experimental results show that the proposed approaches improve upon the state-of-the-art opinion retrieval method on TREC2010 dataset.展开更多
The Web development has drastically changed the human interaction and communication, leading to an exponential growth of data generated by users in various digital media. This mass of data provides opportunities for u...The Web development has drastically changed the human interaction and communication, leading to an exponential growth of data generated by users in various digital media. This mass of data provides opportunities for understanding people’s opinions about products, services, processes, events, political movements, and organizational strategies. In this context, it becomes important for companies to be able to assess customer satisfaction about their products or services. One of the ways to evaluate customer sentiment is the use of Sentiment Analysis, also known as Opinion Mining. This research aims to compare the efficiency of an automatic classifier based on dictionary with the classification by human jurors in a set of comments made by customers in Portuguese language. The data consist of opinions of service users of one of the largest Brazilian online employment agencies. The performance evaluation of the classification models was done using Kappa index and a Confusion Matrix. As the main finding, it is noteworthy that the agreement between the classifier and the human jurors came to moderate, with better performance for the dictionary-based classifier. This result was considered satisfactory, considering that the Sentiment Analysis in Portuguese language is a complex task and demands more research and development.展开更多
This research measures the reliability of audit firms in predicting bankruptcy for United States (US) listed financial institutions. The object of analysis is the going concern opinion (GCO), widely considered as ...This research measures the reliability of audit firms in predicting bankruptcy for United States (US) listed financial institutions. The object of analysis is the going concern opinion (GCO), widely considered as a bankruptcy warning signal to stakeholders. The sample is composed of 42 US listed financial companies that filed for Chapter 11 between 1998 and 2011. To highlight the differences between bankrupting and healthy firms, a matching sample composed of 42 randomly picked healthy US listed financial companies is collected. We concentrate on financial institutions, whereas the existing literature pays considerably greater attention to the industrial sector. This research imbalance is remarkable and particularly unexpected in the wake of recent financial scandals. Literature points out two main approaches on bankruptcy prediction: (1) purely mathematical; and (2) approaches based on a combination of auditor knowledge, expertise, and experience. The use of data mining techniques allows us to benefit from the best features of both approaches. Statistical tools used in the analysis are: Logit regression, support vector machines (SVMs), and an AdaBoost meta-algorithm. Findings show a quite low reliability of GCOs in predicting bankruptcy. It is likely that auditors consider further information in supporting their audit opinions, aside from financial-economic ratios. The scant predictive ability of auditors might be due to critical relationships with distressed clients, as suggested by recent literature.展开更多
Community based churn prediction,or the assignment of recognising the influence of a customer’s community in churn prediction has become an important concern for firms in many different industries.While churn predi...Community based churn prediction,or the assignment of recognising the influence of a customer’s community in churn prediction has become an important concern for firms in many different industries.While churn prediction until recent times have focused only on transactional dataset(targeted approach),the untargeted approach through product advisement,digital marketing and expressions in customer’s opinion on the social media like Twitter,have not been fully harnessed.Although this data source has become an important influencing factor with lasting impact on churn management.Since Social Network Analysis(SNA)has become a blended approach for churn prediction and management in modern era,customers residing online predominantly and collectively decide and determines the momentum of churn prediction,retention and decision support.In existing SNA approaches,customers are classified as churner or non-churner(1 or 0).Oftentimes,the customer’s opinion is also neglected and the network structure of community members are not exploited.Consequently,the pattern and influential abilities of customers’opinion on relative members of the community are not analysed.Thus,the research developed a Churn Service Information Graph(CSIG)to define a quadruple churn category(churner,potential churner,inertia customer,premium customer)for non-opinionated customers via the power of relative affinity around opinionated customers on a direct node to node SNA.The essence is to use data mining technique to investigate the patterns of opinion between people in a network or group.Consequently,every member of the online social network community is dynamically classified into a churn category for an improved targeted customer acquisition,retention and/or decision supports in churn management.展开更多
基金This work was supported by the Hainan Provincial Natural Science Foundation of China[2019RC041,2019RC098]National Natural Science Foundation of China[61762033]+3 种基金Hainan University Doctor Start Fund Project[kyqd1328]Hainan University Youth Fund Project[qnjj1444]Ministry of education humanities and social sciences research program fund project(19YJA710010)The Opening Project of Shanghai Trusted Industrial Control Platform(Grant No.TICPSH202003005-ZC).
文摘With the spread and development of new epidemics,it is of great reference value to identify the changing trends of epidemics in public emotions.We designed and implemented the COVID-19 public opinion monitoring system based on time series thermal new word mining.A new word structure discovery scheme based on the timing explosion of network topics and a Chinese sentiment analysis method for the COVID-19 public opinion environment are proposed.Establish a“Scrapy-Redis-Bloomfilter”distributed crawler framework to collect data.The system can judge the positive and negative emotions of the reviewer based on the comments,and can also reflect the depth of the seven emotions such as Hopeful,Happy,and Depressed.Finally,we improved the sentiment discriminant model of this system and compared the sentiment discriminant error of COVID-19 related comments with the Jiagu deep learning model.The results show that our model has better generalization ability and smaller discriminant error.We designed a large data visualization screen,which can clearly show the trend of public emotions,the proportion of various emotion categories,keywords,hot topics,etc.,and fully and intuitively reflect the development of public opinion.
基金The authors extend their appreciation to the Deanship of Scientific Research at King Khalid University for funding this work under grant number(RGP 2/142/43).
文摘Sentiment analysis or opinion mining(OM)concepts become familiar due to advances in networking technologies and social media.Recently,massive amount of text has been generated over Internet daily which makes the pattern recognition and decision making process difficult.Since OM find useful in business sectors to improve the quality of the product as well as services,machine learning(ML)and deep learning(DL)models can be considered into account.Besides,the hyperparameters involved in the DL models necessitate proper adjustment process to boost the classification process.Therefore,in this paper,a new Artificial Fish Swarm Optimization with Bidirectional Long Short Term Memory(AFSO-BLSTM)model has been developed for OM process.The major intention of the AFSO-BLSTM model is to effectively mine the opinions present in the textual data.In addition,the AFSO-BLSTM model undergoes pre-processing and TF-IFD based feature extraction process.Besides,BLSTM model is employed for the effectual detection and classification of opinions.Finally,the AFSO algorithm is utilized for effective hyperparameter adjustment process of the BLSTM model,shows the novelty of the work.A complete simulation study of the AFSO-BLSTM model is validated using benchmark dataset and the obtained experimental values revealed the high potential of the AFSO-BLSTM model on mining opinions.
基金Taif University Researchers Supporting Project Number(TURSP-2020/216),Taif University,Taif,Saudi Arabia.
文摘Sentiment analysis or Opinion Mining (OM) has gained significant interest among research communities and entrepreneurs in the recentyears. Likewise, Machine Learning (ML) approaches is one of the interestingresearch domains that are highly helpful and are increasingly applied in severalbusiness domains. In this background, the current research paper focuses onthe design of automated opinion mining model using Deer Hunting Optimization Algorithm (DHOA) with Fuzzy Neural Network (FNN) abbreviatedas DHOA-FNN model. The proposed DHOA-FNN technique involves fourdifferent stages namely, preprocessing, feature extraction, classification, andparameter tuning. In addition to the above, the proposed DHOA-FNN modelhas two stages of feature extraction namely, Glove and N-gram approach.Moreover, FNN model is utilized as a classification model whereas GTOA isused for the optimization of parameters. The novelty of current work is thatthe GTOA is designed to tune the parameters of FNN model. An extensiverange of simulations was carried out on the benchmark dataset and the resultswere examined under diverse measures. The experimental results highlightedthe promising performance of DHOA-FNN model over recent state-of-the-arttechniques with a maximum accuracy of 0.9928.
基金to the Deanship of Scientific Research at King Faisal University for its financial support,with reference to the research grant number as 216082.
文摘Opinion summarization recapitulates the opinions about a common topic automatically.The primary motive of summarization is to preserve the properties of the text and is shortened in a way with no loss in the semantics of the text.The need of automatic summarization efficiently resulted in increased interest among communities of Natural Language Processing and Text Mining.This paper emphasis on building an extractive summarization system combining the features of principal component analysis for dimensionality reduction and bidirectional Recurrent Neural Networks and Long Short-Term Memory(RNN-LSTM)deep learning model for short and exact synopsis using seq2seq model.It presents a paradigm shift with regard to the way extractive summaries are generated.Novel algorithms for word extraction using assertions are proposed.The semantic framework is well-grounded in this research facilitating the correct decision making process after reviewing huge amount of online reviews,considering all its important features into account.The advantages of the proposed solution provides greater computational efficiency,better inferences from social media,data understanding,robustness and handling sparse data.Experiments on the different datasets also outperforms the previous researches and the accuracy is claimed to achieve more than the baselines,showing the efficiency and the novelty in the research paper.The comparisons are done by calculating accuracy with different baselines using Rouge tool.
基金This research was supported by Misr International University(MIU),(Grant Number.DSA28211231302952)to Diaa Salama,https://www.miuegypt.edu.eg/.
文摘At present,the immense development of social networks allows generating a significant amount of textual data,which has facilitated researchers to explore the field of opinion mining.In addition,the processing of textual opinions based on the term frequency-inverse document frequency method gives rise to a dimensionality problem.This study aims to detect the nature of opinions in the Arabic language employing a swarm intelligence(SI)-based algorithm,Harris hawks algorithm,to select the most relevant terms.The experimental study has been tested on two datasets:Arabic Jordanian General Tweets and Opinion Corpus for Arabic.In terms of accuracy and number of features,the results are better than those of other SI based algorithms,such as grey wolf optimizer and grasshopper optimization algorithm,and other algorithms in the literature,such as differential evolution,genetic algorithm,particle swarm optimization,basic and enhanced whale optimizer algorithm,slap swarm algorithm,and ant–lion optimizer.
基金supported by National Natural Science Foundation of China under Grants No.61232010, No.60903139, No.60933005, No.61202215, No.61100083National 242 Project under Grant No.2011F65China Information Technology Security Evaluation Center Program under Grant No.Z1277
文摘This paper focuses on how to improve aspect-level opinion mining for online customer reviews. We first propose a novel generative topic model, the Joint Aspect/Sentiment (JAS) model, to jointly extract aspects and aspect-dependent sentiment lexicons from online customer reviews. An aspect-dependent sentiment lexicon refers to the aspect-specific opinion words along with their aspect-aware sentiment polarities with respect to a specific aspect. We then apply the extracted aspectdependent sentiment lexicons to a series of aspect-level opinion mining tasks, including implicit aspect identification, aspect-based extractive opinion summarization, and aspect-level sentiment classification. Experimental results demonstrate the effectiveness of the JAS model in learning aspectdependent sentiment lexicons and the practical values of the extracted lexicons when applied to these practical tasks.
基金Princess Nourah bint Abdulrahman University Researchers Supporting Project Number(PNURSP2022R263)Princess Nourah bint Abdulrahman University,Riyadh,Saudi ArabiaThe authors would like to thank the Deanship of Scientific Research at Umm Al-Qura University for supporting this work by Grant Code:22UQU4310373DSR56.
文摘Opinion Mining(OM)studies in Arabic are limited though it is one of the most extensively-spoken languages worldwide.Though the interest in OM studies in the Arabic language is growing among researchers,it needs a vast number of investigations due to the unique morphological principles of the language.Arabic OM studies experience multiple challenges owing to the poor existence of language sources and Arabic-specific linguistic features.The comparative OM studies in the English language are wide and novel.But,comparative OM studies in the Arabic language are yet to be established and are still in a nascent stage.The unique features of the Arabic language make it essential to expand the studies regarding the Arabic text.It contains unique featuressuchasdiacritics,elongation,inflectionandwordlength.Thecurrent study proposes a Political Optimizer with Probabilistic Neural Network-based Comparative Opinion Mining(POPNN-COM)model for the Arabic text.The proposed POPNN-COM model aims to recognize comparative and non-comparative texts in Arabic in the context of social media.Initially,the POPNN-COM model involves different levels of data pre-processing to transform the input data into a useful format.Then,the pre-processed data is fed into the PNN model for classification and recognition of the data under different class labels.At last,the PO algorithm is employed for fine-tuning the parameters involved in this model to achieve enhanced results.The proposed POPNN-COM model was experimentally validated using two standard datasets,and the outcomes established the promising performance of the proposed POPNN-COM method over other recent approaches.
文摘The Internet provides a large number of tools and resources, such as social media sites, online newsgroups, blogs, electronic forums, virtual communities, and online travel sites, for consumers to express their views or opinions regarding various issues. These opinions can help organizations like tourism to improve their products and services for their consumers. Opinion mining refers to a process of identifying emotions by applying Natural Language Processing (NLP) techniques to a pool of texts. This paper mainly focuses on mining public opinion from the hotel reviews domain. To do so, we proposed a novel technique called the Attention-Based Long Short Term Memory (Attention-LSTM) Network using a transfer learning approach. We empirically analyzed several machine learning and deep learning methods and observed our proposed technique provided an adequate performance for mining public opinion in the hotel reviews domain.
文摘Global changes took place at a neck-breaking speed in lots of fields along with the Web 2.0 era, which can be stated as the new Internet trend. Web pages which once were a statical structure that can be said to become dynamic pages created by users, and in this regard they can be said to have been democratized by evolving. Social media, which were structured alongside with this era, by providing a large data flow for businesses, present new and improvable opportunities in the field of creating effective strategies. There are lots of blogs in today's Internet environment which includes customer ideas regarding the products/services that they possess. This environment, which in a way globalizes the customer ideas, is a new medium suitable for examination in terms of its increasing the business-customer interaction and due to its transporter nature; it provides the text data that may be analyzed in the field of Customer Relationship Management to businesses. Thus, businesses should follow blog environments to see how the product/service they provide is greeted in terms of the customer focus and it should be seen as an important job on which they can conduct effective analyses. For this purpose, a model proposal that will assign the ideas to the Turkish blogs was given in the study. Opinion mining methods were used in the model, and so to perceive a general look-on about products/services, a methodology was devised, which will assign the text based opinion data on the Turkish blogs to the poles. Success of the pole assignment of the model is evaluated with the precision measure.
基金Supported by the National Natural Science Foundation of China(61370137,61672098,61272361)the Ministry of Education-China Mobile Research Foundation Project(2015/5-9,2016/2-7)
文摘Blog opinion retrieval aims to find blogs with opinionated information related to a given topic.Its main problem is to compute the opinion score,which balances topic relevance and opinion relevance.To deal with this problem a generative model deduced by a Bayesian approach is pro-posed,and an improved mixture model is proposed to estimate the opinion relevance between a blog and a given topic in our retrieval framework.Moreover,pointwise mutual information is used to expand sentiment words for different topics based on a general sentimental lexicon.The correlation between topic and candidate words is applied in the process of both expanding sentiment words and estimating sentence opinion scores.Experimental results show that the proposed approaches improve upon the state-of-the-art opinion retrieval method on TREC2010 dataset.
文摘The Web development has drastically changed the human interaction and communication, leading to an exponential growth of data generated by users in various digital media. This mass of data provides opportunities for understanding people’s opinions about products, services, processes, events, political movements, and organizational strategies. In this context, it becomes important for companies to be able to assess customer satisfaction about their products or services. One of the ways to evaluate customer sentiment is the use of Sentiment Analysis, also known as Opinion Mining. This research aims to compare the efficiency of an automatic classifier based on dictionary with the classification by human jurors in a set of comments made by customers in Portuguese language. The data consist of opinions of service users of one of the largest Brazilian online employment agencies. The performance evaluation of the classification models was done using Kappa index and a Confusion Matrix. As the main finding, it is noteworthy that the agreement between the classifier and the human jurors came to moderate, with better performance for the dictionary-based classifier. This result was considered satisfactory, considering that the Sentiment Analysis in Portuguese language is a complex task and demands more research and development.
文摘This research measures the reliability of audit firms in predicting bankruptcy for United States (US) listed financial institutions. The object of analysis is the going concern opinion (GCO), widely considered as a bankruptcy warning signal to stakeholders. The sample is composed of 42 US listed financial companies that filed for Chapter 11 between 1998 and 2011. To highlight the differences between bankrupting and healthy firms, a matching sample composed of 42 randomly picked healthy US listed financial companies is collected. We concentrate on financial institutions, whereas the existing literature pays considerably greater attention to the industrial sector. This research imbalance is remarkable and particularly unexpected in the wake of recent financial scandals. Literature points out two main approaches on bankruptcy prediction: (1) purely mathematical; and (2) approaches based on a combination of auditor knowledge, expertise, and experience. The use of data mining techniques allows us to benefit from the best features of both approaches. Statistical tools used in the analysis are: Logit regression, support vector machines (SVMs), and an AdaBoost meta-algorithm. Findings show a quite low reliability of GCOs in predicting bankruptcy. It is likely that auditors consider further information in supporting their audit opinions, aside from financial-economic ratios. The scant predictive ability of auditors might be due to critical relationships with distressed clients, as suggested by recent literature.
文摘Community based churn prediction,or the assignment of recognising the influence of a customer’s community in churn prediction has become an important concern for firms in many different industries.While churn prediction until recent times have focused only on transactional dataset(targeted approach),the untargeted approach through product advisement,digital marketing and expressions in customer’s opinion on the social media like Twitter,have not been fully harnessed.Although this data source has become an important influencing factor with lasting impact on churn management.Since Social Network Analysis(SNA)has become a blended approach for churn prediction and management in modern era,customers residing online predominantly and collectively decide and determines the momentum of churn prediction,retention and decision support.In existing SNA approaches,customers are classified as churner or non-churner(1 or 0).Oftentimes,the customer’s opinion is also neglected and the network structure of community members are not exploited.Consequently,the pattern and influential abilities of customers’opinion on relative members of the community are not analysed.Thus,the research developed a Churn Service Information Graph(CSIG)to define a quadruple churn category(churner,potential churner,inertia customer,premium customer)for non-opinionated customers via the power of relative affinity around opinionated customers on a direct node to node SNA.The essence is to use data mining technique to investigate the patterns of opinion between people in a network or group.Consequently,every member of the online social network community is dynamically classified into a churn category for an improved targeted customer acquisition,retention and/or decision supports in churn management.