Opinion target extraction is one of the core tasks in sentiment analysison text data. In recent years, dependency parser–based approaches have beencommonly studied for opinion target extraction. However, dependency p...Opinion target extraction is one of the core tasks in sentiment analysison text data. In recent years, dependency parser–based approaches have beencommonly studied for opinion target extraction. However, dependency parsersare limited by language and grammatical constraints. Therefore, in this work, asequential pattern-based rule mining model, which does not have such constraints,is proposed for cross-domain opinion target extraction from product reviews inunknown domains. Thus, knowing the domain of reviews while extracting opinion targets becomes no longer a requirement. The proposed model also revealsthe difference between the concepts of opinion target and aspect, which are commonly confused in the literature. The model consists of two stages. In the firststage, the aspects of reviews are extracted from the target domain using the rulesautomatically generated from source domains. The aspects are also transferredfrom the source domains to a target domain. Moreover, aspect pruning is appliedto further improve the performance of aspect extraction. In the second stage, theopinion target is extracted among the aspects extracted at the former stage usingthe rules automatically generated for opinion target extraction. The proposedmodel was evaluated on several benchmark datasets in different domains andcompared against the literature. The experimental results revealed that the opiniontargets of the reviews in unknown domains can be extracted with higher accuracythan those of the previous works.展开更多
Sentiment analysis or opinion mining(OM)concepts become familiar due to advances in networking technologies and social media.Recently,massive amount of text has been generated over Internet daily which makes the patte...Sentiment analysis or opinion mining(OM)concepts become familiar due to advances in networking technologies and social media.Recently,massive amount of text has been generated over Internet daily which makes the pattern recognition and decision making process difficult.Since OM find useful in business sectors to improve the quality of the product as well as services,machine learning(ML)and deep learning(DL)models can be considered into account.Besides,the hyperparameters involved in the DL models necessitate proper adjustment process to boost the classification process.Therefore,in this paper,a new Artificial Fish Swarm Optimization with Bidirectional Long Short Term Memory(AFSO-BLSTM)model has been developed for OM process.The major intention of the AFSO-BLSTM model is to effectively mine the opinions present in the textual data.In addition,the AFSO-BLSTM model undergoes pre-processing and TF-IFD based feature extraction process.Besides,BLSTM model is employed for the effectual detection and classification of opinions.Finally,the AFSO algorithm is utilized for effective hyperparameter adjustment process of the BLSTM model,shows the novelty of the work.A complete simulation study of the AFSO-BLSTM model is validated using benchmark dataset and the obtained experimental values revealed the high potential of the AFSO-BLSTM model on mining opinions.展开更多
Opinion summarization recapitulates the opinions about a common topic automatically.The primary motive of summarization is to preserve the properties of the text and is shortened in a way with no loss in the semantics...Opinion summarization recapitulates the opinions about a common topic automatically.The primary motive of summarization is to preserve the properties of the text and is shortened in a way with no loss in the semantics of the text.The need of automatic summarization efficiently resulted in increased interest among communities of Natural Language Processing and Text Mining.This paper emphasis on building an extractive summarization system combining the features of principal component analysis for dimensionality reduction and bidirectional Recurrent Neural Networks and Long Short-Term Memory(RNN-LSTM)deep learning model for short and exact synopsis using seq2seq model.It presents a paradigm shift with regard to the way extractive summaries are generated.Novel algorithms for word extraction using assertions are proposed.The semantic framework is well-grounded in this research facilitating the correct decision making process after reviewing huge amount of online reviews,considering all its important features into account.The advantages of the proposed solution provides greater computational efficiency,better inferences from social media,data understanding,robustness and handling sparse data.Experiments on the different datasets also outperforms the previous researches and the accuracy is claimed to achieve more than the baselines,showing the efficiency and the novelty in the research paper.The comparisons are done by calculating accuracy with different baselines using Rouge tool.展开更多
Sentiment analysis of online reviews and other user generated content is an important research problem for its wide range of applications.In this paper,we propose a feature-based vector model and a novel weighting alg...Sentiment analysis of online reviews and other user generated content is an important research problem for its wide range of applications.In this paper,we propose a feature-based vector model and a novel weighting algorithm for sentiment analysis of Chinese product reviews.Specifically,an opinionated document is modeled by a set of feature-based vectors and corresponding weights.Different from previous work,our model considers modifying relationships between words and contains rich sentiment strength descriptions which are represented by adverbs of degree and punctuations.Dependency parsing is applied to construct the feature vectors.A novel feature weighting algorithm is proposed for supervised sentiment classification based on rich sentiment strength related information.The experimental results demonstrate the effectiveness of the proposed method compared with a state of the art method using term level weighting algorithms.展开更多
With the spread and development of new epidemics,it is of great reference value to identify the changing trends of epidemics in public emotions.We designed and implemented the COVID-19 public opinion monitoring system...With the spread and development of new epidemics,it is of great reference value to identify the changing trends of epidemics in public emotions.We designed and implemented the COVID-19 public opinion monitoring system based on time series thermal new word mining.A new word structure discovery scheme based on the timing explosion of network topics and a Chinese sentiment analysis method for the COVID-19 public opinion environment are proposed.Establish a“Scrapy-Redis-Bloomfilter”distributed crawler framework to collect data.The system can judge the positive and negative emotions of the reviewer based on the comments,and can also reflect the depth of the seven emotions such as Hopeful,Happy,and Depressed.Finally,we improved the sentiment discriminant model of this system and compared the sentiment discriminant error of COVID-19 related comments with the Jiagu deep learning model.The results show that our model has better generalization ability and smaller discriminant error.We designed a large data visualization screen,which can clearly show the trend of public emotions,the proportion of various emotion categories,keywords,hot topics,etc.,and fully and intuitively reflect the development of public opinion.展开更多
Sentiment analysis or Opinion Mining (OM) has gained significant interest among research communities and entrepreneurs in the recentyears. Likewise, Machine Learning (ML) approaches is one of the interestingresearch d...Sentiment analysis or Opinion Mining (OM) has gained significant interest among research communities and entrepreneurs in the recentyears. Likewise, Machine Learning (ML) approaches is one of the interestingresearch domains that are highly helpful and are increasingly applied in severalbusiness domains. In this background, the current research paper focuses onthe design of automated opinion mining model using Deer Hunting Optimization Algorithm (DHOA) with Fuzzy Neural Network (FNN) abbreviatedas DHOA-FNN model. The proposed DHOA-FNN technique involves fourdifferent stages namely, preprocessing, feature extraction, classification, andparameter tuning. In addition to the above, the proposed DHOA-FNN modelhas two stages of feature extraction namely, Glove and N-gram approach.Moreover, FNN model is utilized as a classification model whereas GTOA isused for the optimization of parameters. The novelty of current work is thatthe GTOA is designed to tune the parameters of FNN model. An extensiverange of simulations was carried out on the benchmark dataset and the resultswere examined under diverse measures. The experimental results highlightedthe promising performance of DHOA-FNN model over recent state-of-the-arttechniques with a maximum accuracy of 0.9928.展开更多
The sentiment of a text depends on the clausal structure of the sentence and the connectives’discourse arguments.In this work,the clause boundary,discourse argument,and syntactic and semantic information of the sente...The sentiment of a text depends on the clausal structure of the sentence and the connectives’discourse arguments.In this work,the clause boundary,discourse argument,and syntactic and semantic information of the sentence are used to assign the text’s sentiment.The clause boundaries identify the span of the text,and the discourse connectives identify the arguments.Since the lexicon-based analysis of traditional sentiment analysis gives the wrong sentiment of the sentence,a deeper-level semantic analysis is required for the correct analysis of sentiments.Hence,in this study,explicit connectives in Malayalam are considered to identify the discourse arguments.A supervised method,conditional random fields,is used to identify the clause boundary and discourse arguments.For the study,1,000 sentiment sentences from Malayalam documents were analyzed.Experimental results show that the discourse structure integration considerably improves sentiment analysis performance from the baseline system.展开更多
The rise of social media paves way for unprecedented benefits or risks to several organisations depending on how they adapt to its changes. This rise comes with a great challenge of gaining insights from these big dat...The rise of social media paves way for unprecedented benefits or risks to several organisations depending on how they adapt to its changes. This rise comes with a great challenge of gaining insights from these big data for effective and efficient decision making that can improve quality, profitability, productivity, competitiveness and customer satisfaction. Sentiment analysis is the field that is concerned with the classification and analysis of user generated text under defined polarities. Despite the upsurge of research in sentiment analysis in recent years, there is a dearth in literature on sentiment analysis applied to banks social media data and mostly on African datasets. Against this background, this study applied machine learning technique (support vector machine) for sentiment analysis of Nigerian banks twitter data within a 2-year period, from 1st January 2017 to 31st December 2018. After crawling and preprocessing of the data, LibSVM algorithm in WEKA was used to build the sentiment classification model based on the training data. The performance of this model was evaluated on a pre-labelled test dataset generated from the five banks. The results show that the accuracy of the classifier was 71.8367%. The precision for both the positive and negative classes was above 0.7, the recall for the negative class was 0.696 and that of the positive class was 0.741 which shows the prediction did better than chance in addition to other measures. Applying the model in predicting the sentiments of the five Nigerian banks twitter data reveals that the number of positive tweets within this period was slightly greater than the number of negative tweets. The scatter plots for the sentiments series indicated that, majority of the data falls between 0 and 100 sentiments per day, with few outliers above this range.展开更多
Bike sharing is considered a state-of-the-art transportation program. It is ideal for short or medium trips providing riders the ability to pick up a bike at any self-serve bike station and return it to any bike stati...Bike sharing is considered a state-of-the-art transportation program. It is ideal for short or medium trips providing riders the ability to pick up a bike at any self-serve bike station and return it to any bike station located within the system’s coverage area. The bike sharing programs in the United States are still very young compared to those in European countries. Washington DC was the first jurisdiction to devise a third generation bike sharing system in the US in 2008. To evaluate the popularity of a bike sharing program, a sentiment analysis of the riders’ feedback can be performed. Twitter is a great platform to understand people’s views instantly. Social media mining is, thus, gaining popularity in many research areas including transportation. Social media mining has two major advantages over conventional attitudinal survey methods—it can easily reach a large audience and it can reflect the true behavior of participants because of the anonymity social media provides. It is known that self-imposed censor is common in responding to conversational attitudinal surveys. This study performed text mining on the tweets related to a case study (Capital Bike share of Washington DC) to perform sentiment analysis or opinion mining. The results of the text mining mostly revealed higher positive sentiments towards the current system.展开更多
Currently,the sentiment analysis research in the Malaysian context lacks in terms of the availability of the sentiment lexicon.Thus,this issue is addressed in this paper in order to enhance the accuracy of sentiment a...Currently,the sentiment analysis research in the Malaysian context lacks in terms of the availability of the sentiment lexicon.Thus,this issue is addressed in this paper in order to enhance the accuracy of sentiment analysis.In this study,a new lexicon for sentiment analysis is constructed.A detailed review of existing approaches has been conducted,and a new bilingual sentiment lexicon known as MELex(Malay-English Lexicon)has been generated.Constructing MELex involves three activities:seed words selection,polarity assignment,and synonym expansions.Our approach differs from previous works in that MELex can analyze text for the two most widely used languages in Malaysia,Malay,and English,with the accuracy achieved,is 90%.It is evaluated based on the experimentation and case study approaches where the affordable housing projects in Malaysia are selected as case projects.This finding has given an implication on the ability of MELex to analyze public sentiments in the Malaysian context.The novel aspects of this paper are two-fold.Firstly,it introduces the new technique in assigning the polarity score,and second,it improves the performance over the classification of mixed language content.展开更多
In the current era of the internet,people use online media for conversation,discussion,chatting,and other similar purposes.Analysis of such material where more than one person is involved has a spate challenge as comp...In the current era of the internet,people use online media for conversation,discussion,chatting,and other similar purposes.Analysis of such material where more than one person is involved has a spate challenge as compared to other text analysis tasks.There are several approaches to identify users’emotions fromthe conversational text for the English language,however regional or low resource languages have been neglected.The Urdu language is one of them and despite being used by millions of users across the globe,with the best of our knowledge there exists no work on dialogue analysis in the Urdu language.Therefore,in this paper,we have proposed a model which utilizes deep learning and machine learning approaches for the classification of users’emotions from the text.To accomplish this task,we have first created a dataset for the Urdu language with the help of existing English language datasets for dialogue analysis.After that,we have preprocessed the data and selected dialogues with common emotions.Once the dataset is prepared,we have used different deep learning and machine learning techniques for the classification of emotion.We have tuned the algorithms according to the Urdu language datasets.The experimental evaluation has shown encouraging results with 67%accuracy for the Urdu dialogue datasets,more than 10,000 dialogues are classified into five emotions i.e.,joy,fear,anger,sadness,and neutral.We believe that this is the first effort for emotion detection from the conversational text in the Urdu language domain.展开更多
The Web development has drastically changed the human interaction and communication, leading to an exponential growth of data generated by users in various digital media. This mass of data provides opportunities for u...The Web development has drastically changed the human interaction and communication, leading to an exponential growth of data generated by users in various digital media. This mass of data provides opportunities for understanding people’s opinions about products, services, processes, events, political movements, and organizational strategies. In this context, it becomes important for companies to be able to assess customer satisfaction about their products or services. One of the ways to evaluate customer sentiment is the use of Sentiment Analysis, also known as Opinion Mining. This research aims to compare the efficiency of an automatic classifier based on dictionary with the classification by human jurors in a set of comments made by customers in Portuguese language. The data consist of opinions of service users of one of the largest Brazilian online employment agencies. The performance evaluation of the classification models was done using Kappa index and a Confusion Matrix. As the main finding, it is noteworthy that the agreement between the classifier and the human jurors came to moderate, with better performance for the dictionary-based classifier. This result was considered satisfactory, considering that the Sentiment Analysis in Portuguese language is a complex task and demands more research and development.展开更多
文摘Opinion target extraction is one of the core tasks in sentiment analysison text data. In recent years, dependency parser–based approaches have beencommonly studied for opinion target extraction. However, dependency parsersare limited by language and grammatical constraints. Therefore, in this work, asequential pattern-based rule mining model, which does not have such constraints,is proposed for cross-domain opinion target extraction from product reviews inunknown domains. Thus, knowing the domain of reviews while extracting opinion targets becomes no longer a requirement. The proposed model also revealsthe difference between the concepts of opinion target and aspect, which are commonly confused in the literature. The model consists of two stages. In the firststage, the aspects of reviews are extracted from the target domain using the rulesautomatically generated from source domains. The aspects are also transferredfrom the source domains to a target domain. Moreover, aspect pruning is appliedto further improve the performance of aspect extraction. In the second stage, theopinion target is extracted among the aspects extracted at the former stage usingthe rules automatically generated for opinion target extraction. The proposedmodel was evaluated on several benchmark datasets in different domains andcompared against the literature. The experimental results revealed that the opiniontargets of the reviews in unknown domains can be extracted with higher accuracythan those of the previous works.
基金The authors extend their appreciation to the Deanship of Scientific Research at King Khalid University for funding this work under grant number(RGP 2/142/43).
文摘Sentiment analysis or opinion mining(OM)concepts become familiar due to advances in networking technologies and social media.Recently,massive amount of text has been generated over Internet daily which makes the pattern recognition and decision making process difficult.Since OM find useful in business sectors to improve the quality of the product as well as services,machine learning(ML)and deep learning(DL)models can be considered into account.Besides,the hyperparameters involved in the DL models necessitate proper adjustment process to boost the classification process.Therefore,in this paper,a new Artificial Fish Swarm Optimization with Bidirectional Long Short Term Memory(AFSO-BLSTM)model has been developed for OM process.The major intention of the AFSO-BLSTM model is to effectively mine the opinions present in the textual data.In addition,the AFSO-BLSTM model undergoes pre-processing and TF-IFD based feature extraction process.Besides,BLSTM model is employed for the effectual detection and classification of opinions.Finally,the AFSO algorithm is utilized for effective hyperparameter adjustment process of the BLSTM model,shows the novelty of the work.A complete simulation study of the AFSO-BLSTM model is validated using benchmark dataset and the obtained experimental values revealed the high potential of the AFSO-BLSTM model on mining opinions.
基金to the Deanship of Scientific Research at King Faisal University for its financial support,with reference to the research grant number as 216082.
文摘Opinion summarization recapitulates the opinions about a common topic automatically.The primary motive of summarization is to preserve the properties of the text and is shortened in a way with no loss in the semantics of the text.The need of automatic summarization efficiently resulted in increased interest among communities of Natural Language Processing and Text Mining.This paper emphasis on building an extractive summarization system combining the features of principal component analysis for dimensionality reduction and bidirectional Recurrent Neural Networks and Long Short-Term Memory(RNN-LSTM)deep learning model for short and exact synopsis using seq2seq model.It presents a paradigm shift with regard to the way extractive summaries are generated.Novel algorithms for word extraction using assertions are proposed.The semantic framework is well-grounded in this research facilitating the correct decision making process after reviewing huge amount of online reviews,considering all its important features into account.The advantages of the proposed solution provides greater computational efficiency,better inferences from social media,data understanding,robustness and handling sparse data.Experiments on the different datasets also outperforms the previous researches and the accuracy is claimed to achieve more than the baselines,showing the efficiency and the novelty in the research paper.The comparisons are done by calculating accuracy with different baselines using Rouge tool.
基金This work was supported in part by National Natural Science Foundation of China under Grants No.60970052,the Beijing Natural Science Foundation under Grants No.4133084,the Beijing Educational Committee Science and Technology Development Planned under Grants No.KM201410028017 and the Beijing Key Disciplines of Computer Application Technology
文摘Sentiment analysis of online reviews and other user generated content is an important research problem for its wide range of applications.In this paper,we propose a feature-based vector model and a novel weighting algorithm for sentiment analysis of Chinese product reviews.Specifically,an opinionated document is modeled by a set of feature-based vectors and corresponding weights.Different from previous work,our model considers modifying relationships between words and contains rich sentiment strength descriptions which are represented by adverbs of degree and punctuations.Dependency parsing is applied to construct the feature vectors.A novel feature weighting algorithm is proposed for supervised sentiment classification based on rich sentiment strength related information.The experimental results demonstrate the effectiveness of the proposed method compared with a state of the art method using term level weighting algorithms.
基金This work was supported by the Hainan Provincial Natural Science Foundation of China[2019RC041,2019RC098]National Natural Science Foundation of China[61762033]+3 种基金Hainan University Doctor Start Fund Project[kyqd1328]Hainan University Youth Fund Project[qnjj1444]Ministry of education humanities and social sciences research program fund project(19YJA710010)The Opening Project of Shanghai Trusted Industrial Control Platform(Grant No.TICPSH202003005-ZC).
文摘With the spread and development of new epidemics,it is of great reference value to identify the changing trends of epidemics in public emotions.We designed and implemented the COVID-19 public opinion monitoring system based on time series thermal new word mining.A new word structure discovery scheme based on the timing explosion of network topics and a Chinese sentiment analysis method for the COVID-19 public opinion environment are proposed.Establish a“Scrapy-Redis-Bloomfilter”distributed crawler framework to collect data.The system can judge the positive and negative emotions of the reviewer based on the comments,and can also reflect the depth of the seven emotions such as Hopeful,Happy,and Depressed.Finally,we improved the sentiment discriminant model of this system and compared the sentiment discriminant error of COVID-19 related comments with the Jiagu deep learning model.The results show that our model has better generalization ability and smaller discriminant error.We designed a large data visualization screen,which can clearly show the trend of public emotions,the proportion of various emotion categories,keywords,hot topics,etc.,and fully and intuitively reflect the development of public opinion.
基金Taif University Researchers Supporting Project Number(TURSP-2020/216),Taif University,Taif,Saudi Arabia.
文摘Sentiment analysis or Opinion Mining (OM) has gained significant interest among research communities and entrepreneurs in the recentyears. Likewise, Machine Learning (ML) approaches is one of the interestingresearch domains that are highly helpful and are increasingly applied in severalbusiness domains. In this background, the current research paper focuses onthe design of automated opinion mining model using Deer Hunting Optimization Algorithm (DHOA) with Fuzzy Neural Network (FNN) abbreviatedas DHOA-FNN model. The proposed DHOA-FNN technique involves fourdifferent stages namely, preprocessing, feature extraction, classification, andparameter tuning. In addition to the above, the proposed DHOA-FNN modelhas two stages of feature extraction namely, Glove and N-gram approach.Moreover, FNN model is utilized as a classification model whereas GTOA isused for the optimization of parameters. The novelty of current work is thatthe GTOA is designed to tune the parameters of FNN model. An extensiverange of simulations was carried out on the benchmark dataset and the resultswere examined under diverse measures. The experimental results highlightedthe promising performance of DHOA-FNN model over recent state-of-the-arttechniques with a maximum accuracy of 0.9928.
文摘The sentiment of a text depends on the clausal structure of the sentence and the connectives’discourse arguments.In this work,the clause boundary,discourse argument,and syntactic and semantic information of the sentence are used to assign the text’s sentiment.The clause boundaries identify the span of the text,and the discourse connectives identify the arguments.Since the lexicon-based analysis of traditional sentiment analysis gives the wrong sentiment of the sentence,a deeper-level semantic analysis is required for the correct analysis of sentiments.Hence,in this study,explicit connectives in Malayalam are considered to identify the discourse arguments.A supervised method,conditional random fields,is used to identify the clause boundary and discourse arguments.For the study,1,000 sentiment sentences from Malayalam documents were analyzed.Experimental results show that the discourse structure integration considerably improves sentiment analysis performance from the baseline system.
文摘The rise of social media paves way for unprecedented benefits or risks to several organisations depending on how they adapt to its changes. This rise comes with a great challenge of gaining insights from these big data for effective and efficient decision making that can improve quality, profitability, productivity, competitiveness and customer satisfaction. Sentiment analysis is the field that is concerned with the classification and analysis of user generated text under defined polarities. Despite the upsurge of research in sentiment analysis in recent years, there is a dearth in literature on sentiment analysis applied to banks social media data and mostly on African datasets. Against this background, this study applied machine learning technique (support vector machine) for sentiment analysis of Nigerian banks twitter data within a 2-year period, from 1st January 2017 to 31st December 2018. After crawling and preprocessing of the data, LibSVM algorithm in WEKA was used to build the sentiment classification model based on the training data. The performance of this model was evaluated on a pre-labelled test dataset generated from the five banks. The results show that the accuracy of the classifier was 71.8367%. The precision for both the positive and negative classes was above 0.7, the recall for the negative class was 0.696 and that of the positive class was 0.741 which shows the prediction did better than chance in addition to other measures. Applying the model in predicting the sentiments of the five Nigerian banks twitter data reveals that the number of positive tweets within this period was slightly greater than the number of negative tweets. The scatter plots for the sentiments series indicated that, majority of the data falls between 0 and 100 sentiments per day, with few outliers above this range.
文摘Bike sharing is considered a state-of-the-art transportation program. It is ideal for short or medium trips providing riders the ability to pick up a bike at any self-serve bike station and return it to any bike station located within the system’s coverage area. The bike sharing programs in the United States are still very young compared to those in European countries. Washington DC was the first jurisdiction to devise a third generation bike sharing system in the US in 2008. To evaluate the popularity of a bike sharing program, a sentiment analysis of the riders’ feedback can be performed. Twitter is a great platform to understand people’s views instantly. Social media mining is, thus, gaining popularity in many research areas including transportation. Social media mining has two major advantages over conventional attitudinal survey methods—it can easily reach a large audience and it can reflect the true behavior of participants because of the anonymity social media provides. It is known that self-imposed censor is common in responding to conversational attitudinal surveys. This study performed text mining on the tweets related to a case study (Capital Bike share of Washington DC) to perform sentiment analysis or opinion mining. The results of the text mining mostly revealed higher positive sentiments towards the current system.
文摘Currently,the sentiment analysis research in the Malaysian context lacks in terms of the availability of the sentiment lexicon.Thus,this issue is addressed in this paper in order to enhance the accuracy of sentiment analysis.In this study,a new lexicon for sentiment analysis is constructed.A detailed review of existing approaches has been conducted,and a new bilingual sentiment lexicon known as MELex(Malay-English Lexicon)has been generated.Constructing MELex involves three activities:seed words selection,polarity assignment,and synonym expansions.Our approach differs from previous works in that MELex can analyze text for the two most widely used languages in Malaysia,Malay,and English,with the accuracy achieved,is 90%.It is evaluated based on the experimentation and case study approaches where the affordable housing projects in Malaysia are selected as case projects.This finding has given an implication on the ability of MELex to analyze public sentiments in the Malaysian context.The novel aspects of this paper are two-fold.Firstly,it introduces the new technique in assigning the polarity score,and second,it improves the performance over the classification of mixed language content.
文摘In the current era of the internet,people use online media for conversation,discussion,chatting,and other similar purposes.Analysis of such material where more than one person is involved has a spate challenge as compared to other text analysis tasks.There are several approaches to identify users’emotions fromthe conversational text for the English language,however regional or low resource languages have been neglected.The Urdu language is one of them and despite being used by millions of users across the globe,with the best of our knowledge there exists no work on dialogue analysis in the Urdu language.Therefore,in this paper,we have proposed a model which utilizes deep learning and machine learning approaches for the classification of users’emotions from the text.To accomplish this task,we have first created a dataset for the Urdu language with the help of existing English language datasets for dialogue analysis.After that,we have preprocessed the data and selected dialogues with common emotions.Once the dataset is prepared,we have used different deep learning and machine learning techniques for the classification of emotion.We have tuned the algorithms according to the Urdu language datasets.The experimental evaluation has shown encouraging results with 67%accuracy for the Urdu dialogue datasets,more than 10,000 dialogues are classified into five emotions i.e.,joy,fear,anger,sadness,and neutral.We believe that this is the first effort for emotion detection from the conversational text in the Urdu language domain.
文摘The Web development has drastically changed the human interaction and communication, leading to an exponential growth of data generated by users in various digital media. This mass of data provides opportunities for understanding people’s opinions about products, services, processes, events, political movements, and organizational strategies. In this context, it becomes important for companies to be able to assess customer satisfaction about their products or services. One of the ways to evaluate customer sentiment is the use of Sentiment Analysis, also known as Opinion Mining. This research aims to compare the efficiency of an automatic classifier based on dictionary with the classification by human jurors in a set of comments made by customers in Portuguese language. The data consist of opinions of service users of one of the largest Brazilian online employment agencies. The performance evaluation of the classification models was done using Kappa index and a Confusion Matrix. As the main finding, it is noteworthy that the agreement between the classifier and the human jurors came to moderate, with better performance for the dictionary-based classifier. This result was considered satisfactory, considering that the Sentiment Analysis in Portuguese language is a complex task and demands more research and development.