The task of cross-target stance detection faces significant challenges due to the lack of additional background information in emerging knowledge domains and the colloquial nature of language patterns.Traditional stan...The task of cross-target stance detection faces significant challenges due to the lack of additional background information in emerging knowledge domains and the colloquial nature of language patterns.Traditional stance detection methods often struggle with understanding limited context and have insufficient generalization across diverse sentiments and semantic structures.This paper focuses on effectively mining and utilizing sentimentsemantics knowledge for stance knowledge transfer and proposes a sentiment-aware hierarchical attention network(SentiHAN)for cross-target stance detection.SentiHAN introduces an improved hierarchical attention network designed to maximize the use of high-level representations of targets and texts at various fine-grain levels.This model integrates phrase-level combinatorial sentiment knowledge to effectively bridge the knowledge gap between known and unknown targets.By doing so,it enables a comprehensive understanding of stance representations for unknown targets across different sentiments and semantic structures.The model’s ability to leverage sentimentsemantics knowledge enhances its performance in detecting stances that may not be directly observable from the immediate context.Extensive experimental results indicate that SentiHAN significantly outperforms existing benchmark methods in terms of both accuracy and robustness.Moreover,the paper employs ablation studies and visualization techniques to explore the intricate relationship between sentiment and stance.These analyses further confirm the effectiveness of sentence-level combinatorial sentiment knowledge in improving stance detection capabilities.展开更多
As the pivotal green space,urban parks play an important role in urban residents’daily activities.Thy can not only bring people physical health,but also can be more likely to elicit positive sentiment to those who vi...As the pivotal green space,urban parks play an important role in urban residents’daily activities.Thy can not only bring people physical health,but also can be more likely to elicit positive sentiment to those who visit them.Recently,social media big data has provided new data sources for sentiment analysis.However,there was limited researches that explored the connection between urban parks and individual’s sentiments.Therefore,this study firstly employed a pre-trained language model(BERT,Bidirectional Encoder Representations from Transformers)to calculate sentiment scores based on social media data.Secondly,this study analysed the relationship between urban parks and individual’s sentiment from both spatial and temporal perspectives.Finally,by utilizing structural equation model(SEM),we identified 13 factors and analyzed its degree of the influence.The research findings are listed as below:①It confirmed that individuals generally experienced positive sentiment with high sentiment scores in the majority of urban parks;②The urban park type showed an influence on sentiment scores.In this study,higher sentiment scores observed in Eco-parks,comprehensive parks,and historical parks;③The urban parks level showed low impact on sentiment scores.With distinctions observed mainly at level-3 and level-4;④Compared to internal factors in parks,the external infrastructure surround them exerted more significant impact on sentiment scores.For instance,number of bus and subway stations around urban parks led to higher sentiment scores,while scenic spots and restaurants had inverse result.This study provided a novel method to quantify the services of various urban parks,which can be served as inspiration for similar studies in other cities and countries,enhancing their park planning and management strategies.展开更多
Recent patterns of human sentiments are highly influenced by emoji based sentiments(EBS).Social media users are widely using emoji based sentiments(EBS)in between text messages,tweets and posts.Although tiny pictures ...Recent patterns of human sentiments are highly influenced by emoji based sentiments(EBS).Social media users are widely using emoji based sentiments(EBS)in between text messages,tweets and posts.Although tiny pictures of emoji contains sufficient information to be considered for construction of classification model;but due to the wide range of dissimilar,heterogynous and complex patterns of emoji with similarmeanings(SM)have become one of the significant research areas of machine vision.This paper proposes an approach to provide meticulous assistance to social media application(SMA)users to classify the EBS sentiments.Proposed methodology consists upon three layerswhere first layer deals with data cleaning and feature selection techniques to detect dissimilar emoji patterns(DEP)with similar meanings(SM).In first sub step we input set of emoji,in second sub step every emoji has to qualify user defined threshold,in third sub step algorithm detects every emoji by considering as objects and in fourth step emoji images are cropped,after data cleaning these tiny images are saved as emoji images.In second step we build classification model by using convolutional neural networks(CNN)to explore hidden knowledge of emoji datasets.In third step we present results visualization by using confusion matrix and other estimations.This paper contributes(1)data cleaning method to detect EBS;(2)highest classification accuracy for emoji classification measured as 97.63%.展开更多
I was sent twice by CAFIU to work at Beijing Office of the Friedrich-Ebert-Stiftung(FES) of Germany when I had the honor to get familiar with my German colleagues. This helped me know more about the national characte...I was sent twice by CAFIU to work at Beijing Office of the Friedrich-Ebert-Stiftung(FES) of Germany when I had the honor to get familiar with my German colleagues. This helped me know more about the national character of the German people. Moved by their friendly sentiments towards the Chinese people and their deep interest in展开更多
This study examines the relationship between positive and negative investor sentiments and stock market returns and volatility in Group of 20 countries using variousmethods, including panel regression with fixed effec...This study examines the relationship between positive and negative investor sentiments and stock market returns and volatility in Group of 20 countries using variousmethods, including panel regression with fixed effects, panel quantile regressions, apanel vector autoregression (PVAR) model, and country-specific regressions. We proxyfor negative and positive investor sentiments using the Google Search Volume Indexfor terms related to the coronavirus disease (COVID-19) and COVID-19 vaccine, respectively. Using weekly data from March 2020 to May 2021, we document significantrelationships between positive and negative investor sentiments and stock marketreturns and volatility. Specifically, an increase in positive investor sentiment leads toan increase in stock returns while negative investor sentiment decreases stock returnsat lower quantiles. The effect of investor sentiment on volatility is consistent acrossthe distribution: negative sentiment increases volatility, whereas positive sentimentreduces volatility. These results are robust as they are corroborated by Granger causalitytests and a PVAR model. The findings may have portfolio implications as they indicatethat proxies for positive and negative investor sentiments seem to be good predictorsof stock returns and volatility during the pandemic.展开更多
Bike sharing is considered a state-of-the-art transportation program. It is ideal for short or medium trips providing riders the ability to pick up a bike at any self-serve bike station and return it to any bike stati...Bike sharing is considered a state-of-the-art transportation program. It is ideal for short or medium trips providing riders the ability to pick up a bike at any self-serve bike station and return it to any bike station located within the system’s coverage area. The bike sharing programs in the United States are still very young compared to those in European countries. Washington DC was the first jurisdiction to devise a third generation bike sharing system in the US in 2008. To evaluate the popularity of a bike sharing program, a sentiment analysis of the riders’ feedback can be performed. Twitter is a great platform to understand people’s views instantly. Social media mining is, thus, gaining popularity in many research areas including transportation. Social media mining has two major advantages over conventional attitudinal survey methods—it can easily reach a large audience and it can reflect the true behavior of participants because of the anonymity social media provides. It is known that self-imposed censor is common in responding to conversational attitudinal surveys. This study performed text mining on the tweets related to a case study (Capital Bike share of Washington DC) to perform sentiment analysis or opinion mining. The results of the text mining mostly revealed higher positive sentiments towards the current system.展开更多
The coronavirus disease(COVID-19)pandemic has affected the lives of social media users in an unprecedentedmanner.They are constantly posting their satisfaction or dissatisfaction over the COVID-19 situation at their l...The coronavirus disease(COVID-19)pandemic has affected the lives of social media users in an unprecedentedmanner.They are constantly posting their satisfaction or dissatisfaction over the COVID-19 situation at their location of interest.Therefore,understanding location-oriented sentiments about this situation is of prime importance for political leaders,and strategic decision-makers.To this end,we present a new fully automated algorithm based on artificial intelligence(AI),for extraction of location-oriented public sentiments on the COVID-19 situation.We designed the proposed system to obtain exhaustive knowledge and insights on social media feeds related to COVID-19 in 110 languages through AI-based translation,sentiment analysis,location entity detection,and decomposition tree analysis.We deployed fully automated algorithm on live Twitter feed from July 15,2021 and it is still running as of 12 January,2022.The system was evaluated on a limited dataset between July 15,2021 to August 10,2021.During this evaluation timeframe 150,000 tweets were analyzed and our algorithm found that 9,900 tweets contained one or more location entities.In total,13,220 location entities were detected during the evaluation period,and the rates of average precision and recall rate were 0.901 and 0.967,respectively.As of 12 January,2022,the proposed solution has detected 43,169 locations using entity recognition.According to the best of our knowledge,this study is the first to report location intelligence with entity detection,sentiment analysis,and decomposition tree analysis on social media messages related to COVID-19 and has covered the largest set of languages.展开更多
Healthcare organizations rely on patients’feedback and experiences to evaluate their performance and services,thereby allowing such organizations to improve inadequate services and address any shortcomings.According ...Healthcare organizations rely on patients’feedback and experiences to evaluate their performance and services,thereby allowing such organizations to improve inadequate services and address any shortcomings.According to the literature,social networks and particularly Twitter are effective platforms for gathering public opinions.Moreover,recent studies have used natural language processing to measure sentiments in text segments collected from Twitter to capture public opinions about various sectors,including healthcare.The present study aimed to analyze Arabic Twitter-based patient experience sentiments and to introduce an Arabic patient experience corpus.The authors collected 12,400 tweets from Arabic patients discussing patient experiences related to healthcare organizations in Saudi Arabia from 1 January 2008 to 29 January 2022.The tweets were labeled according to sentiment(positive or negative)and sector(public or private),and thereby the Hospital Patient Experiences in Saudi Arabia(HoPE-SA)dataset was produced.A simple statistical analysis was conducted to examine differences in patient views of healthcare sectors.The authors trained five models to distinguish sentiments in tweets automatically with the following schemes:a transformer-based model fine-tuned with deep learning architecture and a transformer-based model fine-tuned with simple architecture,using two different transformer-based embeddings based on Bidirectional Encoder Representations from Transformers(BERT),Multi-dialect Arabic BERT(MAR-BERT),and multilingual BERT(mBERT),as well as a pretrained word2vec model with a support vector machine classifier.This is the first study to investigate the use of a bidirectional long short-term memory layer followed by a feedforward neural network for the fine-tuning of MARBERT.The deep-learning fine-tuned MARBERT-based model—the authors’best-performing model—achieved accuracy,micro-F1,and macro-F1 scores of 98.71%,98.73%,and 98.63%,respectively.展开更多
Millions of people are connecting and exchanging information on social media platforms,where interpersonal interactions are constantly being shared.However,due to inaccurate or misleading information about the COVID-1...Millions of people are connecting and exchanging information on social media platforms,where interpersonal interactions are constantly being shared.However,due to inaccurate or misleading information about the COVID-19 pandemic,social media platforms became the scene of tense debates between believers and doubters.Healthcare professionals and public health agencies also use social media to inform the public about COVID-19 news and updates.However,they occasionally have trouble managing massive pandemic-related rumors and frauds.One reason is that people share and engage,regardless of the information source,by assuming the content is unquestionably true.On Twitter,users use words and phrases literally to convey their views or opinion.However,other users choose to utilize idioms or proverbs that are implicit and indirect to make a stronger impression on the audience or perhaps to catch their attention.Idioms and proverbs are figurative expressions with a thematically coherent totality that cannot understand literally.Despite more than 10%of tweets containing idioms or slang,most sentiment analysis research focuses on the accuracy enhancement of various classification algorithms.However,little attention would decipher the hidden sentiments of the expressed idioms in tweets.This paper proposes a novel data expansion strategy for categorizing tweets concerning COVID-19.The following are the benefits of the suggested method:1)no transformer fine-tuning is necessary,2)the technique solves the fundamental challenge of the manual data labeling process by automating the construction and annotation of the sentiment lexicon,3)the method minimizes the error rate in annotating the lexicon,and drastically improves the tweet sentiment classification’s accuracy performance.展开更多
Forecasting economic indices on the basis of information extracted from text documents, like newspaper articles is an attractive idea. With the help of text mining techniques, in particular sentiment analysis, we eval...Forecasting economic indices on the basis of information extracted from text documents, like newspaper articles is an attractive idea. With the help of text mining techniques, in particular sentiment analysis, we evaluate the tone of individual New York Times (NYT) articles and compare our results to the Chicago Fed National Activity Index (CFNAI). In this paper, we present a simple, intuitive framework to derive sentiment scores from text documents In particular articles are tagged based on terms and their connotated sentiment. Subsequently, we forecast the CFNAI movements via support vector machines (SVM) trained on a subset of the observed sentiment scores. We apply our model into two different data sets, the whole NYT articles and the articles categorized as NYT business news. On both data sets, we applied a simple performance measure to evaluate forecasting accuracy of the CFNAI展开更多
Previously, rapid disease detection and prevention was difficult. This is because disease modeling and prediction was dependent on a manually obtained dataset that includes use of survey. With the increased use of soc...Previously, rapid disease detection and prevention was difficult. This is because disease modeling and prediction was dependent on a manually obtained dataset that includes use of survey. With the increased use of social media platforms like Twitter, Facebook, Instagram, etc., data mining and sentiment analysis can help avoid diseases. Sentiment analysis is a powerful tool for analyzing people’s perceptions, emotions, value assessments, attitudes, and feelings as expressed in texts. The purpose of this research is to use machine learning techniques to classify and predict the spatial distribution of positive and negative sentiments of Covid-19 pandemic. This study research has employed machine learning to classify spatial distribution of Covid-19 <span style="font-family:Verdana;">twitter sentiments as positive or negative. The data for this study were geo-tagged</span><span style="font-family:Verdana;"> tweets concerning COVID-19 which were live streamed using streamR package. The key terms used for streaming the data were</span><span style="font-family:Verdana;">:</span><span style="font-family:Verdana;"> Corona, Covid-19, sanitizer, virus, lockdown, quarantine, and social distance. The classification used Naive Bayes algorithms with ngram approaches. N-Gram model is a probabilistic language model used to predict next item in a sequence in the form (n</span><span style="font-family:;" "=""> <span style="font-family:Verdana;">-</span></span><span style="font-family:;" "=""> </span><span style="font-family:Verdana;">1) order Markov. It relies on the Markov assumption—the probability of a word depends only on the previous word without looking too far into the past. The steps followed in this research include</span><span style="font-family:Verdana;">: </span><span style="font-family:;" "=""><span style="font-family:Verdana;">cleaning and preprocessing the data, text tokenization using n-gram </span><i><span style="font-family:Verdana;">i.e.</span></i><span style="font-family:Verdana;"> 1-gram, 2-gram, and 3-gram, tweets were converted or weighted into a matrix of numeric vectors using Term Frequency Inverse-Document. Also, data were divided 80:20 between train and test data. A confusion matrix was utilized to evaluate the classification accuracy, precision, and recall performance of the various algorithms tested. Prediction was done using the best performing Naive Bayes algorithm. The results of this research showed that under Multinomial Naive Bayes, unigram accuracy was 92.02%, bigram accuracy was 97.37%, and trigram accuracy was 94.40%. Unigram had 89.34% accuracy, bigram had 96.80%, and trigram had 94.90% accuracy using Bernoulli Naive Bayes. Unigram accuracy was 90.43%, bigram accuracy was 95.67%, and trigram accuracy was 92.89% using Gaussian Naive Bayes. Bigram tokenization outperformed unigram and trigram tokenization. Bigram Multinomial Naive Bayes was used to predict test data since it was the most accurate in classifying train data. Prediction </span><span style="font-family:Verdana;">accuracy was 84.92%, precision 85.50%, recall 81.02%, and F1 measure 83.20%</span><span style="font-family:Verdana;">. TF-IDF was employed to increase prediction accuracy, obtaining 87.06%. These were then plotted on a globe map. The study indicates that machine learning can identify patterns and emotions in public tweets, which may then be used to steer targeted intervention programs aimed at limiting disease spread.</span></span>展开更多
Huge amount of data is being produced every second for microblogs,different content sharing sites,and social networking.Sentimental classification is a tool that is frequently used to identify underlying opinions and ...Huge amount of data is being produced every second for microblogs,different content sharing sites,and social networking.Sentimental classification is a tool that is frequently used to identify underlying opinions and sentiments present in the text and classifying them.It is widely used for social media platforms to find user’s sentiments about a particular topic or product.Capturing,assembling,and analyzing sentiments has been challenge for researchers.To handle these challenges,we present a comparative sentiment analysis study in which we used the fine-grained Stanford Sentiment Treebank(SST)dataset,based on 215,154 exclusive texts of different lengths that are manually labeled.We present comparative sentiment analysis to solve the fine-grained sentiment classification problem.The proposed approach takes start by pre-processing the data and then apply eight machine-learning algorithms for the sentiment classification namely Support Vector Machine(SVM),Logistic Regression(LR),Neural Networks(NN),Random Forest(RF),Decision Tree(DT),K-Nearest Neighbor(KNN),Adaboost and Naïve Bayes(NB).On the basis of results obtained the accuracy,precision,recall and F1-score were calculated to draw a comparison between the classification approaches being used.展开更多
Social applications such as Weibo have provided a quick platform for information propagation, which have led to an explosive propagation for hot topic. User sentiments about propagation information play an important r...Social applications such as Weibo have provided a quick platform for information propagation, which have led to an explosive propagation for hot topic. User sentiments about propagation information play an important role in propagation speed, which receive more and more attention from data mining field. In this paper, we propose an sentiment-based hot topics prediction model called PHT-US. PHT-US firstly classifies a large amount of text data in Weibo into different topics, then converts user sentiments and time factors into embedding vectors that are input into recurrent neural networks (both LSTM and GRU), and predicts whether the target topic could be a hot spot. Experiments on Sina Weibo show that PHT-US can effectively predict the hot topics in the future. Social applications such as Weibo provide a platform for quick information propagation, which leads to an explosive propagation for hot topics. User sentiments about propagation information play an important role in propagation speed, and thus receive more attention from data mining field. In this paper, a sentiment-based hot topics prediction model called PHT-US is proposed. Firstly a large amount of text data in Weibo was classified into different topics, and then user sentiments and time factors were converted into embedding vectors that are input into recurrent neural networks (both LSTM and GRU), and future hotspots were predicted. Experiments on Sina Weibo show that PHT-US can effectively predict hot topics in the future.展开更多
亚当·斯密是一位特别注重修辞的学者,要想准确把握其道德理论核心,需要认真分析其代表作《道德情操论》的文本。The Theory of Moral Sentiments(TMS)标题中的moral sentiments是指人类在道德判断上的一种基本能力,是包含同情、良...亚当·斯密是一位特别注重修辞的学者,要想准确把握其道德理论核心,需要认真分析其代表作《道德情操论》的文本。The Theory of Moral Sentiments(TMS)标题中的moral sentiments是指人类在道德判断上的一种基本能力,是包含同情、良知、审美以及道德推理等多方面内容的,其根源在于人类以自己同情共感的能力经验到各种道德实践,又通过归纳、反思和推理来将其一般化,最后上升为指导道德抉择和道德行为的原理。斯密道德论的核心绝非"道德情操"本身,而是各种道德情感得以形成的同情共感机制。现在被广泛接受的中文翻译书名《道德情操论》容易误导读者,而翻译成《道德情感论》更符合斯密道德理论的核心内涵。展开更多
Online incremental learning is one of the emerging research interests among the researchers in the recent years.The sentiment classification through the online incremental learning faces many challenges due to the lim...Online incremental learning is one of the emerging research interests among the researchers in the recent years.The sentiment classification through the online incremental learning faces many challenges due to the limitations in the memory and the computing resources available for processing the online reviews.This work has introduced an online incremental learning algorithm for classifying the train reviews.The sentiments available in the reviews provided for the public services are necessary for improving the quality of the service.This work proposes the online kernel optimizationbased support vector machine(OKO-SVM)classifier for the sentiment classification of the train reviews.This paper is the extension of the previous work kernel optimizationbased support vector machine(KO-SVM).The OKO-SVM classifier uses the proposed fuzzy bound for modifying the weight for each incoming review database for the particular time duration.The simulation uses the standard train review and the movie review database for the classification.From the simulation results,it is evident that the proposed model has achieved a better performance with the values of 84.42%,93.86%,and 74.56%regarding the accuracy,sensitivity,and specificity while classifying the train review database.展开更多
The valuation of cryptocurrencies is important given the increasing significance of this potential asset class.However,most state-of-the-art cryptocurrency valuation methods only focus on one of the fundamental factor...The valuation of cryptocurrencies is important given the increasing significance of this potential asset class.However,most state-of-the-art cryptocurrency valuation methods only focus on one of the fundamental factors or sentiments and use out-of-date data sources.In this study,a robust cryptocurrency valuation method is developed using up-to-date datasets.Using various panel regression models and moving-window regression tests,the impacts of fundamental factors and sentiments in the valuation of cryptocurrencies are explored with data covering from January 1,2009 to April 30,2023.The research shows the importance of sentiments and suggests that the fear and greed index can indicate when to make a cryptocurrency investment,while Google search interest in cryptocurrency is crucial when choosing the appropriate type of cryptocurrency.Moreover,consensus mechanism and initial coin offering have significant effects on cryptocurrencies without stablecoins,while their impacts on cryptocurrencies with stablecoins are insignificant.Other fundamental factors,such as the type of supply and the presence of smart contracts,do not have a significant influence on cryptocurrency.Findings from this study can enhance cryptocurrency marketisation and provide insightful guidance for investors,portfolio managers,and policymakers in assessing the utility level of each cryptocurrency.展开更多
Aspect-Based Sentiment Analysis(ABSA)is a fundamental area of research in Natural Language Processing(NLP).Within ABSA,Aspect Sentiment Quad Prediction(ASQP)aims to accurately identify sentiment quadruplets in target ...Aspect-Based Sentiment Analysis(ABSA)is a fundamental area of research in Natural Language Processing(NLP).Within ABSA,Aspect Sentiment Quad Prediction(ASQP)aims to accurately identify sentiment quadruplets in target sentences,including aspect terms,aspect categories,corresponding opinion terms,and sentiment polarity.However,most existing research has focused on English datasets.Consequently,while ASQP has seen significant progress in English,the Chinese ASQP task has remained relatively stagnant.Drawing inspiration from methods applied to English ASQP,we propose Chinese generation templates and employ prompt-based instruction learning to enhance the model’s understanding of the task,ultimately improving ASQP performance in the Chinese context.Ultimately,under the same pre-training model configuration,our approach achieved a 5.79%improvement in the F1 score compared to the previously leading method.Furthermore,when utilizing a larger model with reduced training parameters,the F1 score demonstrated an 8.14%enhancement.Additionally,we suggest a novel evaluation metric based on the characteristics of generative models,better-reflecting model generalization.Experimental results validate the effectiveness of our approach.展开更多
Modern technological advancements have made social media an essential component of daily life.Social media allow individuals to share thoughts,emotions,and ideas.Sentiment analysis plays the function of evaluating whe...Modern technological advancements have made social media an essential component of daily life.Social media allow individuals to share thoughts,emotions,and ideas.Sentiment analysis plays the function of evaluating whether the sentiment of the text is positive,negative,neutral,or any other personal emotion to understand the sentiment context of the text.Sentiment analysis is essential in business and society because it impacts strategic decision-making.Sentiment analysis involves challenges due to lexical variation,an unlabeled dataset,and text distance correlations.The execution time increases due to the sequential processing of the sequence models.However,the calculation times for the Transformer models are reduced because of the parallel processing.This study uses a hybrid deep learning strategy to combine the strengths of the Transformer and Sequence models while ignoring their limitations.In particular,the proposed model integrates the Decoding-enhanced with Bidirectional Encoder Representations from Transformers(BERT)attention(DeBERTa)and the Gated Recurrent Unit(GRU)for sentiment analysis.Using the Decoding-enhanced BERT technique,the words are mapped into a compact,semantic word embedding space,and the Gated Recurrent Unit model can capture the distance contextual semantics correctly.The proposed hybrid model achieves F1-scores of 97%on the Twitter Large Language Model(LLM)dataset,which is much higher than the performance of new techniques.展开更多
In the context of the accelerated pace of daily life and the development of e-commerce,online shopping is a mainstreamway for consumers to access products and services.To understand their emotional expressions in faci...In the context of the accelerated pace of daily life and the development of e-commerce,online shopping is a mainstreamway for consumers to access products and services.To understand their emotional expressions in facing different shopping experience scenarios,this paper presents a sentiment analysis method that combines the ecommerce reviewkeyword-generated imagewith a hybrid machine learning-basedmodel,inwhich theWord2Vec-TextRank is used to extract keywords that act as the inputs for generating the related images by generative Artificial Intelligence(AI).Subsequently,a hybrid Convolutional Neural Network and Support Vector Machine(CNNSVM)model is applied for sentiment classification of those keyword-generated images.For method validation,the data randomly comprised of 5000 reviews from Amazon have been analyzed.With superior keyword extraction capability,the proposedmethod achieves impressive results on sentiment classification with a remarkable accuracy of up to 97.13%.Such performance demonstrates its advantages by using the text-to-image approach,providing a unique perspective for sentiment analysis in the e-commerce review data compared to the existing works.Thus,the proposed method enhances the reliability and insights of customer feedback surveys,which would also establish a novel direction in similar cases,such as social media monitoring and market trend research.展开更多
The emergence of new media in various fields has continuously strengthened the social aspect of social media.Netizens tend to express emotions in social interactions,and many people even use satire,metaphors,and other...The emergence of new media in various fields has continuously strengthened the social aspect of social media.Netizens tend to express emotions in social interactions,and many people even use satire,metaphors,and other techniques to express some negative emotions,it is necessary to detect sarcasm in social comment data.For sarcasm,the more reference data modalities used,the better the experimental effect.This paper conducts research on sarcasm detection technology based on image-text fusion data.To effectively utilize the features of each modality,a feature reconstruction output algorithm is proposed.This algorithm is based on the attention mechanism,learns the low-rank features of another modality through cross-modality,the eigenvectors are reconstructed for the corresponding modality through weighted averaging.When only the image modality in the dataset is used,the preprocessed data has outstanding performance in reconstructing the output model,with an accuracy rate of 87.6%.When using only the text modality data in the dataset,the reconstructed output model is optimal,with an accuracy rate of 85.2%.To improve feature fusion between modalities for effective classification,a weight adaptive learning algorithm is used.This algorithm uses a neural network combined with an attention mechanism to calculate the attention weight of each modality to achieve weight adaptive learning purposes,with an accuracy rate of 87.9%.Extensive experiments on a benchmark dataset demonstrate the superiority of our proposed model.展开更多
基金supported by the National Social Science Fund of China(20BXW101)。
文摘The task of cross-target stance detection faces significant challenges due to the lack of additional background information in emerging knowledge domains and the colloquial nature of language patterns.Traditional stance detection methods often struggle with understanding limited context and have insufficient generalization across diverse sentiments and semantic structures.This paper focuses on effectively mining and utilizing sentimentsemantics knowledge for stance knowledge transfer and proposes a sentiment-aware hierarchical attention network(SentiHAN)for cross-target stance detection.SentiHAN introduces an improved hierarchical attention network designed to maximize the use of high-level representations of targets and texts at various fine-grain levels.This model integrates phrase-level combinatorial sentiment knowledge to effectively bridge the knowledge gap between known and unknown targets.By doing so,it enables a comprehensive understanding of stance representations for unknown targets across different sentiments and semantic structures.The model’s ability to leverage sentimentsemantics knowledge enhances its performance in detecting stances that may not be directly observable from the immediate context.Extensive experimental results indicate that SentiHAN significantly outperforms existing benchmark methods in terms of both accuracy and robustness.Moreover,the paper employs ablation studies and visualization techniques to explore the intricate relationship between sentiment and stance.These analyses further confirm the effectiveness of sentence-level combinatorial sentiment knowledge in improving stance detection capabilities.
基金R&D Program of Beijing Municipal Education Commission(No.KM202211417015)Academic Research Projects of Beijing Union University(No.ZK10202209)+1 种基金The team-building subsidy of“Xuezhi Professorship”of the College of Applied Arts and Science of Beijing Union University(No.BUUCAS-XZJSTD-2024005)Academic Research Projects of Beijing Union University(No.ZKZD202305).
文摘As the pivotal green space,urban parks play an important role in urban residents’daily activities.Thy can not only bring people physical health,but also can be more likely to elicit positive sentiment to those who visit them.Recently,social media big data has provided new data sources for sentiment analysis.However,there was limited researches that explored the connection between urban parks and individual’s sentiments.Therefore,this study firstly employed a pre-trained language model(BERT,Bidirectional Encoder Representations from Transformers)to calculate sentiment scores based on social media data.Secondly,this study analysed the relationship between urban parks and individual’s sentiment from both spatial and temporal perspectives.Finally,by utilizing structural equation model(SEM),we identified 13 factors and analyzed its degree of the influence.The research findings are listed as below:①It confirmed that individuals generally experienced positive sentiment with high sentiment scores in the majority of urban parks;②The urban park type showed an influence on sentiment scores.In this study,higher sentiment scores observed in Eco-parks,comprehensive parks,and historical parks;③The urban parks level showed low impact on sentiment scores.With distinctions observed mainly at level-3 and level-4;④Compared to internal factors in parks,the external infrastructure surround them exerted more significant impact on sentiment scores.For instance,number of bus and subway stations around urban parks led to higher sentiment scores,while scenic spots and restaurants had inverse result.This study provided a novel method to quantify the services of various urban parks,which can be served as inspiration for similar studies in other cities and countries,enhancing their park planning and management strategies.
文摘Recent patterns of human sentiments are highly influenced by emoji based sentiments(EBS).Social media users are widely using emoji based sentiments(EBS)in between text messages,tweets and posts.Although tiny pictures of emoji contains sufficient information to be considered for construction of classification model;but due to the wide range of dissimilar,heterogynous and complex patterns of emoji with similarmeanings(SM)have become one of the significant research areas of machine vision.This paper proposes an approach to provide meticulous assistance to social media application(SMA)users to classify the EBS sentiments.Proposed methodology consists upon three layerswhere first layer deals with data cleaning and feature selection techniques to detect dissimilar emoji patterns(DEP)with similar meanings(SM).In first sub step we input set of emoji,in second sub step every emoji has to qualify user defined threshold,in third sub step algorithm detects every emoji by considering as objects and in fourth step emoji images are cropped,after data cleaning these tiny images are saved as emoji images.In second step we build classification model by using convolutional neural networks(CNN)to explore hidden knowledge of emoji datasets.In third step we present results visualization by using confusion matrix and other estimations.This paper contributes(1)data cleaning method to detect EBS;(2)highest classification accuracy for emoji classification measured as 97.63%.
文摘I was sent twice by CAFIU to work at Beijing Office of the Friedrich-Ebert-Stiftung(FES) of Germany when I had the honor to get familiar with my German colleagues. This helped me know more about the national character of the German people. Moved by their friendly sentiments towards the Chinese people and their deep interest in
文摘This study examines the relationship between positive and negative investor sentiments and stock market returns and volatility in Group of 20 countries using variousmethods, including panel regression with fixed effects, panel quantile regressions, apanel vector autoregression (PVAR) model, and country-specific regressions. We proxyfor negative and positive investor sentiments using the Google Search Volume Indexfor terms related to the coronavirus disease (COVID-19) and COVID-19 vaccine, respectively. Using weekly data from March 2020 to May 2021, we document significantrelationships between positive and negative investor sentiments and stock marketreturns and volatility. Specifically, an increase in positive investor sentiment leads toan increase in stock returns while negative investor sentiment decreases stock returnsat lower quantiles. The effect of investor sentiment on volatility is consistent acrossthe distribution: negative sentiment increases volatility, whereas positive sentimentreduces volatility. These results are robust as they are corroborated by Granger causalitytests and a PVAR model. The findings may have portfolio implications as they indicatethat proxies for positive and negative investor sentiments seem to be good predictorsof stock returns and volatility during the pandemic.
文摘Bike sharing is considered a state-of-the-art transportation program. It is ideal for short or medium trips providing riders the ability to pick up a bike at any self-serve bike station and return it to any bike station located within the system’s coverage area. The bike sharing programs in the United States are still very young compared to those in European countries. Washington DC was the first jurisdiction to devise a third generation bike sharing system in the US in 2008. To evaluate the popularity of a bike sharing program, a sentiment analysis of the riders’ feedback can be performed. Twitter is a great platform to understand people’s views instantly. Social media mining is, thus, gaining popularity in many research areas including transportation. Social media mining has two major advantages over conventional attitudinal survey methods—it can easily reach a large audience and it can reflect the true behavior of participants because of the anonymity social media provides. It is known that self-imposed censor is common in responding to conversational attitudinal surveys. This study performed text mining on the tweets related to a case study (Capital Bike share of Washington DC) to perform sentiment analysis or opinion mining. The results of the text mining mostly revealed higher positive sentiments towards the current system.
文摘The coronavirus disease(COVID-19)pandemic has affected the lives of social media users in an unprecedentedmanner.They are constantly posting their satisfaction or dissatisfaction over the COVID-19 situation at their location of interest.Therefore,understanding location-oriented sentiments about this situation is of prime importance for political leaders,and strategic decision-makers.To this end,we present a new fully automated algorithm based on artificial intelligence(AI),for extraction of location-oriented public sentiments on the COVID-19 situation.We designed the proposed system to obtain exhaustive knowledge and insights on social media feeds related to COVID-19 in 110 languages through AI-based translation,sentiment analysis,location entity detection,and decomposition tree analysis.We deployed fully automated algorithm on live Twitter feed from July 15,2021 and it is still running as of 12 January,2022.The system was evaluated on a limited dataset between July 15,2021 to August 10,2021.During this evaluation timeframe 150,000 tweets were analyzed and our algorithm found that 9,900 tweets contained one or more location entities.In total,13,220 location entities were detected during the evaluation period,and the rates of average precision and recall rate were 0.901 and 0.967,respectively.As of 12 January,2022,the proposed solution has detected 43,169 locations using entity recognition.According to the best of our knowledge,this study is the first to report location intelligence with entity detection,sentiment analysis,and decomposition tree analysis on social media messages related to COVID-19 and has covered the largest set of languages.
文摘Healthcare organizations rely on patients’feedback and experiences to evaluate their performance and services,thereby allowing such organizations to improve inadequate services and address any shortcomings.According to the literature,social networks and particularly Twitter are effective platforms for gathering public opinions.Moreover,recent studies have used natural language processing to measure sentiments in text segments collected from Twitter to capture public opinions about various sectors,including healthcare.The present study aimed to analyze Arabic Twitter-based patient experience sentiments and to introduce an Arabic patient experience corpus.The authors collected 12,400 tweets from Arabic patients discussing patient experiences related to healthcare organizations in Saudi Arabia from 1 January 2008 to 29 January 2022.The tweets were labeled according to sentiment(positive or negative)and sector(public or private),and thereby the Hospital Patient Experiences in Saudi Arabia(HoPE-SA)dataset was produced.A simple statistical analysis was conducted to examine differences in patient views of healthcare sectors.The authors trained five models to distinguish sentiments in tweets automatically with the following schemes:a transformer-based model fine-tuned with deep learning architecture and a transformer-based model fine-tuned with simple architecture,using two different transformer-based embeddings based on Bidirectional Encoder Representations from Transformers(BERT),Multi-dialect Arabic BERT(MAR-BERT),and multilingual BERT(mBERT),as well as a pretrained word2vec model with a support vector machine classifier.This is the first study to investigate the use of a bidirectional long short-term memory layer followed by a feedforward neural network for the fine-tuning of MARBERT.The deep-learning fine-tuned MARBERT-based model—the authors’best-performing model—achieved accuracy,micro-F1,and macro-F1 scores of 98.71%,98.73%,and 98.63%,respectively.
基金This work was supported in part by the UTAR Research Fund(IPSR/RMC/U TARRF/2020-C1/R01).
文摘Millions of people are connecting and exchanging information on social media platforms,where interpersonal interactions are constantly being shared.However,due to inaccurate or misleading information about the COVID-19 pandemic,social media platforms became the scene of tense debates between believers and doubters.Healthcare professionals and public health agencies also use social media to inform the public about COVID-19 news and updates.However,they occasionally have trouble managing massive pandemic-related rumors and frauds.One reason is that people share and engage,regardless of the information source,by assuming the content is unquestionably true.On Twitter,users use words and phrases literally to convey their views or opinion.However,other users choose to utilize idioms or proverbs that are implicit and indirect to make a stronger impression on the audience or perhaps to catch their attention.Idioms and proverbs are figurative expressions with a thematically coherent totality that cannot understand literally.Despite more than 10%of tweets containing idioms or slang,most sentiment analysis research focuses on the accuracy enhancement of various classification algorithms.However,little attention would decipher the hidden sentiments of the expressed idioms in tweets.This paper proposes a novel data expansion strategy for categorizing tweets concerning COVID-19.The following are the benefits of the suggested method:1)no transformer fine-tuning is necessary,2)the technique solves the fundamental challenge of the manual data labeling process by automating the construction and annotation of the sentiment lexicon,3)the method minimizes the error rate in annotating the lexicon,and drastically improves the tweet sentiment classification’s accuracy performance.
文摘Forecasting economic indices on the basis of information extracted from text documents, like newspaper articles is an attractive idea. With the help of text mining techniques, in particular sentiment analysis, we evaluate the tone of individual New York Times (NYT) articles and compare our results to the Chicago Fed National Activity Index (CFNAI). In this paper, we present a simple, intuitive framework to derive sentiment scores from text documents In particular articles are tagged based on terms and their connotated sentiment. Subsequently, we forecast the CFNAI movements via support vector machines (SVM) trained on a subset of the observed sentiment scores. We apply our model into two different data sets, the whole NYT articles and the articles categorized as NYT business news. On both data sets, we applied a simple performance measure to evaluate forecasting accuracy of the CFNAI
文摘Previously, rapid disease detection and prevention was difficult. This is because disease modeling and prediction was dependent on a manually obtained dataset that includes use of survey. With the increased use of social media platforms like Twitter, Facebook, Instagram, etc., data mining and sentiment analysis can help avoid diseases. Sentiment analysis is a powerful tool for analyzing people’s perceptions, emotions, value assessments, attitudes, and feelings as expressed in texts. The purpose of this research is to use machine learning techniques to classify and predict the spatial distribution of positive and negative sentiments of Covid-19 pandemic. This study research has employed machine learning to classify spatial distribution of Covid-19 <span style="font-family:Verdana;">twitter sentiments as positive or negative. The data for this study were geo-tagged</span><span style="font-family:Verdana;"> tweets concerning COVID-19 which were live streamed using streamR package. The key terms used for streaming the data were</span><span style="font-family:Verdana;">:</span><span style="font-family:Verdana;"> Corona, Covid-19, sanitizer, virus, lockdown, quarantine, and social distance. The classification used Naive Bayes algorithms with ngram approaches. N-Gram model is a probabilistic language model used to predict next item in a sequence in the form (n</span><span style="font-family:;" "=""> <span style="font-family:Verdana;">-</span></span><span style="font-family:;" "=""> </span><span style="font-family:Verdana;">1) order Markov. It relies on the Markov assumption—the probability of a word depends only on the previous word without looking too far into the past. The steps followed in this research include</span><span style="font-family:Verdana;">: </span><span style="font-family:;" "=""><span style="font-family:Verdana;">cleaning and preprocessing the data, text tokenization using n-gram </span><i><span style="font-family:Verdana;">i.e.</span></i><span style="font-family:Verdana;"> 1-gram, 2-gram, and 3-gram, tweets were converted or weighted into a matrix of numeric vectors using Term Frequency Inverse-Document. Also, data were divided 80:20 between train and test data. A confusion matrix was utilized to evaluate the classification accuracy, precision, and recall performance of the various algorithms tested. Prediction was done using the best performing Naive Bayes algorithm. The results of this research showed that under Multinomial Naive Bayes, unigram accuracy was 92.02%, bigram accuracy was 97.37%, and trigram accuracy was 94.40%. Unigram had 89.34% accuracy, bigram had 96.80%, and trigram had 94.90% accuracy using Bernoulli Naive Bayes. Unigram accuracy was 90.43%, bigram accuracy was 95.67%, and trigram accuracy was 92.89% using Gaussian Naive Bayes. Bigram tokenization outperformed unigram and trigram tokenization. Bigram Multinomial Naive Bayes was used to predict test data since it was the most accurate in classifying train data. Prediction </span><span style="font-family:Verdana;">accuracy was 84.92%, precision 85.50%, recall 81.02%, and F1 measure 83.20%</span><span style="font-family:Verdana;">. TF-IDF was employed to increase prediction accuracy, obtaining 87.06%. These were then plotted on a globe map. The study indicates that machine learning can identify patterns and emotions in public tweets, which may then be used to steer targeted intervention programs aimed at limiting disease spread.</span></span>
文摘Huge amount of data is being produced every second for microblogs,different content sharing sites,and social networking.Sentimental classification is a tool that is frequently used to identify underlying opinions and sentiments present in the text and classifying them.It is widely used for social media platforms to find user’s sentiments about a particular topic or product.Capturing,assembling,and analyzing sentiments has been challenge for researchers.To handle these challenges,we present a comparative sentiment analysis study in which we used the fine-grained Stanford Sentiment Treebank(SST)dataset,based on 215,154 exclusive texts of different lengths that are manually labeled.We present comparative sentiment analysis to solve the fine-grained sentiment classification problem.The proposed approach takes start by pre-processing the data and then apply eight machine-learning algorithms for the sentiment classification namely Support Vector Machine(SVM),Logistic Regression(LR),Neural Networks(NN),Random Forest(RF),Decision Tree(DT),K-Nearest Neighbor(KNN),Adaboost and Naïve Bayes(NB).On the basis of results obtained the accuracy,precision,recall and F1-score were calculated to draw a comparison between the classification approaches being used.
基金the National Natural Science Foundation of China (No. 61602159)the Natural Science Foundation of Heilongjiang Province (No. F201430)+1 种基金the Innovation Talents Project of Science and Technology Bureau of Harbin (No. 2017RAQXJ094)the fundamental research funds of universities in Heilongjiang Province, special fund of Heilongjiang University (No. HDJCCX-201608).
文摘Social applications such as Weibo have provided a quick platform for information propagation, which have led to an explosive propagation for hot topic. User sentiments about propagation information play an important role in propagation speed, which receive more and more attention from data mining field. In this paper, we propose an sentiment-based hot topics prediction model called PHT-US. PHT-US firstly classifies a large amount of text data in Weibo into different topics, then converts user sentiments and time factors into embedding vectors that are input into recurrent neural networks (both LSTM and GRU), and predicts whether the target topic could be a hot spot. Experiments on Sina Weibo show that PHT-US can effectively predict the hot topics in the future. Social applications such as Weibo provide a platform for quick information propagation, which leads to an explosive propagation for hot topics. User sentiments about propagation information play an important role in propagation speed, and thus receive more attention from data mining field. In this paper, a sentiment-based hot topics prediction model called PHT-US is proposed. Firstly a large amount of text data in Weibo was classified into different topics, and then user sentiments and time factors were converted into embedding vectors that are input into recurrent neural networks (both LSTM and GRU), and future hotspots were predicted. Experiments on Sina Weibo show that PHT-US can effectively predict hot topics in the future.
文摘亚当·斯密是一位特别注重修辞的学者,要想准确把握其道德理论核心,需要认真分析其代表作《道德情操论》的文本。The Theory of Moral Sentiments(TMS)标题中的moral sentiments是指人类在道德判断上的一种基本能力,是包含同情、良知、审美以及道德推理等多方面内容的,其根源在于人类以自己同情共感的能力经验到各种道德实践,又通过归纳、反思和推理来将其一般化,最后上升为指导道德抉择和道德行为的原理。斯密道德论的核心绝非"道德情操"本身,而是各种道德情感得以形成的同情共感机制。现在被广泛接受的中文翻译书名《道德情操论》容易误导读者,而翻译成《道德情感论》更符合斯密道德理论的核心内涵。
文摘Online incremental learning is one of the emerging research interests among the researchers in the recent years.The sentiment classification through the online incremental learning faces many challenges due to the limitations in the memory and the computing resources available for processing the online reviews.This work has introduced an online incremental learning algorithm for classifying the train reviews.The sentiments available in the reviews provided for the public services are necessary for improving the quality of the service.This work proposes the online kernel optimizationbased support vector machine(OKO-SVM)classifier for the sentiment classification of the train reviews.This paper is the extension of the previous work kernel optimizationbased support vector machine(KO-SVM).The OKO-SVM classifier uses the proposed fuzzy bound for modifying the weight for each incoming review database for the particular time duration.The simulation uses the standard train review and the movie review database for the classification.From the simulation results,it is evident that the proposed model has achieved a better performance with the values of 84.42%,93.86%,and 74.56%regarding the accuracy,sensitivity,and specificity while classifying the train review database.
文摘The valuation of cryptocurrencies is important given the increasing significance of this potential asset class.However,most state-of-the-art cryptocurrency valuation methods only focus on one of the fundamental factors or sentiments and use out-of-date data sources.In this study,a robust cryptocurrency valuation method is developed using up-to-date datasets.Using various panel regression models and moving-window regression tests,the impacts of fundamental factors and sentiments in the valuation of cryptocurrencies are explored with data covering from January 1,2009 to April 30,2023.The research shows the importance of sentiments and suggests that the fear and greed index can indicate when to make a cryptocurrency investment,while Google search interest in cryptocurrency is crucial when choosing the appropriate type of cryptocurrency.Moreover,consensus mechanism and initial coin offering have significant effects on cryptocurrencies without stablecoins,while their impacts on cryptocurrencies with stablecoins are insignificant.Other fundamental factors,such as the type of supply and the presence of smart contracts,do not have a significant influence on cryptocurrency.Findings from this study can enhance cryptocurrency marketisation and provide insightful guidance for investors,portfolio managers,and policymakers in assessing the utility level of each cryptocurrency.
基金supported by the National Key Research and Development Program(Nos.2021YFF0901705,2021YFF0901700)the State Key Laboratory of Media Convergence and Communication,Communication University of China+1 种基金the Fundamental Research Funds for the Central Universitiesthe High-Quality and Cutting-Edge Disciplines Construction Project for Universities in Beijing(Internet Information,Communication University of China).
文摘Aspect-Based Sentiment Analysis(ABSA)is a fundamental area of research in Natural Language Processing(NLP).Within ABSA,Aspect Sentiment Quad Prediction(ASQP)aims to accurately identify sentiment quadruplets in target sentences,including aspect terms,aspect categories,corresponding opinion terms,and sentiment polarity.However,most existing research has focused on English datasets.Consequently,while ASQP has seen significant progress in English,the Chinese ASQP task has remained relatively stagnant.Drawing inspiration from methods applied to English ASQP,we propose Chinese generation templates and employ prompt-based instruction learning to enhance the model’s understanding of the task,ultimately improving ASQP performance in the Chinese context.Ultimately,under the same pre-training model configuration,our approach achieved a 5.79%improvement in the F1 score compared to the previously leading method.Furthermore,when utilizing a larger model with reduced training parameters,the F1 score demonstrated an 8.14%enhancement.Additionally,we suggest a novel evaluation metric based on the characteristics of generative models,better-reflecting model generalization.Experimental results validate the effectiveness of our approach.
文摘Modern technological advancements have made social media an essential component of daily life.Social media allow individuals to share thoughts,emotions,and ideas.Sentiment analysis plays the function of evaluating whether the sentiment of the text is positive,negative,neutral,or any other personal emotion to understand the sentiment context of the text.Sentiment analysis is essential in business and society because it impacts strategic decision-making.Sentiment analysis involves challenges due to lexical variation,an unlabeled dataset,and text distance correlations.The execution time increases due to the sequential processing of the sequence models.However,the calculation times for the Transformer models are reduced because of the parallel processing.This study uses a hybrid deep learning strategy to combine the strengths of the Transformer and Sequence models while ignoring their limitations.In particular,the proposed model integrates the Decoding-enhanced with Bidirectional Encoder Representations from Transformers(BERT)attention(DeBERTa)and the Gated Recurrent Unit(GRU)for sentiment analysis.Using the Decoding-enhanced BERT technique,the words are mapped into a compact,semantic word embedding space,and the Gated Recurrent Unit model can capture the distance contextual semantics correctly.The proposed hybrid model achieves F1-scores of 97%on the Twitter Large Language Model(LLM)dataset,which is much higher than the performance of new techniques.
基金supported in part by the Guangzhou Science and Technology Plan Project under Grants 2024B03J1361,2023B03J1327,and 2023A04J0361in part by the Open Fund Project of Hubei Province Key Laboratory of Occupational Hazard Identification and Control under Grant OHIC2023Y10+3 种基金in part by the Guangdong Province Ordinary Colleges and Universities Young Innovative Talents Project under Grant 2023KQNCX036in part by the Special Fund for Science and Technology Innovation Strategy of Guangdong Province(Climbing Plan)under Grant pdjh2024a226in part by the Key Discipline Improvement Project of Guangdong Province under Grant 2022ZDJS015in part by theResearch Fund of Guangdong Polytechnic Normal University under Grants 22GPNUZDJS17 and 2022SDKYA015.
文摘In the context of the accelerated pace of daily life and the development of e-commerce,online shopping is a mainstreamway for consumers to access products and services.To understand their emotional expressions in facing different shopping experience scenarios,this paper presents a sentiment analysis method that combines the ecommerce reviewkeyword-generated imagewith a hybrid machine learning-basedmodel,inwhich theWord2Vec-TextRank is used to extract keywords that act as the inputs for generating the related images by generative Artificial Intelligence(AI).Subsequently,a hybrid Convolutional Neural Network and Support Vector Machine(CNNSVM)model is applied for sentiment classification of those keyword-generated images.For method validation,the data randomly comprised of 5000 reviews from Amazon have been analyzed.With superior keyword extraction capability,the proposedmethod achieves impressive results on sentiment classification with a remarkable accuracy of up to 97.13%.Such performance demonstrates its advantages by using the text-to-image approach,providing a unique perspective for sentiment analysis in the e-commerce review data compared to the existing works.Thus,the proposed method enhances the reliability and insights of customer feedback surveys,which would also establish a novel direction in similar cases,such as social media monitoring and market trend research.
基金funded by National Key Research and Development Program of China(No.2022YFC3302103).
文摘The emergence of new media in various fields has continuously strengthened the social aspect of social media.Netizens tend to express emotions in social interactions,and many people even use satire,metaphors,and other techniques to express some negative emotions,it is necessary to detect sarcasm in social comment data.For sarcasm,the more reference data modalities used,the better the experimental effect.This paper conducts research on sarcasm detection technology based on image-text fusion data.To effectively utilize the features of each modality,a feature reconstruction output algorithm is proposed.This algorithm is based on the attention mechanism,learns the low-rank features of another modality through cross-modality,the eigenvectors are reconstructed for the corresponding modality through weighted averaging.When only the image modality in the dataset is used,the preprocessed data has outstanding performance in reconstructing the output model,with an accuracy rate of 87.6%.When using only the text modality data in the dataset,the reconstructed output model is optimal,with an accuracy rate of 85.2%.To improve feature fusion between modalities for effective classification,a weight adaptive learning algorithm is used.This algorithm uses a neural network combined with an attention mechanism to calculate the attention weight of each modality to achieve weight adaptive learning purposes,with an accuracy rate of 87.9%.Extensive experiments on a benchmark dataset demonstrate the superiority of our proposed model.