Abstract: This study presents results from a sentiment analysis of dynamic message sign (DMS) content, focusing on messages that include the number of road fatalities. As a traffic management tool, DMS plays a role in influencing driver behavior and assisting transportation agencies in achieving safe and efficient traffic movement. However, the psychological and behavioral effects of displaying fatality numbers on DMS remain poorly understood; hence, it is important to know the potential impacts of displaying such messages. The Iowa Department of Transportation displays the number of fatalities on a first screen, followed by a supplemental message intended to promote safe driving; an example is “19 TRAFFIC DEATHS THIS YEAR IF YOU HAVE A SUPER BOWL DON’T DRIVE HIGH.” We employ natural language processing to decode the sentiment and undertone of the supplemental message and investigate how they influence driving speeds. According to the results of a mixed-effects model, drivers reduced speeds marginally upon encountering DMS fatality text with a positive sentiment and a neutral undertone. This category was associated with the largest speed reduction, while messages with a negative sentiment and a negative undertone were associated with the second-largest reduction, greater than that of the other combinations, including positive sentiment with a positive undertone.
Funding: Supported by the National Natural Science Foundation of China (Grant Nos. 61402045, 61370197), the Specialized Research Fund for the Doctoral Program of Higher Education (Grant No. 20130005110011), and the National High Technology Research and Development Program (Grant No. 2013AA013301).
Abstract: With the rapid popularization of social applications, social media have developed into an important platform for publishing information and expressing opinions. Detecting hidden topics in the huge amount of user-generated content is of great commercial value and social significance. However, traditional text analysis approaches focus only on the statistical correlation between words and ignore the sentiment tendency and temporal properties, which may strongly affect topic detection results. This paper proposes a Dynamic Sentiment-Topic (DST) model that can not only detect and track dynamic topics but also analyze shifts in the public's sentiment tendency towards a given topic. The Expectation-Maximization algorithm is used in the DST model to estimate the latent distributions, and Gibbs sampling is used to sample the new document set and update the hyperparameters and distributions. Experiments on a real dataset show that the DST model outperforms existing algorithms in terms of topic detection and sentiment accuracy.
Funding: Supported by the National Natural Science Foundation of China (No. 61074078) and the Fundamental Research Funds for the Central Universities (No. 12MS121).
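Abstract: The emotional tendency of the stock market is the main reason for its high uncertainty. Trend prediction methods that rely directly on historical data struggle to adapt to the volatility of market sentiment and perform poorly in practice. To address the difficulty of predicting stock market turning points caused by unstable market sentiment, this paper proposes a hidden semi-Markov model stock turning point prediction method based on a sentiment vector (SV-HSMM). Because market sentiment cannot be observed directly, the main features related to market sentiment are selected and fused into a market sentiment measure using a Markov blanket. A hidden semi-Markov model is used to model the market environment and to construct the structural relationships among market sentiment, market state, and state duration. A sentiment vector is introduced to smooth the volatility of sentiment, and the Kullback-Leibler (KL) distance is used to quantify sentiment heat. Turning points are then predicted through dynamic inference in the hidden semi-Markov model. The results show that the sentiment vector method achieves better prediction performance.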
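The KL-distance step mentioned in the abstract above can be illustrated with a minimal sketch. It assumes that a sentiment vector is represented as a discrete probability distribution over sentiment classes and that sentiment heat is measured against a baseline distribution. Neither detail is stated in the abstract, and the function name `kl_divergence`, the class ordering, and the example values are all hypothetical.

```python
import math


def kl_divergence(p, q, eps=1e-12):
    """Kullback-Leibler divergence D(p || q) for two discrete distributions.

    Terms with p_i == 0 contribute nothing; q_i is floored at eps
    (an assumption of this sketch) to avoid division by zero.
    """
    if len(p) != len(q):
        raise ValueError("distributions must have the same length")
    total = 0.0
    for pi, qi in zip(p, q):
        if pi > 0:
            total += pi * math.log(pi / max(qi, eps))
    return total


# Hypothetical sentiment vectors over (bearish, neutral, bullish).
baseline = [0.3, 0.4, 0.3]  # assumed long-run sentiment distribution
today = [0.1, 0.2, 0.7]     # assumed current sentiment distribution

# A larger value means today's sentiment departs further from the baseline.
heat = kl_divergence(today, baseline)
```

Note that the KL divergence is not symmetric, so `kl_divergence(today, baseline)` and `kl_divergence(baseline, today)` generally differ; which direction the paper uses is not stated in the abstract.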
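Abstract: Because traditional sentiment classification methods for text comments usually ignore the influence of user personality on classification results, a sentiment classification method for text comments based on user personality and semantic-structural features (BF_BiGAC) is proposed. Exploiting the ability of the Big Five personality model to represent user personality effectively, user personality features are obtained from the comment text by computing scores on the different personality dimensions. Exploiting the ability of the bidirectional gated recurrent unit (BiGRU) and the convolutional neural network (CNN) to extract contextual semantic features and local structural features of text, a method for obtaining text semantic-structural features based on BiGRU, CNN, and a two-layer attention mechanism is proposed. To distinguish the influence of different feature types, a hybrid attention layer is introduced to fuse the user personality features with the text semantic-structural features, yielding the final text vector representation. Comparative experiments on four review datasets (IMDB, Yelp-2, Yelp-5, and Ekman) show that BF_BiGAC performs well in both classification accuracy (Accuracy) and weighted macro F1 (F_w). Compared with the sentiment classification method concatenating BiGRU and CNN (BiGRU_CNN), Accuracy improves by 0.020, 0.012, 0.017, and 0.011, respectively; compared with the sentiment classification method concatenating CNN and BiGRU (ConvBiLSTM), F_w improves by 0.022, 0.013, 0.028, and 0.023, respectively. Compared with the pretrained models BERT and RoBERTa, BF_BiGAC achieves higher running efficiency while maintaining classification accuracy.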
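The two evaluation metrics named in the abstract above can be sketched as follows. The abstract does not define its weighted F1 score, so this sketch assumes one common reading: the F1 score of each class, averaged with weights proportional to the class support in the true labels. The function names and the toy labels are hypothetical and are not taken from the paper.

```python
from collections import Counter


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    if not y_true or len(y_true) != len(y_pred):
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def weighted_f1(y_true, y_pred):
    """Per-class F1 averaged with weights equal to each class's share of y_true."""
    if not y_true or len(y_true) != len(y_pred):
        raise ValueError("inputs must be non-empty and of equal length")
    support = Counter(y_true)
    total = len(y_true)
    score = 0.0
    for label in sorted(set(y_true) | set(y_pred)):
        pairs = list(zip(y_true, y_pred))
        tp = sum(1 for t, p in pairs if t == label and p == label)
        fp = sum(1 for t, p in pairs if t != label and p == label)
        fn = sum(1 for t, p in pairs if t == label and p != label)
        denom = 2 * tp + fp + fn
        f1 = 2 * tp / denom if denom else 0.0  # F1 = 2TP / (2TP + FP + FN)
        score += support[label] / total * f1
    return score


# Toy binary example (hypothetical labels).
y_true = ["pos", "pos", "neg", "neg"]
y_pred = ["pos", "neg", "neg", "neg"]

acc = accuracy(y_true, y_pred)    # 3 of 4 predictions are correct
f_w = weighted_f1(y_true, y_pred)  # 0.5 * (2/3) + 0.5 * (4/5)
```

Weighting by support keeps frequent classes from being under-represented in the average, which matters for imbalanced multi-class sets such as a five-star rating scale.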