期刊文献+
共找到3篇文章
< 1 >
每页显示 20 50 100
News Text Topic Clustering Optimized Method Based on TF-IDF Algorithm on Spark 被引量:17
1
作者 Zhuo Zhou Jiaohua Qin +3 位作者 Xuyu Xiang Yun Tan Qiang Liu Neal N.Xiong 《Computers, Materials & Continua》 SCIE EI 2020年第1期217-231,共15页
Due to the slow processing speed of text topic clustering in stand-alone architecture under the background of big data,this paper takes news text as the research object and proposes LDA text topic clustering algorithm... Due to the slow processing speed of text topic clustering in stand-alone architecture under the background of big data,this paper takes news text as the research object and proposes LDA text topic clustering algorithm based on Spark big data platform.Since the TF-IDF(term frequency-inverse document frequency)algorithm under Spark is irreversible to word mapping,the mapped words indexes cannot be traced back to the original words.In this paper,an optimized method is proposed that TF-IDF under Spark to ensure the text words can be restored.Firstly,the text feature is extracted by the TF-IDF algorithm combined CountVectorizer proposed in this paper,and then the features are inputted to the LDA(Latent Dirichlet Allocation)topic model for training.Finally,the text topic clustering is obtained.Experimental results show that for large data samples,the processing speed of LDA topic model clustering has been improved based Spark.At the same time,compared with the LDA topic model based on word frequency input,the model proposed in this paper has a reduction of perplexity. 展开更多
关键词 news text topic clustering spark platform countvectorizer algorithm TF-IDF algorithm latent dirichlet allocation model
下载PDF
Chinese News Text Classification Based on Convolutional Neural Network 被引量:1
2
作者 Hanxu Wang Xin Li 《Journal on Big Data》 2022年第1期41-60,共20页
With the explosive growth of Internet text information,the task of text classification is more important.As a part of text classification,Chinese news text classification also plays an important role.In public securit... With the explosive growth of Internet text information,the task of text classification is more important.As a part of text classification,Chinese news text classification also plays an important role.In public security work,public opinion news classification is an important topic.Effective and accurate classification of public opinion news is a necessary prerequisite for relevant departments to grasp the situation of public opinion and control the trend of public opinion in time.This paper introduces a combinedconvolutional neural network text classification model based on word2vec and improved TF-IDF:firstly,the word vector is trained through word2vec model,then the weight of each word is calculated by using the improved TFIDF algorithm based on class frequency variance,and the word vector and weight are combined to construct the text vector representation.Finally,the combined-convolutional neural network is used to train and test the Thucnews data set.The results show that the classification effect of this model is better than the traditional Text-RNN model,the traditional Text-CNN model and word2vec-CNN model.The test accuracy is 97.56%,the accuracy rate is 97%,the recall rate is 97%,and the F1-score is 97%. 展开更多
关键词 Chinese news text classification word2vec model improved TF-IDF combined-convolutional neural network public opinion news
下载PDF
Realization of Implicit Evaluation in News Text
3
《International English Education Research》 2015年第4期24-26,共3页
Martin's Appraisal System Theory presents a powerful tool to decode the journalist's attitude or world view in news text. The evaluation in this Appraisal System can be further classified into explicit evaluation, w... Martin's Appraisal System Theory presents a powerful tool to decode the journalist's attitude or world view in news text. The evaluation in this Appraisal System can be further classified into explicit evaluation, which inscribes the attitude through the attitudinal vocabulary and implicit evaluation, which invokes the attitude through the use of non-core lexis. Martin (2005) pointed out that the implicit evaluation is realized by three approaches: lexical metaphor, the selection of ideational meanings, and graduation. However, the invoked attitude can be achieved by an amount of resources other than the above approaches. In this paper through the exploration of invoked attitude in the English-language China Doily, we find that the attitude is also invoked through grammatical metaphor, general noun as well as conjunction. It is of great significance to decode the journalists' or newspapers' viewpoints by exploring the implicit evaluation in the news text. 展开更多
关键词 implicit evaluation news text grammatical metaphor general noun CONJUNCTION
下载PDF
上一页 1 下一页 到第
使用帮助 返回顶部