In the era of Big Data,we are faced with an inevitable and challenging problem of“overload information”.To alleviate this problem,it is important to use effective automatic text summarization techniques to obtain th...In the era of Big Data,we are faced with an inevitable and challenging problem of“overload information”.To alleviate this problem,it is important to use effective automatic text summarization techniques to obtain the key information quickly and efficiently from the huge amount of text.In this paper,we propose a hybrid method of extractive text summarization based on deep learning and graph ranking algorithms(ETSDG).In this method,a pre-trained deep learning model is designed to yield useful sentence embeddings.Given the association between sentences in raw documents,a traditional LexRank algorithm with fine-tuning is adopted fin ETSDG.In order to improve the performance of the extractive text summarization method,we further integrate the traditional LexRank algorithm with deep learning.Testing results on the data set DUC2004 show that ETSDG has better performance in ROUGE metrics compared with certain benchmark methods.展开更多
With the flooding of pornographic information on the Internet, how to keep people away from that offensive information is becoming one of the most important research areas in network information security. Some applica...With the flooding of pornographic information on the Internet, how to keep people away from that offensive information is becoming one of the most important research areas in network information security. Some applications which can block or filter such information are used. Approaches in those systems can be roughly classified into two kinds: metadata based and content based. With the development of distributed technologies, content based filtering technologies will play a more and more important role in filtering systems. Keyword matching is a content based method used widely in harmful text filtering. Experiments to evaluate the recall and precision of the method showed that the precision of the method is not satisfactory, though the recall of the method is rather high. According to the results, a new pornographic text filtering model based on reconfirming is put forward. Experiments showed that the model is practical, has less loss of recall than the single keyword matching method, and has higher precision.展开更多
Rapid and precise location of the faults of on-board equipment of train control system is a significant factor to ensure reliable train operation.Text data of the fault tracking table of on-board equipment are taken a...Rapid and precise location of the faults of on-board equipment of train control system is a significant factor to ensure reliable train operation.Text data of the fault tracking table of on-board equipment are taken as samples,and an on-board equipment fault diagnosis model is designed based on the combination of convolutional neural network(CNN)and particle swarm optimization-support vector machines(PSO-SVM).Due to the characteristics of high dimensionality and sparseness of fault text data,CNN is used to achieve feature extraction.In order to decrease the influence of the imbalance of the fault sample data category on the classification accuracy,the PSO-SVM algorithm is introduced.The fully connected classification part of CNN is replaced by PSO-SVM,the extracted features are classified precisely,and the intelligent diagnosis of on-board equipment fault is implemented.According to the test analysis of the fault text data of on-board equipment recorded by a railway bureau and comparison with other models,the experimental results indicate that this model can obviously upgrade the evaluation indexes and can be used as an effective model for fault diagnosis for on-board equipment.展开更多
Compared with the traditional method of adding sentences to get summary in multi-document summarization,a two-stage sentence selection approach based on deleting sentences in acandidate sentence set to generate summar...Compared with the traditional method of adding sentences to get summary in multi-document summarization,a two-stage sentence selection approach based on deleting sentences in acandidate sentence set to generate summary is proposed,which has two stages,the acquisition of acandidate sentence set and the optimum selection of sentence.At the first stage,the candidate sentenceset is obtained by redundancy-based sentence selection approach.At the second stage,optimum se-lection of sentences is proposed to delete sentences in the candidate sentence set according to itscontribution to the whole set until getting the appointed summary length.With a test corpus,theROUGE value of summaries gotten by the proposed approach proves its validity,compared with thetraditional method of sentence selection.The influence of the token chosen in the two-stage sentenceselection approach on the quality of the generated summaries is analyzed.展开更多
As a newly developed theory, narrative ethics has its reasonability and advantages in that it can not only analyze either the contents or the forms of the texts, but also make an analysis of the combination of both co...As a newly developed theory, narrative ethics has its reasonability and advantages in that it can not only analyze either the contents or the forms of the texts, but also make an analysis of the combination of both contents and forms. This article, supported by James Phelan's rhetorical narrative theory as the theoretical base, attempts to explore and interpret narrative judgments and its implied ethics existing in The Child in Time by Ian McEwan so as to observe the hidden aesthetic orientation, the value judgments and the ethical intentions of the text and help to reveal the author's views of narrative ethics and aesthetics of the novel.展开更多
First. we use graph theory to further clarify information of nodes and topics. Next, our paper analyzes the factor which affects the nodes probability of being conspirators. According to requirement 1, each node is gi...First. we use graph theory to further clarify information of nodes and topics. Next, our paper analyzes the factor which affects the nodes probability of being conspirators. According to requirement 1, each node is given an initial probability in being a conspirator on the basis of the acquired information.Then we conduct calculations with the iterative equation produced by factor analysis to get the priority list of the 83 given nodes. In addition, according to requirement 2, we make some changes of the nodes information before solving the iterativc modcl above. Compared with former result, some changes of priority and probability of being conspirator emerges.Finally, based upon requirement 3, we pick out some infomaation from some certain topic by semantic analysis and text analysis. A new group of indexes are solved out with TOPSIS to finish the information-gathering period. The terminal indicator, containing the information of nodes and topics, is a weighted average value of the indexes obtained above and the indexes obtained in requirement 1 with the method of the variation coefficient.展开更多
The objective of this paper was to analyze the impact of online reviews on sales. Based on dual path model of commodity sales, an online reviews impact on the relationship between various factors, and then the theoret...The objective of this paper was to analyze the impact of online reviews on sales. Based on dual path model of commodity sales, an online reviews impact on the relationship between various factors, and then the theoretical hypothesis of each factor has been put forward in the model. As it is intuitive and strongly supported empirically, data including Chinese texts captured from Tmall.com was utilized, and then analyzed by SPSS and ROST CM6. Our empirical study on the reviews of Tmall.com indicated that the hypotheses are verified.展开更多
文摘In the era of Big Data,we are faced with an inevitable and challenging problem of“overload information”.To alleviate this problem,it is important to use effective automatic text summarization techniques to obtain the key information quickly and efficiently from the huge amount of text.In this paper,we propose a hybrid method of extractive text summarization based on deep learning and graph ranking algorithms(ETSDG).In this method,a pre-trained deep learning model is designed to yield useful sentence embeddings.Given the association between sentences in raw documents,a traditional LexRank algorithm with fine-tuning is adopted fin ETSDG.In order to improve the performance of the extractive text summarization method,we further integrate the traditional LexRank algorithm with deep learning.Testing results on the data set DUC2004 show that ETSDG has better performance in ROUGE metrics compared with certain benchmark methods.
文摘With the flooding of pornographic information on the Internet, how to keep people away from that offensive information is becoming one of the most important research areas in network information security. Some applications which can block or filter such information are used. Approaches in those systems can be roughly classified into two kinds: metadata based and content based. With the development of distributed technologies, content based filtering technologies will play a more and more important role in filtering systems. Keyword matching is a content based method used widely in harmful text filtering. Experiments to evaluate the recall and precision of the method showed that the precision of the method is not satisfactory, though the recall of the method is rather high. According to the results, a new pornographic text filtering model based on reconfirming is put forward. Experiments showed that the model is practical, has less loss of recall than the single keyword matching method, and has higher precision.
基金Gansu Province Higher Education Innovation Fund Project(No.2020B-104)“Innovation Star”Project for Outstanding Postgraduates of Gansu Province(No.2021CXZX-606)。
文摘Rapid and precise location of the faults of on-board equipment of train control system is a significant factor to ensure reliable train operation.Text data of the fault tracking table of on-board equipment are taken as samples,and an on-board equipment fault diagnosis model is designed based on the combination of convolutional neural network(CNN)and particle swarm optimization-support vector machines(PSO-SVM).Due to the characteristics of high dimensionality and sparseness of fault text data,CNN is used to achieve feature extraction.In order to decrease the influence of the imbalance of the fault sample data category on the classification accuracy,the PSO-SVM algorithm is introduced.The fully connected classification part of CNN is replaced by PSO-SVM,the extracted features are classified precisely,and the intelligent diagnosis of on-board equipment fault is implemented.According to the test analysis of the fault text data of on-board equipment recorded by a railway bureau and comparison with other models,the experimental results indicate that this model can obviously upgrade the evaluation indexes and can be used as an effective model for fault diagnosis for on-board equipment.
基金the National Natural Science Foundation of China(No.60575041)the High Technology Researchand Development Program of China(No.2006AA01Z150).
文摘Compared with the traditional method of adding sentences to get summary in multi-document summarization,a two-stage sentence selection approach based on deleting sentences in acandidate sentence set to generate summary is proposed,which has two stages,the acquisition of acandidate sentence set and the optimum selection of sentence.At the first stage,the candidate sentenceset is obtained by redundancy-based sentence selection approach.At the second stage,optimum se-lection of sentences is proposed to delete sentences in the candidate sentence set according to itscontribution to the whole set until getting the appointed summary length.With a test corpus,theROUGE value of summaries gotten by the proposed approach proves its validity,compared with thetraditional method of sentence selection.The influence of the token chosen in the two-stage sentenceselection approach on the quality of the generated summaries is analyzed.
文摘As a newly developed theory, narrative ethics has its reasonability and advantages in that it can not only analyze either the contents or the forms of the texts, but also make an analysis of the combination of both contents and forms. This article, supported by James Phelan's rhetorical narrative theory as the theoretical base, attempts to explore and interpret narrative judgments and its implied ethics existing in The Child in Time by Ian McEwan so as to observe the hidden aesthetic orientation, the value judgments and the ethical intentions of the text and help to reveal the author's views of narrative ethics and aesthetics of the novel.
文摘First. we use graph theory to further clarify information of nodes and topics. Next, our paper analyzes the factor which affects the nodes probability of being conspirators. According to requirement 1, each node is given an initial probability in being a conspirator on the basis of the acquired information.Then we conduct calculations with the iterative equation produced by factor analysis to get the priority list of the 83 given nodes. In addition, according to requirement 2, we make some changes of the nodes information before solving the iterativc modcl above. Compared with former result, some changes of priority and probability of being conspirator emerges.Finally, based upon requirement 3, we pick out some infomaation from some certain topic by semantic analysis and text analysis. A new group of indexes are solved out with TOPSIS to finish the information-gathering period. The terminal indicator, containing the information of nodes and topics, is a weighted average value of the indexes obtained above and the indexes obtained in requirement 1 with the method of the variation coefficient.
文摘The objective of this paper was to analyze the impact of online reviews on sales. Based on dual path model of commodity sales, an online reviews impact on the relationship between various factors, and then the theoretical hypothesis of each factor has been put forward in the model. As it is intuitive and strongly supported empirically, data including Chinese texts captured from Tmall.com was utilized, and then analyzed by SPSS and ROST CM6. Our empirical study on the reviews of Tmall.com indicated that the hypotheses are verified.