A feature extraction, which means extracting the representative words from a text, is an important issue in text mining field. This paper presented a new Apriori and N-gram based Chinese text feature extraction method...A feature extraction, which means extracting the representative words from a text, is an important issue in text mining field. This paper presented a new Apriori and N-gram based Chinese text feature extraction method, and analyzed its correctness and performance. Our method solves the question that the exist extraction methods cannot find the frequent words with arbitrary length in Chinese texts. The experimental results show this method is feasible.展开更多
In order to improve the ability of sharing and scheduling capability of English teaching resources, an improved algorithm for English text summarization is proposed based on Association semantic rules. The relative fe...In order to improve the ability of sharing and scheduling capability of English teaching resources, an improved algorithm for English text summarization is proposed based on Association semantic rules. The relative features are mined among English text phrases and sentences, the semantic relevance analysis and feature extraction of keywords in English abstract are realized, the association rules differentiation for English text summarization is obtained based on information theory, related semantic roles information in English Teaching Texts is mined. Text similarity feature is taken as the maximum difference component of two semantic association rule vectors, and combining semantic similarity information, the accurate extraction of English text Abstract is realized. The simulation results show that the method can extract the text summarization accurately, it has better convergence and precision performance in the extraction process.展开更多
Stylistics is an interdisciplinary study which applies modem linguistic theories and approaches to research style of language use. Taking stylistics as a base, this paper gives a detailed analysis of stylistic devices...Stylistics is an interdisciplinary study which applies modem linguistic theories and approaches to research style of language use. Taking stylistics as a base, this paper gives a detailed analysis of stylistic devices used in Hillary's Inspiring Speech in Yale University from the perspective of linguistic description, textual analysis and situational factors, and then explores the general stylistic features of public speech, thus providing some enlightenment for public speakers in the future.展开更多
Liao songs are the cultural and artistic products brewed by the people of Zhuang ethnic minority for thousand years. In this paper, the style and characteristics of singing Zhuang ethnic minority's Liao songs with m...Liao songs are the cultural and artistic products brewed by the people of Zhuang ethnic minority for thousand years. In this paper, the style and characteristics of singing Zhuang ethnic minority's Liao songs with male's two-part voice in Guangxi are mainly introduced through an analysis of vocal music and the study on the performance forms, singing language characteristics, and vocal music and resonance is mainly included, and also the important significance of singing Zhuang ethnic minority's Liao songs with male's two-part voice is discussed. Also, it is compared with the modern Chinese folk singing styles.展开更多
Corpus-based linguistic approach is one of the most used text studies. Nowadays, stylistic analysis has been adopted to shed new light on tourism English. The topic is to apply the language theory--the stylistic analy...Corpus-based linguistic approach is one of the most used text studies. Nowadays, stylistic analysis has been adopted to shed new light on tourism English. The topic is to apply the language theory--the stylistic analysis to the tourist text analysis, to discover the style essence of Tourist English Hypertext. The stylistic features include graphological analysis of Tourist English Hypertext, lexical features in hypertext, syntactical analysis of Tourist English Hypertext. It summarizes online Tourist English Hypertext information with some typical samples, with the methods of examples and analysis. It aims to offer an in-depth insight into the stylistic features of online tourism English texts, helping people to grasp the key points of the online information when they are browsing the information on the Internet. So this paper both enlarges the application of the stylistic analysis and presents summary for online tourist information.展开更多
This paper analyzes the language employed in the representations of women in Doukhobor Russian ritual texts called "ncanMbf' (psalms) from the viewpoint of linguistic text analysis performed in the Russian traditi...This paper analyzes the language employed in the representations of women in Doukhobor Russian ritual texts called "ncanMbf' (psalms) from the viewpoint of linguistic text analysis performed in the Russian tradition of folklore stylistics. While addressing the representations of Biblical female characters, such as the Holy Virgin Mary Magdalene, the Heavenly Bride, and the Whore of Babylon, along with the portrayals of Doukhobor and other women, the paper identifies stylistic features in their textual descriptions. The study establishes connections between Doukhobor texts and Russian folklore and liturgical tradition. The paper strives to identify the place of Doukhobor psalms as an integral part of the Russian literary and folklore heritage展开更多
For the complex questions of Chinese question answering system, we propose an answer extraction method with discourse structure feature combination. This method uses the relevance of questions and answers to learn to ...For the complex questions of Chinese question answering system, we propose an answer extraction method with discourse structure feature combination. This method uses the relevance of questions and answers to learn to rank the answers. Firstly, the method analyses questions to generate the query string, and then submits the query string to search engines to retrieve relevant documents. Sec- ondly, the method makes retrieved documents seg- mentation and identifies the most relevant candidate answers, in addition, it uses the rhetorical relations of rhetorical structure theory to analyze the relationship to determine the inherent relationship between para- graphs or sentences and generate the answer candi- date paragraphs or sentences. Thirdly, we construct the answer ranking model,, and extract five feature groups and adopt Ranking Support Vector Machine (SVM) algorithm to train ranking model. Finally, it re-ranks the answers with the training model and fred the optimal answers. Experiments show that the proposed method combined with discourse structure features can effectively improve the answer extrac- ting accuracy and the quality of non-factoid an- swers. The Mean Reciprocal Rank (MRR) of the an- swer extraction reaches 69.53%.展开更多
The paper considers the problem of semantic processing of web documents by designing an approach, which combines extracted semantic document model and domain- related knowledge base. The knowledge base is populated wi...The paper considers the problem of semantic processing of web documents by designing an approach, which combines extracted semantic document model and domain- related knowledge base. The knowledge base is populated with learnt classification rules categorizing documents into topics. Classification provides for the reduction of the dimensio0ality of the document feature space. The semantic model of retrieved web documents is semantically labeled by querying domain ontology and processed with content-based classification method. The model obtained is mapped to the existing knowledge base by implementing inference algorithm. It enables models of the same semantic type to be recognized and integrated into the knowledge base. The approach provides for the domain knowledge integration and assists the extraction and modeling web documents semantics. Implementation results of the proposed approach are presented.展开更多
Literature about Hemingway's writing style has been fully explored in the past 80 years, while his style has not yet been well probed into from the perspective of corpus-based stylistics. This paper focuses on the an...Literature about Hemingway's writing style has been fully explored in the past 80 years, while his style has not yet been well probed into from the perspective of corpus-based stylistics. This paper focuses on the analysis of his writing style as a news reporter and as a literary writer in order to investigate whether there exists some links between the two or whether his training as a reporter has any effect on his writing as fiction writer. By employing a corpus-based stylistic approach, the paper analyzes in great details Hemingway's early news writing in Kansas City Star ( 1917-1918) and his later fiction writing in the novella The Old Man and the Sea (1952). The stylistic features at lexical level are explored quantitatively in both writings via statistical data. The paper concludes that Hemingway's writing style has his own heritage. The style in his early news articles does find its way into his later fiction writing. There do exist certain similarities in the choices of lexical items and phrasal expressions. It also indicates that a further exploration on their syntactic, rhetorical, and discourse levels is needed ira fuller picture of Hemingway's writing style is to be unveiled.展开更多
文摘A feature extraction, which means extracting the representative words from a text, is an important issue in text mining field. This paper presented a new Apriori and N-gram based Chinese text feature extraction method, and analyzed its correctness and performance. Our method solves the question that the exist extraction methods cannot find the frequent words with arbitrary length in Chinese texts. The experimental results show this method is feasible.
文摘In order to improve the ability of sharing and scheduling capability of English teaching resources, an improved algorithm for English text summarization is proposed based on Association semantic rules. The relative features are mined among English text phrases and sentences, the semantic relevance analysis and feature extraction of keywords in English abstract are realized, the association rules differentiation for English text summarization is obtained based on information theory, related semantic roles information in English Teaching Texts is mined. Text similarity feature is taken as the maximum difference component of two semantic association rule vectors, and combining semantic similarity information, the accurate extraction of English text Abstract is realized. The simulation results show that the method can extract the text summarization accurately, it has better convergence and precision performance in the extraction process.
文摘Stylistics is an interdisciplinary study which applies modem linguistic theories and approaches to research style of language use. Taking stylistics as a base, this paper gives a detailed analysis of stylistic devices used in Hillary's Inspiring Speech in Yale University from the perspective of linguistic description, textual analysis and situational factors, and then explores the general stylistic features of public speech, thus providing some enlightenment for public speakers in the future.
文摘Liao songs are the cultural and artistic products brewed by the people of Zhuang ethnic minority for thousand years. In this paper, the style and characteristics of singing Zhuang ethnic minority's Liao songs with male's two-part voice in Guangxi are mainly introduced through an analysis of vocal music and the study on the performance forms, singing language characteristics, and vocal music and resonance is mainly included, and also the important significance of singing Zhuang ethnic minority's Liao songs with male's two-part voice is discussed. Also, it is compared with the modern Chinese folk singing styles.
文摘Corpus-based linguistic approach is one of the most used text studies. Nowadays, stylistic analysis has been adopted to shed new light on tourism English. The topic is to apply the language theory--the stylistic analysis to the tourist text analysis, to discover the style essence of Tourist English Hypertext. The stylistic features include graphological analysis of Tourist English Hypertext, lexical features in hypertext, syntactical analysis of Tourist English Hypertext. It summarizes online Tourist English Hypertext information with some typical samples, with the methods of examples and analysis. It aims to offer an in-depth insight into the stylistic features of online tourism English texts, helping people to grasp the key points of the online information when they are browsing the information on the Internet. So this paper both enlarges the application of the stylistic analysis and presents summary for online tourist information.
文摘This paper analyzes the language employed in the representations of women in Doukhobor Russian ritual texts called "ncanMbf' (psalms) from the viewpoint of linguistic text analysis performed in the Russian tradition of folklore stylistics. While addressing the representations of Biblical female characters, such as the Holy Virgin Mary Magdalene, the Heavenly Bride, and the Whore of Babylon, along with the portrayals of Doukhobor and other women, the paper identifies stylistic features in their textual descriptions. The study establishes connections between Doukhobor texts and Russian folklore and liturgical tradition. The paper strives to identify the place of Doukhobor psalms as an integral part of the Russian literary and folklore heritage
基金supported by the National Nature Science Foundation of China under Grants No.60863011,No.61175068,No.61100205,No.60873001the Fundamental Research Funds for the Central Universities under Grant No.2009RC0212+1 种基金the National Innovation Fund for Technology based Firms under Grant No.11C26215305905the Open Fund of Software Engineering Key Laboratory of Yunnan Province under Grant No.2011SE14
文摘For the complex questions of Chinese question answering system, we propose an answer extraction method with discourse structure feature combination. This method uses the relevance of questions and answers to learn to rank the answers. Firstly, the method analyses questions to generate the query string, and then submits the query string to search engines to retrieve relevant documents. Sec- ondly, the method makes retrieved documents seg- mentation and identifies the most relevant candidate answers, in addition, it uses the rhetorical relations of rhetorical structure theory to analyze the relationship to determine the inherent relationship between para- graphs or sentences and generate the answer candi- date paragraphs or sentences. Thirdly, we construct the answer ranking model,, and extract five feature groups and adopt Ranking Support Vector Machine (SVM) algorithm to train ranking model. Finally, it re-ranks the answers with the training model and fred the optimal answers. Experiments show that the proposed method combined with discourse structure features can effectively improve the answer extrac- ting accuracy and the quality of non-factoid an- swers. The Mean Reciprocal Rank (MRR) of the an- swer extraction reaches 69.53%.
文摘The paper considers the problem of semantic processing of web documents by designing an approach, which combines extracted semantic document model and domain- related knowledge base. The knowledge base is populated with learnt classification rules categorizing documents into topics. Classification provides for the reduction of the dimensio0ality of the document feature space. The semantic model of retrieved web documents is semantically labeled by querying domain ontology and processed with content-based classification method. The model obtained is mapped to the existing knowledge base by implementing inference algorithm. It enables models of the same semantic type to be recognized and integrated into the knowledge base. The approach provides for the domain knowledge integration and assists the extraction and modeling web documents semantics. Implementation results of the proposed approach are presented.
文摘Literature about Hemingway's writing style has been fully explored in the past 80 years, while his style has not yet been well probed into from the perspective of corpus-based stylistics. This paper focuses on the analysis of his writing style as a news reporter and as a literary writer in order to investigate whether there exists some links between the two or whether his training as a reporter has any effect on his writing as fiction writer. By employing a corpus-based stylistic approach, the paper analyzes in great details Hemingway's early news writing in Kansas City Star ( 1917-1918) and his later fiction writing in the novella The Old Man and the Sea (1952). The stylistic features at lexical level are explored quantitatively in both writings via statistical data. The paper concludes that Hemingway's writing style has his own heritage. The style in his early news articles does find its way into his later fiction writing. There do exist certain similarities in the choices of lexical items and phrasal expressions. It also indicates that a further exploration on their syntactic, rhetorical, and discourse levels is needed ira fuller picture of Hemingway's writing style is to be unveiled.