As a major criterion for textuality and a prominent term in discourse analysis, discourse cohesion is used on the one hand, to identify the linguistic features that cause the sentences to "cohere", and on the other ...As a major criterion for textuality and a prominent term in discourse analysis, discourse cohesion is used on the one hand, to identify the linguistic features that cause the sentences to "cohere", and on the other hand, is to make the sentences in the discourse display some kind of mutual dependence. The paper has intensively analyzed the radio interview between Edward Heath and an interviewer from the perspective of discourse cohesion. After an in-depth analysis, the paper concludes that the interview is quite structurally cohesive by adopting several grammatical cohesive devices or ties, such as the verbal form, the time relator, the conjunction, the reference, the substitution, and the ellipsis, especially the reference and conjunction展开更多
Text alignment is crucial to the accuracy of MT (Machine Translation) systems, some NLP (Natural Language Processing) tools or any other text processing tasks requiring bilingual data. This research proposes a lan...Text alignment is crucial to the accuracy of MT (Machine Translation) systems, some NLP (Natural Language Processing) tools or any other text processing tasks requiring bilingual data. This research proposes a language independent sentence alignment approach based on Polish (not position-sensitive language) to English experiments. This alignment approach was developed on the TED (Translanguage English Database) talks corpus, but can be used for any text domain or language pair. The proposed approach implements various heuristics for sentence recognition. Some of them value synonyms and semantic text structure analysis as a part of additional information. Minimization of data loss was ensured. The solution is compared to other sentence alignment implementations. Also an improvement in MT system score with text processed with the described tool is shown.展开更多
文摘As a major criterion for textuality and a prominent term in discourse analysis, discourse cohesion is used on the one hand, to identify the linguistic features that cause the sentences to "cohere", and on the other hand, is to make the sentences in the discourse display some kind of mutual dependence. The paper has intensively analyzed the radio interview between Edward Heath and an interviewer from the perspective of discourse cohesion. After an in-depth analysis, the paper concludes that the interview is quite structurally cohesive by adopting several grammatical cohesive devices or ties, such as the verbal form, the time relator, the conjunction, the reference, the substitution, and the ellipsis, especially the reference and conjunction
文摘Text alignment is crucial to the accuracy of MT (Machine Translation) systems, some NLP (Natural Language Processing) tools or any other text processing tasks requiring bilingual data. This research proposes a language independent sentence alignment approach based on Polish (not position-sensitive language) to English experiments. This alignment approach was developed on the TED (Translanguage English Database) talks corpus, but can be used for any text domain or language pair. The proposed approach implements various heuristics for sentence recognition. Some of them value synonyms and semantic text structure analysis as a part of additional information. Minimization of data loss was ensured. The solution is compared to other sentence alignment implementations. Also an improvement in MT system score with text processed with the described tool is shown.