This paper had developed and tested optimized content extraction algorithm using NLP method, TFIDF method for word of weight, VSM for information search, cosine method for similar quality calculation from learning doc...This paper had developed and tested optimized content extraction algorithm using NLP method, TFIDF method for word of weight, VSM for information search, cosine method for similar quality calculation from learning document at the distance learning system database. This test covered following things: 1) to parse word structure at the distance learning system database documents and Cyrillic Mongolian language documents at the section, to form new documents by algorithm for identifying word stem;2) to test optimized content extraction from text material based on e-test results (key word, correct answer, base form with affix and new form formed by word stem without affix) at distance learning system, also to search key word by automatically selecting using word extraction algorithm;3) to test Boolean and probabilistic retrieval method through extended vector space retrieval method. This chapter covers: to process document content extraction retrieval algorithm, to propose recommendations query through word stem, not depending on word position based on Cyrillic Mongolian language documents distinction.展开更多
With the help of Big Data and Citespace software, this research makes a statistical analysis of the journals anddissertations on College English teaching and learning materials collected by CNKI from 2011 to 2020. Thi...With the help of Big Data and Citespace software, this research makes a statistical analysis of the journals anddissertations on College English teaching and learning materials collected by CNKI from 2011 to 2020. This paper,based on the knowledge map drawn by the visualized analysis of literatures volume, authors, research institutions,and keywords clustering, analyzes the current research status and hotspots in the compilation of China’s CollegeEnglish textbooks.展开更多
文摘This paper had developed and tested optimized content extraction algorithm using NLP method, TFIDF method for word of weight, VSM for information search, cosine method for similar quality calculation from learning document at the distance learning system database. This test covered following things: 1) to parse word structure at the distance learning system database documents and Cyrillic Mongolian language documents at the section, to form new documents by algorithm for identifying word stem;2) to test optimized content extraction from text material based on e-test results (key word, correct answer, base form with affix and new form formed by word stem without affix) at distance learning system, also to search key word by automatically selecting using word extraction algorithm;3) to test Boolean and probabilistic retrieval method through extended vector space retrieval method. This chapter covers: to process document content extraction retrieval algorithm, to propose recommendations query through word stem, not depending on word position based on Cyrillic Mongolian language documents distinction.
基金This research is based on the“Construction of Chinese Academic Translation Team Project”(ZP1823105)of the Characteristic Humanities and Social Science Discipline Construction in 2018,and“2019-2020 Postgraduate Teaching Book Project Translation and International Communication”,East China University of Science and Technology.
文摘With the help of Big Data and Citespace software, this research makes a statistical analysis of the journals anddissertations on College English teaching and learning materials collected by CNKI from 2011 to 2020. This paper,based on the knowledge map drawn by the visualized analysis of literatures volume, authors, research institutions,and keywords clustering, analyzes the current research status and hotspots in the compilation of China’s CollegeEnglish textbooks.