This paper had developed and tested optimized content extraction algorithm using NLP method, TFIDF method for word of weight, VSM for information search, cosine method for similar quality calculation from learning doc...This paper had developed and tested optimized content extraction algorithm using NLP method, TFIDF method for word of weight, VSM for information search, cosine method for similar quality calculation from learning document at the distance learning system database. This test covered following things: 1) to parse word structure at the distance learning system database documents and Cyrillic Mongolian language documents at the section, to form new documents by algorithm for identifying word stem;2) to test optimized content extraction from text material based on e-test results (key word, correct answer, base form with affix and new form formed by word stem without affix) at distance learning system, also to search key word by automatically selecting using word extraction algorithm;3) to test Boolean and probabilistic retrieval method through extended vector space retrieval method. This chapter covers: to process document content extraction retrieval algorithm, to propose recommendations query through word stem, not depending on word position based on Cyrillic Mongolian language documents distinction.展开更多
以蒙古文编码国家标准的研制及其系统实现方面的工作为基础,针对蒙古文复杂文本布局引擎(CTL Engine)及其OpenType字库的系统结构,提出蒙古文复杂文本布局引擎的标准符合性测试(Conformance Test for Standards)方案,定义蒙古文复杂文...以蒙古文编码国家标准的研制及其系统实现方面的工作为基础,针对蒙古文复杂文本布局引擎(CTL Engine)及其OpenType字库的系统结构,提出蒙古文复杂文本布局引擎的标准符合性测试(Conformance Test for Standards)方案,定义蒙古文复杂文本布局引擎的测试点及其测试实例,并以关键软件系统为依托测试和分析Uniscribe和HarfBuzz等支持蒙古文的复杂文本布局引擎。展开更多
文摘This paper had developed and tested optimized content extraction algorithm using NLP method, TFIDF method for word of weight, VSM for information search, cosine method for similar quality calculation from learning document at the distance learning system database. This test covered following things: 1) to parse word structure at the distance learning system database documents and Cyrillic Mongolian language documents at the section, to form new documents by algorithm for identifying word stem;2) to test optimized content extraction from text material based on e-test results (key word, correct answer, base form with affix and new form formed by word stem without affix) at distance learning system, also to search key word by automatically selecting using word extraction algorithm;3) to test Boolean and probabilistic retrieval method through extended vector space retrieval method. This chapter covers: to process document content extraction retrieval algorithm, to propose recommendations query through word stem, not depending on word position based on Cyrillic Mongolian language documents distinction.
文摘以蒙古文编码国家标准的研制及其系统实现方面的工作为基础,针对蒙古文复杂文本布局引擎(CTL Engine)及其OpenType字库的系统结构,提出蒙古文复杂文本布局引擎的标准符合性测试(Conformance Test for Standards)方案,定义蒙古文复杂文本布局引擎的测试点及其测试实例,并以关键软件系统为依托测试和分析Uniscribe和HarfBuzz等支持蒙古文的复杂文本布局引擎。