Abstract: A Web page typically contains many information blocks. Apart from the main content blocks, it usually has blocks such as navigation panels, copyright and privacy notices, and advertisements. We call these noisy blocks. Noise in Web pages can seriously harm Web data mining. To eliminate this noise, we introduce a new tree structure, called the Style Tree, and study an algorithm for constructing a site style tree. The Style Tree model is used to detect and eliminate noise in any Web page of the site. An information-based measure for determining which element nodes are noisy is also constructed. In addition, the applications of this method are discussed in detail. Experimental results show that our noise elimination technique improves the mining results significantly. Key words: noise elimination; DOM tree; style tree; Web mining. CLC number: TP 339. Foundation item: Supported by the National Natural Science Foundation of China (60003013). Biography: ZHAN Cheng-li (1979-), male, Master candidate, research direction: Intelligent Information System.
Abstract: Since Henri Holec first put forward the term "autonomy" in the 1980s, autonomous learning has drawn wide attention from scholars both at home and abroad. Promoting learners' ability for self-regulated learning has been taken as one of the important goals of modern education. College English autonomous learning in a network environment does not mean free study without any restraint or monitoring, but rather involves both self-monitoring and external monitoring. Meanwhile, different learners may have different cognitive styles in their learning processes, which may influence how much their efficiency in autonomous language learning improves. Proper monitoring models coordinating with the students' different field cognitive styles.
Funding: the National Natural Science Foundation of China (Grant No. 60475022); the Natural Science Foundation of Shanxi Province of China (Grant No. 20041041); the Shanxi Province Foundation for Returned Overseas Scholars (No. 2002004).