6T Joachims.A Probabilistic Analysis of the Rocchio Algorithm with TFIDF for Text Categorization[C]//Proceedings of the 14th International Conference on Machine Iearning,1997:143-151.
7Arvind Arasu,Jasmine Novak.PageRank Computation and the Structure of the Web:Experiments and Algorithms[C]//Proceedings of the 11th International Conference on World Wide Web,Beijing,2002:221-241.
8R Lempel,S Moran.SAISA:The Stochastic Approach for ISnk-Structure Analysis[J].ACM Transactions.Information Systems (TOIS),2001,19(2):131-160.
7EHRIG M, MAEDCHE A. Ontology-focused crawling of Web documents[A]. Proceedings of the 2003 ACM symposium on Applied computing[C], March 2003.
8GUO Q, GUO H, ZHANG ZQ, et al. Schema Driven Topic Specific Web Crawling[A]. DASFAA[C], 2005.
9GRAUPMANN J, BIWER M, ZIMMER C, et al. COMPASS: A Concept-based Web Search Engine for HTML, XML, and Deep Web Data[A]. Proceedings of the 30th VLDB Conference[C],2004.
10QIN JL, ZHOU YL, CHAU M. Building domain-specific web collections for scientific digital libraries: a meta-search enhanced focused crawling method[A]. Proceedings of the 4th ACM/IEEE-CS joint conference on Digital libraries[C], June 2004.