8Manku G S, Jain A. Detecting Near Duplicates for Web Crawling [ J ]. www2007/Track: Data Mining, 2007 : 141 - 149.
9Ye Shaozhi,Wen Ji Rong. A systematic study of pa- rameter correlations in large scale duplicate docu- ment detection[ C]//Proceedings of the 10th Pacific- Asia Conference on Knowledge Discovery and Data Mining. Springer - Verlag Berlin Heidelberg: PAK- DD,2006 : 275 - 284.
10Manku G S, Jain A, Sarma A D. Detecting near - du- plicates for web crawling [ J ]. WWW 2007/Track: Data Mining: Similiarlty Search ,2007,15 ( 8 ): 141 - 149.