[9] Cognitive Science Laboratory. WordNet: a Lexical Database for English[OL]. [2006-10-04]. http://ccl.pku.edu.cn/doubtfire/semantics/WordNet/C-wordnet/wordnet-c-index.html.
[10] Gupta S, Kaiser G, Neistadt D, et al. DOM-Based Content Extraction of HTML Documents[C]//Proceedings of the 12th International Conference on World Wide Web. New York: ACM Press, 2003: 207-214.