8Gupta S, Kaiser G, Neistadt D, et al. DOM- Based Content Extraction of HTML Documents[C]//Proceeding of the 12th International Conference on World Wide Web. New York: ACM Press,2003 : 207 - 214.
9CAI Deng, YU Shi - peng, Wen Ji - rong, et al. Extracting Content Structure for Web Pages based on Visual Representation[C]//Proceeding of the 5th Asia Pacific Web Conference. Berlin: Springer - Verlag, 2003: 406 - 417.
10Zheng Shuyi, Song Ruihua, Wen Ji - Rong. Template - Independent News Extraction Based on Visua/Consistency[ C]//The 22nd Conference on Artificial Intelligence. Vancouver: AAAI Press, 2007:1507 - 1511.