期刊文献+

微博文本和传统文本体裁特征对比 被引量:1

Comparison of Genre Features Between Micro-blog Text and Traditional Text
下载PDF
导出
摘要 在总结常用特征集合的基础上,根据微博文本的特点以及特征选取原则,选取了适合微博文本体裁分析的特征集合,这些特征能典型的反应微博文本和其他文本形式的区别.还分别对不同的文本体裁进行特征值的统计,并将统计结果在不同的文本体裁之间进行了深入的对比分析,并从体裁的角度分析出不同文本体裁的特征值差别的原因.并从体裁特征的角度说明微博文本是一种新的体裁文本. After summarizing common feature set, according to micro-blog text's characteristics and feature selection principles, this paper selects some feature sets which are suitable for micro-blog text genre analysis. These features can typically reflect the differences between micro-blog text and other text forms. It performs a statistical analysis on different text genre ,and inputs the statistical result to comparison analysis among various text genres, and finds out reasons for characteristic value differences from the perspective of genre feature. It also proves that micro-blog text is a new text genre.
出处 《南华大学学报(自然科学版)》 2015年第2期87-90,96,共5页 Journal of University of South China:Science and Technology
基金 湖南省社科基金资助项目(14YBA335) 湖南省研究生科研创新基金资助项目(2014SCX16) 湖南省自然科学基金资助项目(13JJ4076) 湖南省教育厅优秀青年基金资助项目(13B101) 南华大学重点学科和创新团队建设基金资助项目 衡阳市科技局科技计划基金资助项目(2013KG66 2013KG67)
关键词 微博文本 传统文本 体裁 特征项 micro-blog text traditional text genre feature
  • 相关文献

参考文献8

二级参考文献68

  • 1黄永光,刘挺,车万翔,胡晓光.面向变异短文本的快速聚类算法[J].中文信息学报,2007,21(2):63-68. 被引量:17
  • 2Chul S, Kong Joo Lee. Multiple Sets of Features for Automatic Genre Classification of Web Documents[J]. Information Processing & Management, 2005, 41(5): 1263-1276.
  • 3Brett K, Geoffrey N, Hinrich S. Automatic Detection of Text Genre[C]//Proc. of the 35th Annual Meeting on Association for Computational Linguistics. Madrid, Spain: [s. n.], 1997.
  • 4Yong Bae Lee, Hyon M. Text Genre Classification with Genre-revealing and Subject-revealing Features[C]//Proc. of the 25th Annual lnt'l ACM SIGIR Conf. on Research and Development in Information Retrieval. Tampere, Finland: [s. n.], 2002.
  • 5Aidan F, Nicholas K. Learning to Classify Documents According to Genre[J]. Journal of the American Society for Information Science and Technology, 2006, 57(11): 1506-1518.
  • 6彭京,杨冬青,唐世渭,付艳,蒋汉奎.一种基于语义内积空间模型的文本聚类算法[J].计算机学报,2007,30(8):1354-1363. 被引量:44
  • 7Barbara H Kwasnik,Kevin Crowston.Genres of Digital Documents[A].In:Proceedings of the 37th Annual Hawaii International Conference on System Sciences (HICSS'04)[C],Big Island,Hawaii,2004.
  • 8Douglas Biber.Using register-diversified corpora for general language study[J].Computational Linguistics,1993,19:219-241.
  • 9Brett Kessler,Geoffrey Nunberg,Hinrich Schutze.Automatic Detection of Text Genre[A].In:Proceedings of 35th Annual Meeting of Association for Computational Linguistics and 8th Conference of European Chapter of Association for Computational Linguistics[C],Madrid,Spain,1997,32-38.
  • 10E.Stamatatos,N.Fakotakis,& G.Kokkinakis,Text genre detection using common word frequencies[A].In:Proceedings of 18 International Conference on Computational Linguistics[C],Luxemburg,2001,808-814.

共引文献67

同被引文献3

引证文献1

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部