Mining the interests of Chinese microbloggers via keyword extraction (cited by: 26)

Abstract: Microblogging provides a new platform for communicating and sharing information among Web users. Users can express opinions and record their daily lives using microblogs. The microblogs that users post indicate their interests to some extent. We aim to mine user interests via keyword extraction from microblogs. Traditional keyword extraction methods are usually designed for formal documents such as news articles or scientific papers. Messages posted by microblogging users, however, are usually noisy and full of new words, which poses a challenge for keyword extraction. In this paper, we combine a translation-based method with a frequency-based method for keyword extraction. In our experiments, we extract keywords for microblog users from the largest microblogging website in China, Sina Weibo. The results show that our method can identify users' interests accurately and efficiently.
Source: Frontiers of Computer Science (SCIE, EI, CSCD), 2012, Issue 1, pp. 76-87 (12 pages). Chinese title: 中国计算机科学前沿(英文版)
Keywords: microblogging, Sina Weibo, Chinese keyword extraction, user interests