期刊文献+

作者识别研究综述 被引量:2

A Review on Authorship Identification Research
下载PDF
导出
摘要 作者识别是根据已知文本推断未知文本作者的交叉学科.其传统研究通常基于文学或语言学的经验知识,而现代研究则主要依靠数学方法量化作者的写作风格.近些年,随着认知科学、系统科学和信息技术的发展,作者识别受到越来越多研究者的关注.本文主要站在计算语言学的角度综述作者识别领域现代研究中的方法和思路.首先,简要介绍了作者识别的发展历程.然后,详述了文体风格特征、作者识别方法以及该领域中多层面的研究.接着介绍了与作者识别相关的一些评测、数据集及评价指标.最后,指出该领域存在的一些问题,结合这些问题分析并展望了作者识别的发展趋势. Authorship identification is an interdisciplinary subject of inferring the author of unknown texts based on the known texts.The traditional research of authorship identification is generally based on the empirical know-ledge of literature or linguistics,while the modern research mostly relies on mathematical methods to quantify the author's writing style.In recent years,with the development of cognitive science,system science and information technology,more and more researchers pay attention to authorship identification.This paper mainly reviews the methods and ideas in modern research in the field of authorship identification from the perspective of computation-al linguistics.First,the development history of authorship identification is introduced briefly.Then,the stylometry,authorship identification methods and multi-faceted research in this realm are expounded.Next,some evaluations,data sets and evaluation metrics related to authorship identification are explicated.Finally,some problems in this domain are pointed out,while the development trend of authorship identification is analyzed and forecasted com-bined with these problems.
作者 张洋 江铭虎 ZHANG Yang;JIANG Ming-Hu(Lab of Computational Linguistics,School of Humanities,Tsinghua University,Beijing 100084)
出处 《自动化学报》 EI CAS CSCD 北大核心 2021年第11期2501-2520,共20页 Acta Automatica Sinica
基金 国家自然科学基金(62036001)资助。
关键词 作者识别 文体学 写作风格 评价指标 Authorship identification stylometry writing style evaluation metrics
  • 相关文献

参考文献1

二级参考文献14

  • 1陈大康.从数理语言学看后四十回的作者——与陈炳藻先生商榷[J].红楼梦学刊,1987(1):293-318. 被引量:54
  • 2李贤平.《红楼梦》成书新说[J].复旦学报(社会科学版),1987,29(5):3-16. 被引量:66
  • 3http://www.keenage.com.2005.2.
  • 4Ward Elliott,Robert Valenza.Was the Earl of Oxford the true Shakespeare?[J]A Computer-Aided Analysis.Notes and Queries,38,501-506,April 1991.
  • 5Efstathios.Stamatatos,Nikos.Fakotakis,George.Kokkinakis.Computer-Based Authorship Attribution Without Lexical Measures[J].Computers and the Humanities,Volume 35,Issue 2:193-214,May 2001.
  • 6O.de Vel,A.Anderson,M.Corney,G.Mohay.Mining E-mail Content for Author Identification Forensics.SIGMOD:Special Section on Data Mining for Intrusion Detection and Threat Analysis[C],55-64,2001.
  • 7Jiexun Li,Rong Zheng,Hsinchun Chen.From Fingerprint to Writeprint[J].Communications of the ACM,47(3):70-76,March 2004.
  • 8Carole E.Chaski Empirical evaluations of language-based author identification techniques[J].Forensic Linguistics 8(1):1-65,2001.
  • 9Harald Baayen,Hans van Halteren,Anneke Neijt,Fiona Tweedie.An experiment in authorship attribution[A].In:Proceedings of the 6th International Conference on the Statistical Analysis of Textual Data(JADT 2002)[C]:29-37.
  • 10Niamh McCombe.Methods Of Author Identification[D].B.A.(Mod.) CSLL Final Year Project,2002.

共引文献24

同被引文献55

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部