摘要
双重否定结构是一种“通过两次否定表示肯定意义”的特殊结构,直接影响自然语言处理中的语义判断与情感分类。该文以“??P==>P”为标准,对现代汉语中所有的“否定词+否定词”结构进行了遍历研究,将双重否定结构按照格式分为了3大类,25小类,常用双重否定结构或构式132个。结合动词的叙实性、否定焦点、语义否定与语用否定等理论,该文归纳了双重否定结构的三大成立条件,并据此设计实现了基于规则的双重否定结构自动识别程序。程序实验的精确率为98.80%,召回率为98.90%,F1值为98.85%。同时,程序还从96281句语料中获得了8640句精确率约为99.20%的含有双重否定结构的句子,为基于统计的深度学习模型提供了语料支持。
The double negation structure is a special structure of“expressing positive meaning through two negations”,in which the two negations have an important impact on the semantic analysis and emotional classification in natural language processing.Taking“P==>P”as the prototype,this paper examines the“negation word+negation word”structures in modern Chinese,and divides them into 3 categories,25 sub-categories and 132 constructions in total.Then this paper proposes three conditions for the establishment of the double negation structure,and a rule-based method to identify the double negation.The accuracy rate of recognition of the double negation structure is 98.80%,the recall rate is 98.90%,and the F 1 value is 98.95%.The proposed method could identify 8640 sentences with 99.20%true double negation structure from 96281 sentences.
作者
王昱
袁毓林
WANG Yu;YUAN Yulin(Department of Chinese and Bilingual Studies,The Hong Kong Polytechnic University,Hongkong 999077,China;Department of Chinese Language and Literature,Faculty of Arts and Humanities,University of Macao,Macao 519000,China;Department of Chinese Language and Literature/Center for Chinese Linguistics,Peking University,Beijing 100871,China)
出处
《中文信息学报》
CSCD
北大核心
2024年第2期36-45,共10页
Journal of Chinese Information Processing
基金
国家科技创新2030“新一代人工智能”重大项目(2020AAA0106701)
国家社会科学基金(18ZDA295)。
关键词
双重否定
自动识别程序
语义分析
double negation
automatic recognition program
semantic analysis