摘要
幽默在人类交流中扮演着重要角色,并大量存在于情景喜剧中。笑点(punchline)是情景喜剧实现幽默效果的形式之一,在情景喜剧笑点识别任务中,每条句子的标签代表该句是否为笑点,但是以往的笑点识别工作通常只通过建模上下文语义关系识别笑点,对标签的利用并不充分。为了充分利用标签序列中的信息,该文提出了一种结合条件随机场的单词级-句子级多任务学习方法,该方法在两方面进行了改进,首先将标签序列中相邻两个标签之间的转移关系看作幽默理论中不一致性的一种体现,并使用条件随机场学习这种转移关系。其次,由于通过相邻标签之间的转移关系以及上下文语义关系均能够学习到铺垫和笑点之间的不一致性,我们引入了多任务学习方法,让模型同时学习每条句子的句义、组成每条句子的所有字符的词义、单词级别的标签转移关系,以及句子级别的标签转移关系,使模型能够结合两种关系信息提高笑点识别的性能。该文在CCL2020“小牛杯”幽默计算——情景喜剧笑点识别评测任务的英文数据集上进行实验,结果表明,该文提出的方法比同期最好的方法F1值上提高了3.2%,在情景喜剧幽默笑点识别任务上取得了最好的效果,并通过消融实验证明了上述两方面改进的有效性。
Humor plays an important role in human communication and is abundant in sitcoms.Punchline is one of a form to achieve humorous effects in sitcoms.The existing punchlines recognition methods only recognize the punchline by modeling the contextual semantic relationship.In contrast,this paper proposes a new method based on multi-task learning model.First,we regard the transfer relationship between two tags as a manifestation of inconsistency in humor theory,and we use the conditional random field to learn this transfer relationship.Secondly,learning the transfer relationship between adjacent tags and the contextual semantic relationship can both capture the inconsistency between the setup and punchline,and we introduce the multi-task learning method to learn the meaning of each sentence,the meaning of all the characters that make up each sentence,the label transfer relationship at the word level and the label transfer relationship at the sentence level.Experiments on the English data set of CCL2020“Mavericks Cup”humorous calculation-sitcom punchlines recognition and evaluation task.show that the proposed method is 3.2% higher than the current best method,achieving the best effect on the punchlines recognition task.
作者
张童越
张绍武
林鸿飞
徐博
杨亮
ZHANG Tongyue;ZHANG Shaowu;LIN Hongfei;XU Bo;YANG Liang(Dalian University of Technology,Dalian,Liaoning116024,China)
出处
《中文信息学报》
CSCD
北大核心
2023年第11期142-150,共9页
Journal of Chinese Information Processing
基金
国家自然科学基金(62076046,62076051)。
关键词
情感分析
幽默计算
多任务学习
条件随机场
sentiment analysis
humorous calculation
multi-task learning
conditional random field