Triplet deep hashing learning method for judicial case similarity matching (cited by 3)
Abstract: Matching similar cases across a large collection of judicial case documents can effectively improve the efficiency of judicial departments. However, judicial case texts are not only long but also structurally complex, so matching them is harder than traditional natural language processing tasks. To address this problem, this paper proposes a judicial case similarity matching method based on a triplet deep hashing learning model. First, a pre-trained Chinese BERT model extracts features from each document in groups. Next, the triplet similarity relations among documents are used to train a deep neural network that generates a hash code representation for each document. Finally, the Hamming distance between document hash codes determines whether two cases are similar. Experimental results show that the hashing learning approach greatly reduces the storage cost of document feature representations and speeds up similar case matching.
Authors: LI Jiamin; LIU Xingbo; NIE Xiushan; GUO Jie; YIN Yilong (School of Software, Shandong University, Ji'nan 250101, China; School of Computer Science and Technology, Shandong Jianzhu University, Ji'nan 250101, China)
Source: CAAI Transactions on Intelligent Systems (《智能系统学报》), indexed in CSCD and the Peking University Core Journals list, 2020, Issue 6, pp. 1147-1153 (7 pages)
Funding: National Key R&D Program of China (2018YFC0830100, 2018YFC0830102).
Keywords: judicial cases; case matching; similarity retrieval; hashing learning; deep learning; neural network; BERT model; triplets
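The last two steps described in the abstract (learning hash codes from triplets, then comparing codes by Hamming distance) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names, the zero threshold used for binarization, the squared-Euclidean hinge form of the triplet loss, the margin value, and the 8-dimensional toy vectors are all assumptions made for illustration. The BERT feature extraction step is omitted, so the toy vectors stand in for the real-valued outputs of a trained network.

```python
import numpy as np


def binarize(outputs: np.ndarray) -> np.ndarray:
    """Threshold real-valued network outputs at zero to get 0/1 hash bits (assumed rule)."""
    return (outputs > 0).astype(np.uint8)


def pack_code(bits: np.ndarray) -> np.ndarray:
    """Pack a 0/1 bit vector into bytes, 8 bits per byte, for compact storage."""
    return np.packbits(bits)


def hamming_distance(code_a: np.ndarray, code_b: np.ndarray) -> int:
    """Count the bits that differ between two packed hash codes of equal length."""
    return int(np.unpackbits(np.bitwise_xor(code_a, code_b)).sum())


def triplet_hinge_loss(anchor, positive, negative, margin=2.0) -> float:
    """Assumed hinge-style triplet loss on relaxed (real-valued) codes.

    It is zero once the anchor is closer to the positive than to the
    negative by at least `margin` in squared Euclidean distance.
    """
    d_pos = float(np.sum((anchor - positive) ** 2))
    d_neg = float(np.sum((anchor - negative) ** 2))
    return max(0.0, margin + d_pos - d_neg)


# Hypothetical relaxed codes for a triplet of documents (A, B, C),
# where B is the case labelled as more similar to A.
doc_a = np.array([0.9, -0.8, 0.7, -0.6, 0.5, -0.4, 0.3, -0.2])
doc_b = np.array([0.9, -0.8, 0.7, -0.6, 0.5, -0.4, 0.3, 0.2])  # last sign differs
doc_c = -doc_a                                                  # every sign differs

code_a, code_b, code_c = (pack_code(binarize(d)) for d in (doc_a, doc_b, doc_c))

dist_ab = hamming_distance(code_a, code_b)  # 1 differing bit
dist_ac = hamming_distance(code_a, code_c)  # 8 differing bits

print("B is the closer match to A:", dist_ab < dist_ac)
print("bytes as float64 features:", doc_a.nbytes, "| bytes as hash code:", code_a.nbytes)
```

In this toy setup each 8-dimensional float64 vector occupies 64 bytes while its packed hash code occupies 1 byte, and comparing two codes reduces to an XOR followed by a bit count. That is the general mechanism behind the storage and speed gains the abstract reports, although the paper's actual code length and network are not given in this record.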