法律案件要素识别混合专家大模型

Mixture of Expert Large Language Model for Legal Case Element Recognition

下载PDF

导出

摘要智能司法判决正向符合法律判案逻辑的方向转变。案件要素识别是近年来提出的一项基础任务。相比于前期的基于深度学习和机器阅读理解的识别方法,采用大模型的生成式要素识别方法具有进行复杂推理的潜力。但是,目前司法大模型在这类基础任务上的效果不佳。提出了一种对话式混合专家要素识别大模型。该模型针对案件特点设计了特定的Prompt,供ChatGLM3-6B-base大模型学习;通过全参微调该大模型获得基础要素识别能力,其权重供后续混合专家共享,降低大模型学习成本;针对不同案件类型场景和标签不平衡场景,在大模型的注意力层引入案件DoRA专家和标签DoRA专家模块,提高模型对任务的区分度;设计可学习门控实现标签专家选择。在CAIL2019和某省脱敏盗窃案件要素识别数据集上,对比了三类方法的九个基准模型,并进行模型消融实验。实验结果显示,提出的模型综合性能F1值高于最优模型性能5.9个百分点;在标签不平衡的CAIL2019数据集上,标签专家一定程度上能够减缓数据极度不平衡给模型带来的影响;同时,CAIL2019上训练的模型不再需要全参微调,通过案件专家和标签专家轻量级微调后,在某省盗窃案件中取得最佳效果,证明模型具有易扩展性。 The intelligent judicial decision-making is gradually aligning with the logic of legal adjudication.Case element recognition is a fundamental task proposed in recent years.Compared with earlier methods based on deep learning and machine reading comprehension,the generative element recognition approach using large language models(LLM)holds greater potential for complex reasoning.However,the current performance of judicial LLM on these fundamental tasks remains suboptimal.This paper introduces a conversational mixture of expert element recognition LLM.The proposed model in this paper first designs specific prompts tailored to the characteristics of cases for the ChatGLM3-6B-base model.The LLM is then fine-tuned with full parameters to acquire basic element recognition capabilities,with its weights shared among subsequent hybrid experts to reduce learning costs.To address different case types and label imbalance scenarios,case-specific DoRA experts and label-specific DoRA experts are integrated into the LLM’s attention layer,enhancing the model’s ability to differentiate between tasks.A learnable gating mechanism is also designed to facilitate the selection of label experts.The proposed model is tested on the CAIL2019 dataset and a desensitized theft case element recognition dataset from a certain province,nine benchmark models across three types of methods are compared,and ablation experiments are conducted.Experimental results show that the proposed model’s overall performance,measured by the F1 score,exceeds the best-performance model by 5.9 percentage points.On the label-imbalanced CAIL2019 dataset,the label expert effectively mitigates the impact of extreme data imbalance.Additionally,without repeated full-parameter fine-tuning,the basic model trained on CAIL2019 achieves optimal results in theft cases of a certain province after lightweight fine-tuning by case and label experts,demonstrating the model’s scalability.

作者尹华吴梓浩柳婷婷张佳佳高子千 YIN Hua;WU Zihao;LIU Tingting;ZHANG Jiajia;GAO Ziqian(School of Digital Economics,Guangdong University of Finance&Economics,Guangzhou 510320,China;School of Informatics,Guangdong University of Finance&Economics,Guangzhou 510320,China)

机构地区广东财经大学数字经济学院广东财经大学信息学院

出处《计算机科学与探索》 CSCD 北大核心 2024年第12期3260-3271,共12页 Journal of Frontiers of Computer Science and Technology

基金教育部人文社会科学研究青年基金项目(21YJCZH202) 广东省普通高校创新团队项目(2022WCXTD008) 广东省法学会法学研究委托课题项目(GDLS(2024C12))。

关键词案件要素识别大模型混合参数高效专家提示词 legal case element recognition large language model mixture of parameter-efficiency expert prompt

分类号 TP391.1 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

1姜燕.推进产学研用深度融合的实践探索与思考[J].出版广角,2024(5):43-47. 被引量：2
2李晓光,郑小娜.图书馆镜像服务存在的法律风险及应对方案[J].数字图书馆论坛,2024,20(9):57-63.
3曾祥鑫.人工智能的司法应用困境及完善路径[J].争议解决,2024,10(9):8-15.
4无.我国技能人才队伍建设工作成就与展望[J].中国培训,2024(10):13-15.
5陈琦.关于重大职务犯罪案件指定管辖制度的研究[J].汉江师范学院学报,2024,44(5):110-115.
6刘忆冰.检察公益诉讼视域下的生物多样性保护[J].人民检察,2024(S01):60-61.
7谢天圻,吴媛媛,敬超,孙伟恒.GAN模型生成图像检测方法综述[J].计算机工程与应用,2024,60(22):74-86.
8无.江西彭泽:联盟“智”量撬动“质”量[J].中国人才,2023(11):80-80.
9王娟,樊畅,管雨翔.基于CNN-LSTM-Attention模型的盗窃犯罪分析与预测[J].大数据时代,2024(10):27-40.
10周正宽.社会组织培育社区社会资本的路径研究——以成都市H社区为例[J].现代商贸工业,2024,45(21):5-8.

计算机科学与探索

2024年第12期

浏览历史

内容加载中请稍等...

法律案件要素识别混合专家大模型

相关作者

相关机构

相关主题

浏览历史