
A Plan Reuse Mechanism for LLM-Driven Agents
Abstract: Integrating large language models (LLMs) into personal assistants, such as Xiao Ai and Blue Heart V, effectively enhances their ability to interact with humans, solve complex tasks, and manage IoT devices. Such assistants are also termed LLM-driven agents. Upon receiving a user request, the LLM-driven agent first generates a plan using an LLM, then executes the plan through various tools and returns the response to the user. During this process, the latency of generating a plan with an LLM can reach tens of seconds, significantly degrading the user experience. Analysis of a real-world dataset shows that about 30% of the requests received by LLM-driven agents are identical or similar, which allows previously generated plans to be reused to reduce response latency. However, it is difficult to accurately determine the similarity between requests by directly evaluating their original texts. Moreover, the diverse expressions of natural language and the unstructured format of LLM-generated plan texts make effective plan reuse challenging. To address these issues, we present and implement AgentReuse, a plan reuse mechanism for LLM-driven agents. AgentReuse leverages the semantic similarities and differences among requests, using intent classification to evaluate request similarity and enable plan reuse. Experimental results on a real-world dataset demonstrate that AgentReuse achieves a 93% effective plan reuse rate, an F1 score of 0.9718 and an accuracy of 0.9459 in evaluating request similarity, and reduces latency by 93.12% compared with a baseline without the reuse mechanism.
Authors: Li Guopeng, Wu Ruiqi, Tan Haisheng, Chen Guoliang (School of Computer Science and Technology, University of Science and Technology of China, Hefei 230027)
Source: Journal of Computer Research and Development (EI, CSCD, PKU Core Journal), 2024, No. 11, pp. 3706-3720 (15 pages)
Funding: Science and Technology Innovation 2030 "New Generation Artificial Intelligence" Major Project (2021ZD0110400); Key Program of the National Natural Science Foundation of China (62132009); Fundamental Research Funds for the Central Universities.
Keywords: artificial intelligence of things; large language models (LLMs); agent; semantic cache; similarity evaluation
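The core idea described in the abstract — classify each incoming request by intent, and on a match reuse the plan cached for that intent instead of invoking the slow LLM planner — can be sketched as follows. This is a minimal illustrative sketch only: the class and function names are hypothetical, and the keyword-overlap classifier is a toy stand-in for the paper's actual intent-classification method and plan format.

```python
def intent_of(request, intents):
    """Toy intent classifier (stand-in for the real one): pick the intent
    whose keyword list overlaps most with the request; None if no match."""
    words = set(request.lower().split())
    best, best_score = None, 0
    for intent, keywords in intents.items():
        score = len(words & set(keywords))
        if score > best_score:
            best, best_score = intent, score
    return best

class PlanCache:
    """Semantic cache keyed by intent: similar requests map to the same
    intent and therefore reuse the same previously generated plan."""
    def __init__(self, intents, llm_plan):
        self.intents = intents    # intent -> keyword list (toy classifier data)
        self.llm_plan = llm_plan  # fallback planner (stand-in for an LLM call)
        self.plans = {}           # intent -> cached plan
        self.hits = 0

    def get_plan(self, request):
        intent = intent_of(request, self.intents)
        if intent is not None and intent in self.plans:
            self.hits += 1                  # fast path: reuse, no LLM call
            return self.plans[intent]
        plan = self.llm_plan(request)       # slow path: generate a new plan
        if intent is not None:
            self.plans[intent] = plan       # remember it for similar requests
        return plan
```

Two differently phrased requests with the same intent then share one plan: `get_plan("please turn on the light")` generates and caches a plan, and `get_plan("turn the lights on")` returns it from the cache without another planner call.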

