多模态知识图谱增强葡萄种植问答对的答案选择模型被引量：4

Enhancing answer selection model of grape planting using multimodal knowledge graph

下载PDF

导出

摘要针对传统答案选择模型仅依靠问答对自身信息进行匹配的问题,该研究提出了一种使用多模态知识图谱来增强问答对的答案选择模型。该模型通过设计基于ComplEx(complex embedding)图谱嵌入的方法学习多模态知识图谱嵌入,引入上下文注意力机制并使用CNN网络获取多模态知识图谱的特征表示,使用知识感知注意力方法,将多模态知识图谱提供的背景知识与问答对的文本语义信息融合。以葡萄种植为例,通过搭建葡萄种植多模态知识图谱和构造葡萄种植问答数据集开展试验,试验结果表明:使用多模态知识图谱有助于模型获取更多信息从而达到更好的效果,在葡萄问答数据集中正确答案的平均倒数排名和平均准确率分别达到了85.02%、84.21%,与其他模型相比,平均倒数排名提高2.57个百分点,平均准确率提高了3.96个百分点。该答案选择模型利用多模态知识图谱的知识提高答案选择效果,可为搜索、问答等下游任务提供技术基础。 Answer selection is one of the most important tasks during natural language processing in the downstream tasks,such as question-answering systems,and search ranking.The most relevant answer can be selected to the given question from a candidate answer pool,which is usually regarded as a relevance ranking task.However,the current models of answer selection cannot discover the deep semantic relationships between questions and answers using the limited information in the text of the question-answer pairs.Fortunately,knowledge graph can be expected to serve as the background knowledge,in order to enhance the deep semantics of the answer selection model.It is still lacking on the multi-modal background knowledge support,because the answer selection models can rely solely on their own information.In this research,a multi-modal knowledge graph enhanced answer selection model was proposed,including the embedding layer,representation learning layer,knowledge graph enhancement layer,and output layer.Among them,the Glove model was used to obtain the word embeddings for the questionanswer texts in the embedding layer.Furthermore,a ComplEx-based method(complex embedding)was designed to learn the entity embeddings for the multi-modal knowledge graph.The image entity information was considered to extract the image feature representations using the Vision Transformer(VIT).Bi-directional long short-term memory(Bi-LSTM)was used for the representation learning of question-answer texts in the representation layer.The context-guided multi-modal knowledge graph question and answer vector representations were obtained using context-guided attention mechanism.In the knowledge graph enhancement layer,the interaction attention mechanism was used to fuse the semantic representation of the questionanswer texts with the background knowledge features that provided by the multi-modal knowledge graph,particularly for the feature representations of the multi-modal knowledge graph enhanced question and answer.The feature representations of the knowledge graph enhanced question and answer were concatenated with the additional semantic features in the output layer.The softmax function was used to predict the probability distribution of answer labels for a given question.Taking the grape planting as an example,the multi-modal entity linking was realized using the longest common subsequence algorithm.The entity recognition was also implemented to extract the knowledge using the Bert-LSTM-CRF framework and Bert pre-training model.The reference of knowledge graph was collected from the literature and experts.Finally,a multi-modal knowledge graph was constructed in the grape planting field.A grape planting question and answer dataset was also constructed using grape forums,smart agricultural platforms,agricultural managers,and agricultural benefit networks as data sources,followed by text cleaning and dataset expansion.Experimental results show that the better performance of the model was achieved to obtain more information using the multi-modal knowledge graphs.Specifically,the mean reciprocal rank and mean average precision reached 85.02%and 84.21%,respectively,in the grape question answering dataset.The mean reciprocal rank and mean average precision increased by 2.57 and 3.96 percentage points,respectively.The answer selection model with the knowledge of multi-modal knowledge graph can be expected to improve the better performance of answer selection model.The embedding representation with attention mechanism can be utilized to enhance the background knowledge from the multimodal knowledge graph.The finding can provide a technical basis for the downstream applications of multi-modal knowledge graphs,such as the search and question answering.

作者杨硕李书琴 YANG Shuo;LI Shuqin(College of Information Engineering,Northwest A&F University,Yangling 712100,China)

机构地区西北农林科技大学信息工程学院

出处《农业工程学报》 EI CAS CSCD 北大核心 2023年第14期207-214,共8页 Transactions of the Chinese Society of Agricultural Engineering

基金中央高校基本科研业务专项资金(2452019064)。

关键词农业知识图谱葡萄种植答案选择多模态图谱表示自然语言处理 agriculture knowledge graph grape cultivation answer selection multi-modal graph representation natural language processing(NLP)

分类号 TP391 [自动化与计算机技术—计算机应用技术] S24 [农业科学—农业电气化与自动化]

引文网络
相关文献

参考文献9

1Bo Zhang,Haowen Wang,Longquan Jiang,Shuhan Yuan,Meizi Li.A Novel Bidirectional LSTM and Attention Mechanism Based Neural Network for Answer Selection in Community Question Answering[J].Computers, Materials & Continua,2020(3):1273-1288. 被引量：3
2徐艳蕾,孔朔琳,陈清源,高志远,李陈孝.基于Transformer的强泛化苹果叶片病害识别模型[J].农业工程学报,2022,38(16):198-206. 被引量：11
3吕志远,张付杰,魏晓明,黄媛,李晶晶,张钟莉莉.采用组合增强的YOLOX-ViT协同识别温室内番茄花果[J].农业工程学报,2023,39(4):124-134. 被引量：8
4刘巨升,杨惠宁,孙哲涛,杨鹤,邵立铭,于红,张思佳,叶仕根.面向知识图谱构建的水产动物疾病诊治命名实体识别[J].农业工程学报,2022,38(7):210-217. 被引量：10
5金宁,赵春江,吴华瑞,缪祎晟,李思,杨宝祝.基于BiGRU_MulCNN的农业问答问句分类技术研究[J].农业机械学报,2020,51(5):199-206. 被引量：21
6牛夏牧,焦玉华.感知哈希综述[J].电子学报,2008,36(7):1405-1411. 被引量：98
7任媛,于红,杨鹤,刘巨升,杨惠宁,孙哲涛,张思佳,刘明剑,孙华.融合注意力机制与BERT+BiLSTM+CRF模型的渔业标准定量指标识别[J].农业工程学报,2021,37(10):135-141. 被引量：20
8万鹏,赵竣威,朱明,谭鹤群,邓志勇,黄毓毅,吴文锦,丁安子.基于改进ResNet50模型的大宗淡水鱼种类识别方法[J].农业工程学报,2021,37(12):159-168. 被引量：28
9王会勇,论兵,张晓明,孙晓领.基于联合知识表示学习的多模态实体对齐[J].控制与决策,2020,35(12):2855-2864. 被引量：16

二级参考文献107

1宋枫溪,高林.文本分类器性能评估指标[J].计算机工程,2004,30(13):107-109. 被引量：33
2向晓雯,史晓东,曾华琳.一个统计与规则相结合的中文命名实体识别系统[J].计算机应用,2005,25(10):2404-2406. 被引量：37
3王甦汪安圣.认知心理学[M].北京:北京大学出版社,1992..
4A W M Smeulders, et al. Content-based image retrieval at the end of the early years[ J] .IEEE Transactions on Pattern Analysis and Machine Intelligence,2000, 22(12) : 1349 - 1380.
5B B Zhu,M D Swanson, A H Tewfik.When seeing isn't believing[ J] .IEEE Signal Processing Magazine,2004,21 (2):40 - 49.
6H G Schaathun. On watermarking/fingerprinting for copyright protection[ A]. Proc. of 1st International Conference on Innovative Computing, Infonnation and Control (ICICIC) [ C .]. Beijing: IEEE, 2006. (3) :50 - 53.
7J Haitsma, T Kalker. A highly robust audio fingerprinting system[A]. Proc of 3rd International Conference on Music Informarion Retrieval(ISMIR) [ C ]. Paris: IRCAM, 2002.107 - 115.
8P Cano, E Batlle, T Kalker, J Haitsma. A review of audio fingerprinting [ J ]. Journal of VLSI Signal Processing, 2005,41 : 271 - 284.
9H Ozer, B Sankur, N Memon, E Anarim. Perceptual audio hashing functions[ J]. EURASIP Journal on Applied Signal Processing, 2005,12:1780- 1793.
10http://isis. poly. edu/index. php? page = 1&project = 1094.

共引文献199

1杨学磊.牛羊食毛症的发病成因及防治策略[J].新农民,2024(8):117-119.
2王婷婷,苗琳,吴钰,刘秀磊.基于表示学习的实体对齐技术研究综述[J].电子测试,2023(1):60-68.
3王军龙,宣魁,熊海涛,王峰,李娟.基于改进ResNeXt50残差网络的锦鲤选美方法[J].农业机械学报,2023,54(S01):330-337. 被引量：2
4刘世晶,刘阳春,钱程,郑浩君,周捷,张成林.基于CycleGAN和注意力增强迁移学习的小样本鱼类识别[J].农业机械学报,2023,54(S01):296-302. 被引量：3
5李林,刁磊,唐詹,柏召,周晗,郭旭超.基于BERT_Stacked LSTM的农业病虫害问句分类方法[J].农业机械学报,2021,52(S01):172-177. 被引量：6
6王永康,艾山·吾买尔,顾亚东,何江涛.TransREF:一种改进的基于邻域信息的知识表示模型[J].电子测量技术,2023,46(21):7-15.
7韩琦,王志芳,牛夏牧,李琼.针对索引图像的人脸区域分级加密算法[J].电子学报,2008,36(B12):25-29. 被引量：2
8王阿川,陈海涛.基于离散余弦变换的鲁棒感知图像哈希技术[J].中国安全科学学报,2009,19(4):91-96. 被引量：9
9刘亚多,李伟,李晓强,汪竹蓉,冯瑞.压缩域鲁棒音乐指纹算法研究[J].电子学报,2010,38(5):1172-1176. 被引量：9
10古今,郭立,梁惠,程龙.一种高效鲁棒的语音感知认证算法[J].小型微型计算机系统,2010,31(7):1461-1465. 被引量：1

同被引文献73

1李书琴,张明美,刘斌.融合字词语义信息的猕猴桃种植领域命名实体识别研究[J].农业机械学报,2022,53(12):323-331. 被引量：5
2张博凯,李想.基于知识图谱的Android端农技智能问答系统研究[J].农业机械学报,2021,52(S01):164-171. 被引量：11
3张海瑜,陈庆龙,张斯静,张子怡,杨帆,李鑫星.基于语义知识图谱的农业知识智能检索方法[J].农业机械学报,2021,52(S01):156-163. 被引量：12
4朱颢东,钟勇.结合优化的文档频和LSA的特征选择方法[J].计算机工程与应用,2009,45(34):121-123. 被引量：1
5黄承慧,印鉴,侯昉.一种结合词项语义信息和TF-IDF方法的文本相似度量方法[J].计算机学报,2011,34(5):856-864. 被引量：221
6谢能付.Research on Agricultural Ontology and Fusion Rules Based Knowledge Fusion Framework[J].Agricultural Science & Technology,2012,13(12):2638-2641. 被引量：1
7范晨熙,黄理灿,李雪利.基于Lucene的BM25模型的评分机制的研究[J].工业控制计算机,2013,26(3):78-79. 被引量：15
8王艺,王英,原野,郭云龙,张自力,邓烈,李莉.基于语义本体的柑橘肥水管理决策支持系统[J].农业工程学报,2014,30(9):93-101. 被引量：11
9王超,李书琴,肖红.基于文献的农业领域本体自动构建方法研究[J].计算机应用与软件,2014,31(8):71-74. 被引量：11
10牟向伟,陈燕,曹妍.农产品冷链HACCP管理体系知识建模与推理[J].农业工程学报,2016,32(2):300-308. 被引量：22

引证文献4

1王元胜,吴华瑞,赵春江.农业知识驱动服务技术革新综述与前沿[J].农业工程学报,2024,40(7):1-16. 被引量：1
2侯琛,牛培宇.农业知识图谱技术研究现状与展望[J].农业机械学报,2024,55(6):1-17. 被引量：2
3杨民安,孙雨,王凤超,杨晶,陈进.基于人工智能的小麦高效育种信息交互系统构建[J].农业工程学报,2024,40(13):117-123.
4单源源,李书琴.融合关系上下文与路径的茶叶种植知识图谱关系补全模型[J].农业工程学报,2024,40(13):171-178.

二级引证文献3

1易文龙,张丽,刘木华,程香平.特色农产品销售评价大数据的弱监督分析方法[J].农业工程学报,2024,40(12):183-192.
2牛培宇,侯琛.基于文本数据增强的中文水稻育种问句命名实体识别[J].农业机械学报,2024,55(8):333-343.
3张宇芹,朱景全,董薇,李富忠,郭雷风.农业垂直领域大语言模型构建流程和技术展望[J].农业大数据学报,2024,6(3):412-423.

1陈乐乐,张雄伟,孙蒙,张星昱.融合梅尔谱增强与特征解耦的噪声鲁棒语音转换[J].声学学报,2023,48(5):1070-1080. 被引量：1
2卜祥鹏,王海军.基于知识增强学习的视觉语言导航[J].电脑编程技巧与维护,2023(9):131-134.
3孙基航,胡艳丽,唐九阳.基于知识感知提示与对比调优的事件元素抽取方法[J].火力与指挥控制,2023,48(10):109-115.
4王杰.修齐治平、关注当下的政治智慧(十):善于学习,终身学习(上)[J].月读,2023(8):62-70.
5张潇,刘渊.结合用户视角的知识图注意力网络推荐算法[J].计算机工程与应用,2023,59(17):123-131. 被引量：1
6周才华,王博,柯熊钢,郝鹏,杜凯繁.面向大型承载结构强度性能的试验系统高精度装配调控方法[J].宇航总体技术,2023,7(5):72-82.
7许丽华.浅谈小学英语词汇教学的技巧和方法[J].今天,2023(23):150-151.
8武志平.初中化学问题情境教学的方法探索[J].炫动漫,2023(10):37-39.
9吴雅玲.听唱玩创,音阶训练的四个台阶——小学音乐课堂音阶训练的现象与改进策略[J].大众文摘,2022(51):166-168.
10安李云,李建友,曾杰宏,王胜.自拟方通脉颗粒联合利伐沙班在下肢肌间静脉血栓中的应用效果[J].中外医学研究,2023,21(28):55-58.

农业工程学报

2023年第14期

浏览历史

内容加载中请稍等...

多模态知识图谱增强葡萄种植问答对的答案选择模型被引量：4

参考文献9

二级参考文献107

共引文献199

同被引文献73

引证文献4

二级引证文献3

相关作者

相关机构

相关主题

浏览历史

多模态知识图谱增强葡萄种植问答对的答案选择模型 被引量：4

参考文献9

二级参考文献107

共引文献199

同被引文献73

引证文献4

二级引证文献3

相关作者

相关机构

相关主题

浏览历史

多模态知识图谱增强葡萄种植问答对的答案选择模型被引量：4