摘要
奶牛生产性状中产奶量一直是相关领域专家重点关注的性状之一,产奶量的提高对于经济和民生发展具有重要意义,因此产奶量的相关影响因素成为了研究提高奶产量和奶质量的焦点。针对目前缺少奶牛产奶量-基因知识图谱的问题,以PubMed生物医学文献库为相关来源,利用爬虫技术构建奶牛产奶量性状组学数据的相关文献数据集,通过知识抽取获得与奶牛产奶量相关的大约140个基因影响因素及其他影响因素。利用Neo4j图数据库的方式进行数据存储,构建与奶牛产奶量性状相关的知识图谱,最后形成奶牛产奶量性状数据知识库的可视化平台。
Milk yield has always been one of the traits that experts pay attention to in dairy cow production traits.The improvement of milk yield is of great significance to the development of economy and people’s livelihood.Therefore,the research on the influencing factors of milk yield has become the focus of improving milk yield and milk quality.In view of the lack of cow milk yield gene knowledge graph,it firstly uses the crawler technology based on PubMed biomedical literature database to construct the related literature data set of milk yield trait omics data of dairy cows.Then,the relevant influencing factors of cow milk yield are extracted based on knowledge extraction technology,including 140 gene influencing factors.Finally,the Neo4j graph database is used for data storage,the knowledge graph related to milk yield traits of dairy cows is constructed,and the visual platform of milk yield traits data knowledge base is formed.
作者
胡浩
高静
刘振羽
HU Hao;GAO Jing;LIU Zhenyu(College of Computer and Information Engineering,Inner Mongolia Agricultural University,Hohhot 010018,China;Inner Mongolia Autonomous Region Key Laboratory of Big Data Research and Application of Agriculture and Animal Husbandry,Hohhot 010018,China)
出处
《计算机工程与应用》
CSCD
北大核心
2023年第2期299-305,共7页
Computer Engineering and Applications
基金
内蒙古自然科学基金(2019MS03014)
内蒙古自治区科技重大专项(2019ZD016)
国家自然科学基金(61462070)
内蒙古自治区科学技术厅项目(2020ZD0007)。
关键词
文献挖掘
知识图谱
奶牛产奶量
知识抽取
literature mining
knowledge graph
milk yield
knowledge extraction