期刊文献+

基于语义数据的药物网络模型构建与分析

Construction and Analysis of Drug Network Model Based on Semantic Data
下载PDF
导出
摘要 针对传统生物数据分析方法无法高效处理规模不断增大的生物语义数据集的现状,将基于属性共现的节点相似度算法应用于Ch EMBL数据集,构建基于药物天然产物-活性的二部图模型,应用Graphlab框架计算基于活性特征的药物天然产物相似度,并对相似度较高的药物天然产物进行活性推荐。实验结果表明,该方法能有效利用生物数据集的语义信息发现药物天然产物潜在的活性特征,从而指导药物研发早期的活性探测以及药物靶标的发现和选择过程。 For the reason that the traditional biological analysis method is unable to handle the semantic information effectively,this paper applies the node similarity algorithm based on attribute co-occurrence to ChEMBL database and constructs the bipartite graph based on nautral product and activity.Then,with the framework of Graphlab,it calculates the natural product similarity based on activity and recommends the natural products with high similarity.Experimental results show that the method can effectively use the semantic information of biological datasets to find out the potential activities of natural products,thus guiding the activity detection and drug target discovery and selection in the early stage of drug research.
作者 王爽 冯志勇
出处 《计算机工程》 CAS CSCD 北大核心 2016年第6期31-36,42,共7页 Computer Engineering
基金 国家自然科学基金资助项目(61173155 61373035) 国家"863"计划基金资助项目(2007AA01Z130 2013AA013204)
关键词 语义数据 ChEMBL数据集 活性 Graphlab并行计算 节点相似度算法 semantic data ChEMBL dataset activity Graphlab parallel computing node similarity algorithm
  • 相关文献

参考文献20

  • 1Berners-Lee T,Hendler J,Lassila O.The Semantic Web[J].Scientific American,2001,284(5):34-43.
  • 2胡鹤,刘大有,王生生.Web本体语言OWL[J].计算机工程,2004,30(12):1-2. 被引量:42
  • 3Bizer C,Heath T,Berners-Lee T.Linked Data-The Story so Far[J].International Journal on Semantic Web and Information Systems,2009,5(3):1-22.
  • 4Willighagen E L,Waagmeester A,Spjuth O,et al.The Ch EMBL Database As Linked Open Data[J].Journal of Cheminformatics,2013,5(1):1-12.
  • 5Belleau F,Nolin M A,Tourigny N,et al.Bio2RDF:Towards a Mashup to Build Bioinformatics Knowledge Systems[J].Journal of Biomedical Informatics,2008,41(5):706-716.
  • 6Samwald M,Jentzsch A,Bouton C,et al.Linked Open Drug Data for Pharmaceutical Research and Deve-lopment[J].Journal of Cheminformatics,2011,3(1):19-24.
  • 7Williams A J,Harland L,Groth P,et al.Open PHACTS:Semantic Interoperability for Drug Discovery[J].Drug Discovery Today,2012,17(21):1188-1198.
  • 8Gaulton A,Bellis L J,Bento A P,et al.Ch EMBL:A Largescale Bioactivity Database for Drug Dis-covery[J].Nucleic Acids Research,2012,40(1):1100-1107.
  • 9曹亮,王茜,卢菁.XML数据在关系数据库中存储和检索的研究和实现[J].东南大学学报(自然科学版),2002,32(1):124-127. 被引量:25
  • 10Willighagen E L,Alvarsson J,Andersson A,et al.Linking the Resource Description Framework to Chemin-formatics and Proteochemometrics[J].Journal of Biomedical Semantics,2011,2(S1):6-11.

二级参考文献38

  • 1Dean J, Ghemawat S. MapReduce: Simplified dala processing on large clusters//Proceedings of the Conference on Operating System Design and Implementation(OSDU04,). San Francisco, USA, 2004: 137-150.
  • 2Thusoo A, Sarma J S, JainN, Shao Z, Chakka P, Anthony S, Liu H, Wyckoff P, Murthy R. Hive: A warehousing solution over a map-reduce framework//Proceedings of the Conference on Very Large Databases (VLDB' 09). Lyon, France, 2009:1626-1629.
  • 3Olston C, Reed B, Srivastava U, Kumar R, Tomkins A. Pig Latin: A not-so-foreign language for data processing//Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data (SIGMOD' 08). Vancouver, BC, Canada, 2008:1099 1110.
  • 4Bu Y, Howe B, Balazinska M, Ernst M D. HaLoop.. Efficient iterative data processing on large clusters//Proceedings of the Conference on Very Large Databases (VLDB' 10). Sin gapore, 2010:285-296.
  • 5Ekanayake J, Li H, Zhang B, Gunarathne T, Bae S-H, Qiu J, Fox G. Twister: A runtime for iterative MapReduce// Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing. Chicago, Illinois, USA, 2010:810-818.
  • 6Wilson G V. Practical Parallel Programming. Cambridge, MA.. MIT Press, 1995.
  • 7Valiant L G. A bridging model for parallel computation. Communications of the ACM, 1990, 33(8): 103-111.
  • 8Dean J, Ghemawat S. MapReduce: A flexible data processing tool. Communications of the ACM, 2010, 53(1): 72-77.
  • 9Pavlo A, Paulson E, Rasin A, Abadi D J, DeWitt D J, Mad den S, Stonebraker M. A comparison of approaches to large scale data//Proceedings of the 2009 ACM SIGMOD Interna tional Conference on Management of Data (SIGMOD' 09) New York, USA, 2009:165-178.
  • 10Stonebraker M, Abadi D J, DeWitt D J, Madden S, Paulson E, Pavlo A, Rasin A. MapReduce and parallel DBMSs: Friends or foes? Communications of the ACM, 2010, 53(1) : 64-71.

共引文献109

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部