期刊文献+

蛋白质编码区的Takagi-Sugeno模糊模型辨识 被引量:1

Prediction of protein coding regions by Takagi-Sugeno model
下载PDF
导出
摘要 DNA序列编码区的辨识是基因辨识的一个重要方面。由于基因序列数据量大,导致许多统计辨识算法泛化性差、运算速度慢。根据编码区域序列和非编码区域序列相比有不同的碱基组成,提出将Takagi-Sugeno模型用于DNA序列的编码区辨识。首先,用基于模糊似然函数的模糊聚类算法确定系统的模糊划分数目,进而根据聚类个数建立相应的Takagi-Sugeno局部线性化模型,最后用最小二乘法实现模型结论参数的辨识。该算法不仅可以确定编码区的位置,还可以辨识出密码子第一位碱基的位置,对蛋白质结构的研究是非常重要的。算法简单、高效。仿真结果表明,该算法非常适合编码区辨识和其他编码区辨识算法有可比性。 An important step in gene identification is to predict coding regions in DNA sequence.Due to the large volume of gene data leading to the problem of poor generalization capability and lower computing speed in many algorithms of prediction of coding region.In this paper,a Takagi-Sugeno model of DNA sequence is built based on the different composition of nucleotides in coding regions and non-coding regions.First,the system is quickly divided into several fuzzy parts using clustering algorithm based on the fuzzy likelihood function.Then,regarding cluster number as a rule number,Takagi--Sugeno fuzzy model has been built.Finally,the consequent parameters of the model are identified associating with LS.The algorithm not only can predict coding regions,but also can identify the first nueleotide of the codon in coding regions.This is very significant for accurate translatiorl into a protein sequence.The algorithm is simple and simulation results show the proposed method is more effective for coding regions prediction than the existing coding region discovery tools.
作者 郭烁 朱义胜
出处 《计算机工程与应用》 CSCD 北大核心 2009年第26期216-219,共4页 Computer Engineering and Applications
基金 国家自然科学基金No.60671061 助教校中青年科研启动基金资助项目(沈阳化工学院)No.2 00424~~
关键词 DNA序列编码区 密码子 TAKAGI-SUGENO模糊模型 模糊聚类 最小二乘法 coding region in DNA sequence codon Takagi-Sugeno model clustering algorithm Least Square(LS)
  • 相关文献

参考文献13

  • 1Hatzigeorgiou A, Mache N, Reczko M.Functional site prediction on the DNA sequence by artificial neural networks[C]//Proceedings of the 1996 IEEE International Joint Symposia on Intelligence and Systems, 1996,7(96) : 12-16.
  • 2Cai Y D,Bork P.Homology-based gene prediction using neural nets[J].Analytical Biochemistry, 1998,265(2):269-274.
  • 3Emmersen J,Rudd S.Separation of sequences from host-pathogen interface using triplet nucleotide frequencies[J].Fungal Genetics and Biology, 2007,44(27) :231-241.
  • 4Brejova B,Brown D G,Vinar T.The most probable annotation problem in HMMs and its application to bioinformatics[J].Journal of Computer and System Sciences,2007,73(7):1060-1077.
  • 5Yin M M,Wang J T L.GeneScout:A data mining system for predicting vertebrate genes in genomic DNA sequences[J].Infornaation Sciences, 2004,163 ( 1/3 ):201-218.
  • 6Vaidyanathan P P,Yoon B J.Digital filters for gene prediction applications[C]//IEEE Asilomar Conference on Signal,Systems and Computers.Monterey,CA:IEEE Signal Processing Society,2002:306-310.
  • 7田元新,陈超,邹小勇,邱建丁,蔡沛祥,莫金垣.外显子周期三行为特征的研究[J].化学学报,2005,63(13):1215-1219. 被引量:16
  • 8Takagi T,Sugeno M.Fuzzy identification of systems and its application to modeling and control[J].IEEE Trans on Systems,Manand Cybernetics, 1985,15( 1 ) : 116-132.
  • 9曾凡锋,蔡自兴,马润津.基于模糊似然函数的模糊辨识方法[J].控制与决策,1998,13(5):581-584. 被引量:16
  • 10郭烁,李平.模糊聚类与最小二乘相结合建立非线性系统模型[J].模式识别与人工智能,2003,16(3):288-291. 被引量:7

二级参考文献31

  • 1睢刚,陈来九.动态系统模糊模型辨识及其自学习算法[J].自动化学报,1995,21(6):749-753. 被引量:5
  • 2尚修刚,蒋慰孙.一种新的模糊似然函数[J].模式识别与人工智能,1997,10(1):9-14. 被引量:8
  • 3廖俊,朱世强,林建亚,任德祥.遗传算法在T-S模糊模型辨识中的应用[J].信息与控制,1997,26(2):140-145. 被引量:11
  • 4Takagi T, Sugeno M. Fuzzy Identification of Systems and Its Application to Modeling and Control. IEEE Trans on Systems, Man and Cybernetics, 1985, 15(1): 116- 132.
  • 5Chen Weixu, Yong Zailu. Fuzzy Model Identification and Self-learning for Dynamic Systems. IEEE Trans on System, Man and Cybernetics, 1987, 17(4): 683-689.
  • 6Liang Wang. Complex Systems Modeling via Fuzzy Logic. IEEE Trans on System, Man and Cybernetics, 1996, 26(1) : 100 - 106.
  • 7张化光,复杂系统的模糊辨识与模糊自适应控制,1993年
  • 8Sugeno M,Fuzzy Sets Syst,1988年,28卷,1期,15页
  • 9吴乃虎.基因工程原理(上册):第2版[M].北京:科学出版社,2002.10-12.
  • 10Tiwari, S.; Ramachandran, S.; Bhattacharya, A.; Bhatta-charya, S.; Ramaswamy, R. CABIOS, Comput. Appl. Biosci.1997, 13(3), 263.

共引文献48

同被引文献19

  • 1贺文强,苗果园,张永清,高志强.山西省小麦品质区划研究[J].山西师范大学学报(自然科学版),2006,20(2):82-84. 被引量:9
  • 2潘洁,戴廷波,姜东,朱艳,曹卫星.基于气候因子效应的冬小麦籽粒蛋白质含量预测模型[J].中国农业科学,2005,38(4):684-691. 被引量:16
  • 3王绍中,李春喜,章练红,崔转玲.小麦品质生态及品质区划研究 Ⅰ.河南省小麦品质现状及地区差异[J].河南农业科学,1995,24(10):3-10. 被引量:37
  • 4Han Jiawei,Kamber M.Data mining concepts and techniques[M].范明,孟小峰,译.2版.北京:机械工业出版社,2007.
  • 5Makrehchi M, Kamel M S.Text classification using small num- ber of features[C]//Pemer P, Imiya A.Proc of the 4th Int'l Conf on Machine Learning and Data Mining in Pattern Recognition,2005:580-589.
  • 6Daniel C, Triboy E.Changes in wheat protein aggregation during grain development: effects of temperatures and water stress[J].Eu- ropean Journal of Agronomy,2002,16:1-12.
  • 7Bradley P S, Fayyad U M.Refining initial points for k-means clustering[C]//Proc of the 15th Intemet Conf on Machine Learn- ing.San Francisco: Morgan Kaufmann Publishers, 1998: 91-99.
  • 8中华人民共和国国家标准.GB/T17892-1999优质小麦强筋小麦[S].北京:国家质量技术监督局,1999.
  • 9中华人民共和国国家标准.GB/T17893-1999优质小麦弱筋小麦[S].北京:国家质量技术监督局,1999.
  • 10王玲,薄列峰,焦李成.密度敏感的谱聚类[J].电子学报,2007,35(8):1577-1581. 被引量:61

引证文献1

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部