期刊文献+

基于加权Context建模的DNA序列压缩算法 被引量:3

A Genome Sequence Compression Algorithm Based on the Weighted Context Modeling
下载PDF
导出
摘要 给出一种基于自适应Context加权的细菌DNA序列压缩算法.不同阶数的Context模型用于描述碱基符号间的关联程度.通过加权的方式将各阶模型进行组合,构建用于驱动算术编码器的条件概率分布.各阶模型对应权值由其相应自适应码长决定.在编码过程中,权值能够根据各阶模型获得的统计计数值自适应更新.实验结果表明,该方法能够获得比其他加权Context建模基因组序列压缩算法更好的压缩效率. A bacteria genome compression algorithm based on the adaptive weighted context model is present. The context model with different order is used to describe the relation degree of basic group code. The context models are combined by weighting to constitute the conditional probability distribution to drive arithmetic coder and the values of these weights are determined by the corresponding adaptive code length. In the coding process,the values of these weights are adaptively updated according to the statistic count value acquired by the context model. The experimental results indicate that the algorithm presented could produce better compression result than the results by other algorithms.
出处 《昆明学院学报》 2014年第3期81-84,共4页 Journal of Kunming University
基金 云南省自然科学基金青年基金资助项目(2013FD042)
关键词 Context建模 DNA序列压缩 自适应码长 context modeling genome compression weighted context modeling adaptive code length
  • 相关文献

参考文献9

  • 1GRUMBACH S, TAHI F. Compression of DNA sequences [ C ]//Data Compression Conference. Utah : Snowbird, 1993 : 340 - 350.
  • 2GRUMBACH S, TAHI F. A new challenge for compression algo- rithms: genetic sequences [ J ]. Information Processing & Manage- ment, 1994,30 (6) :866 - 875.
  • 3CHEN X, KWONG S, LI M. A compression algorithm fur DNA se- quences and its applicationsin genome comparison [ J]. Genome Inform Ser Workshop Genome Inform, 1999,10:51 - 61.
  • 4MATSUMOTO T, SADAKANE K, IMAI H. Biological sequence compression algorithms[ J]. Genome Informatics,2000,11:43 - 52.
  • 5CAO M D, DIX T I,ALLISON L,et al. A simple statistical algorithm for biological sequence compression [ C ]//Data Compression Confer- ence. Utah : Snowbird ,2007:43 - 52.
  • 6PINHO A J,PRATAS D, FERREIRA P J S. Bacteria DNA sequence compression using a mixture of finite-context models[ J ]. IEEE Statis- tical Signal Processing Workshop ,2011,12 : 125 - 128.
  • 7陈旻,王开云,薛洁,罗迪.一种图像自适应小波压缩算法[J].昆明学院学报,2013,35(6):96-99. 被引量:5
  • 8杨亚彪,陈旻,王付艳,蔡杰.基于贝叶斯估计的Context量化器设计方法[J].昆明学院学报,2013,35(3):79-82. 被引量:3
  • 9NCBI. Center for biotechnology information [ EB/OL ]. [ 2014 - 03 - 25]. http://www, ncbi. nih. gov/genomes/Bacteria/.

二级参考文献16

  • 1RISSANEN J. Universal coding, information, prediction, and estima- tion [ J ]. Information Theory, 1984,30 (4) :629 - 636.
  • 2RISSANEN J. A universal data compression system [ J ]. Information Theory,1983,29(5) :656 -664.
  • 3RISSANEN J, FEDER M. A universal finite memory source [ J ]. Infor- mation Theory, 1995,41 (3) :643 - 652.
  • 4CHEN Jian-hua. Context modeling based on context quantization with application in wavelet image coding[ J]. IEEE Transactions on Image Processing,2004,13 ( 1 ) :26 - 32.
  • 5WU Xiao-lin, CHOU P A, XUE Xiao-hui. Minimum conditional entropy context quantization [ J ]. Information Theory,2000,60(2) :43 - 53.
  • 6FORCHHAMMER S,WU X. Context quantization by minimum adap- tive code length [ C ]//Proceedings of IEEE International Symposium on Information Theory,2007:246 - 250.
  • 7RISSANEN J. A universal data compression system [ J ]. IEEE Trans- actions on Information Theory, 1983,29 (5) :656 - 664.
  • 8CHEN Min,WANG Fu-yan. Context quantization based on the modi- fied K-means clustering [ J]. Advanced Materials Research, 2013, 756:4068 - 4072.
  • 9CHEN Min,CHEN Jian-hua. Affinity propagation for the Context quanti- zation [ J ]. Advanced Materials Research ,2013,791 : 1533 - 1536.
  • 10SHAPIRO J M. Embedded image coding using zerotrees of wavelets coefficients[ J]. IEEE Transactions on Signal Processing, 1993,41 (12) :3445 -3462.

共引文献6

同被引文献23

  • 1冯敏.凉山彝族服饰[J].贵州民族研究,1989,9(4):116-125. 被引量:10
  • 2GRUMBACH S,TAHIF.CompressionofDNA sequences[C]//ProcDataCompressionConference.Snowbird:IEEEComputerSociety,1993:340-350.
  • 3GRUMBACHS,TAHIF.Anewchallengeforcompressionalgorithms:Geneticsequences[J].InformationProcessing&Management,1994,30(6):866-875.
  • 4RIVALSE,DELAHAYEJP,DAUCHETM,etal.Aguaranteedcompression schemeforrepetitiveDNA sequences[C]//ProcDataCompressionConference.Snowbird:IEEEComputerSociety,1996:453-471.
  • 5CHENX,KWONGS,LIM.A compressionalgorithm forDNAsequencesanditsapplicationsingenomecomparison[C]//ProceedingsoftheFourthAnnualInternationalConferenceonComputationalMolecularBiology.New York:NY,2000:107-116.
  • 6CHENX,LIM,MAB,etal.DNAcompress:FastandeffectiveDNAsequencecompression[J].Bioinformatics,2002,18(2):1696-1698.
  • 7BEHZADIB,FESSANTFL.DNAcompressionchallengerevisited:Adynamicprogrammingapproach[J].CombinatorialPatternMatching,2005,353:190-200.
  • 8MATSUMOTO T,SADAKANE K,IMAIH.Biologicalsequencecompressionalgorithms[J].GenomeInformatics,2000,11:43-52.
  • 9TABUSI,KORODIG,RISSANENJ.DNAsequencecompressionusingthenormalizedmaxi-mumlikelihoodmodelfordiscreteregression[C]//ProcDataCompressionConference.Snowbird:IEEEComputerSociety,2003:253-263.
  • 10KORODIG,TABUSI.Anefficientnormalizedmaximumlikelihoodalgorithm forDNA sequencecompression[J].ACMTransInfSyst,2005,23(1):3-34.

引证文献3

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部