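Abstract: This paper studies two highly influential lossless data compression algorithms, LZ78 and LZ77, and proposes an improved hybrid dictionary compression algorithm, HLZ (Hybrid LZ). HLZ combines LZ78 and LZ77 and exploits their complementary properties. When encoding text, HLZ does not emit a code as soon as it reaches the end of a phrase available in the dictionary. Instead, it compares against the sliding window: if the current string's match in the sliding window is shorter than its match in the dictionary, the LZ78 output is used; otherwise the LZ77 encoding is emitted. When decoding, HLZ builds a chain structure that links dictionary entries sharing the same first letter, which greatly reduces the time needed to find the longest matching string in the dictionary. Experimental results show that HLZ has computational and storage complexity similar to that of LZ78 and LZ77, but better global and local adaptivity and higher compression efficiency.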
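The selection rule described in this abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names, the token tuples, the `window_size` default, and the use of a per-first-character list as a stand-in for the chain structure are all hypothetical, and the sketch only makes the LZ78/LZ77 choice at a single position rather than implementing a full codec.

```python
from collections import defaultdict


def build_first_char_chains(phrases):
    """Group dictionary phrases by first character (assumed stand-in for the chain structure)."""
    chains = defaultdict(list)
    for index, phrase in enumerate(phrases):
        if phrase:
            chains[phrase[0]].append((index, phrase))
    return chains


def longest_dictionary_match(text, pos, chains):
    """Return (phrase_index, length) of the longest phrase matching text at pos, or (None, 0)."""
    best_index, best_len = None, 0
    # Only phrases that start with text[pos] are examined.
    for index, phrase in chains.get(text[pos], []):
        if len(phrase) > best_len and text.startswith(phrase, pos):
            best_index, best_len = index, len(phrase)
    return best_index, best_len


def longest_window_match(text, pos, window_size):
    """Return (offset, length) of the longest match for text[pos:] that starts inside the window."""
    best_offset, best_len = 0, 0
    for start in range(max(0, pos - window_size), pos):
        length = 0
        # A match may run past pos (overlapping copy), as in classic LZ77.
        while pos + length < len(text) and text[start + length] == text[pos + length]:
            length += 1
        if length > best_len:
            best_offset, best_len = pos - start, length
    return best_offset, best_len


def choose_token(text, pos, chains, window_size=32):
    """Pick LZ78 if the window match is shorter than the dictionary match, else LZ77."""
    phrase_index, dict_len = longest_dictionary_match(text, pos, chains)
    offset, win_len = longest_window_match(text, pos, window_size)
    if win_len < dict_len:
        return ("LZ78", phrase_index, dict_len)
    if win_len > 0:
        return ("LZ77", offset, win_len)
    # Neither source matches: fall back to a literal (an assumption of this sketch).
    return ("LITERAL", text[pos], 1)
```

With the phrases `["ab", "abc", "b"]` and the text `"abcabcabd"`, position 0 has no window history, so the dictionary phrase `"abc"` wins. At position 3 the window offers a longer, overlapping match, so the LZ77 branch is taken.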
Funding: Project supported by the National Natural Science Foundation of China (Grant Nos. 30270810, 90208022 and 30471067) and IBM Shared University Research (Life Science), China
Abstract: Mutations (substitution, deletion, insertion, etc.) in nucleic acid sequences cause the maximal length of exact match (MALE) between paralogous members arising from a duplication event to shorten during evolution. In this work, MALE changes between members of 26 gene families from four representative species (Arabidopsis thaliana, Oryza sativa, Mus musculus and Homo sapiens) were investigated. A comparative study of the paralogs' MALE and amino acid substitution rate (dA<0.5) indicated a close relationship between the two. The results suggest that MALE could serve as a sound evolutionary scale for the divergence time of paralogous genes during their early evolution. A reference table relating MALE to divergence time was set up for the four species, which should be widely useful for large-scale genome alignment and comparison. As an example, the detection of large-scale duplication events in the rice genome based on the table is illustrated.
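To make the MALE quantity concrete, the sketch below computes the length of the longest exact match between two sequences as a plain longest-common-substring problem. The function name `male` and the single-strand, character-by-character comparison are assumptions made for illustration; the abstract does not state how the authors computed MALE or which tools they used.

```python
def male(seq_a, seq_b):
    """Length of the longest exact match (common substring) between two sequences."""
    best = 0
    # prev[j] holds the length of the exact match ending at the previous
    # character of seq_a and at seq_b[j - 1].
    prev = [0] * (len(seq_b) + 1)
    for ch_a in seq_a:
        curr = [0] * (len(seq_b) + 1)
        for j, ch_b in enumerate(seq_b, start=1):
            if ch_a == ch_b:
                # Extend the match that ended one character earlier in both sequences.
                curr[j] = prev[j - 1] + 1
                if curr[j] > best:
                    best = curr[j]
        prev = curr
    return best
```

For two identical copies of `"ACGTACGT"` the function returns 8. After a single substitution in one copy (`"ACGTTCGT"`) it returns 4, which mirrors the shortening of MALE under mutation that the abstract describes. The running time is proportional to the product of the two sequence lengths, so genome-scale comparisons would need a suffix-based index instead.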