Abstract
Using information theory and statistical methods, we apply mutual information, n-tuple entropy, and conditional entropy, combined with biological characteristics, to analyze the long-range and short-range correlations in human Y chromosome palindromes. The strength of the long-range correlation, as reflected by the mutual information, follows the order P5 > P5a > P5b, where P5a and P5b are the sequences obtained from human Y chromosome palindrome 5 (P5) by replacing only the Alu repeats and all interspersed repeats, respectively, with random uncorrelated sequences. The strength of the short-range correlation, as reflected by the n-tuple entropy and the conditional entropy, follows the order P5 > P5a > P5b > random uncorrelated sequence. In other words, when the Alu repeats and then all interspersed repeats are replaced with random uncorrelated sequences, both the long-range and the short-range correlations decrease progressively, while the random uncorrelated sequence itself shows no correlation. These results indicate that a higher content of repeat sequences leads to stronger correlation between bases in the human Y chromosome. The analysis may help to deepen the understanding of the special structures of human Y chromosome palindromes.
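The three measures named in the abstract can be illustrated with a minimal Python sketch. The abstract does not give the formulas used, so the definitions below are assumptions based on the standard textbook forms: the n-tuple entropy is the Shannon entropy of overlapping n-tuples, the conditional entropy is the difference between the (n+1)-tuple and n-tuple entropies, and the mutual information at distance k compares the joint frequency of two bases k positions apart with the product of their marginal frequencies. The function names and the toy sequences are hypothetical and are not the P5, P5a, or P5b data.

```python
import math
import random
from collections import Counter


def ntuple_entropy(seq, n):
    """Shannon entropy (in bits) of the overlapping n-tuples in seq."""
    total = len(seq) - n + 1
    counts = Counter(seq[i:i + n] for i in range(total))
    return -sum(c / total * math.log2(c / total) for c in counts.values())


def conditional_entropy(seq, n):
    """Assumed definition: H(n+1) - H(n), the uncertainty of the next
    base given the preceding n bases."""
    return ntuple_entropy(seq, n + 1) - ntuple_entropy(seq, n)


def mutual_information(seq, k):
    """Mutual information (in bits) between bases separated by k positions."""
    total = len(seq) - k
    pairs = Counter((seq[i], seq[i + k]) for i in range(total))
    first = Counter(seq[:total])   # marginal counts of the first base
    second = Counter(seq[k:])      # marginal counts of the second base
    mi = 0.0
    for (a, b), c in pairs.items():
        # c * total / (first[a] * second[b]) equals p_ab / (p_a * p_b)
        mi += (c / total) * math.log2(c * total / (first[a] * second[b]))
    return mi


# Toy data only: a strictly periodic sequence versus a random one.
rng = random.Random(0)
periodic = "ACGT" * 250
shuffled = "".join(rng.choice("ACGT") for _ in range(10000))

for name, s in (("periodic", periodic), ("random", shuffled)):
    print(name, mutual_information(s, 4), conditional_entropy(s, 1))
```

On such toy input, the periodic sequence gives a large mutual information and a conditional entropy near zero, whereas the random sequence gives a mutual information near zero and a conditional entropy near the 2-bit maximum for four bases. This is the same qualitative contrast the abstract reports between repeat-rich and randomized sequences.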
Funding
This work was supported by the National Natural Science Foundation of China (Nos. 20173023 and 90203012) and the Specialized Research Fund for the Doctoral Program of Higher Education of China (No. 20020730006).