摘要
该文把GB2 312 - 80的汉字转换为从 0至 6767的短整型数 ,这些短整型数据有一个共同的存储特点 :它们的 2字节中的高 3位 (称为冗余位 )皆为 0 .删除冗余位而重组其余位即可形成压缩文本 .这种压缩方法显然是简单、快捷、容易实现和对GB2 312 -
In this paper, the chinese characters of GB2312-80 are transformed into short integral numbers distributing from 0 to 6767. Every one of these short integral numbers is stored in a cell of two bytes, and the 3 higher bits, named redundance bits, in the cell are always zero. Omitting the redundance bits and reorganizing the others, the compression text of chinese characters is formed. The compression method is simple, quick, easy to implement and universal for all texts of chinese characters of GB2312-80.
出处
《华南师范大学学报(自然科学版)》
CAS
2001年第2期84-88,共5页
Journal of South China Normal University(Natural Science Edition)