Lithuanian Central State Archive is the biggest one within the state archival service and the only state archive where audiovisual documents are stored. There are more than 800,000 units of audiovisual documents in th...Lithuanian Central State Archive is the biggest one within the state archival service and the only state archive where audiovisual documents are stored. There are more than 800,000 units of audiovisual documents in the archive. The main laws regulating the activity of Lithuanian Central State Archive and related to audiovisual archiving are the Law on Documents and Archives of Lithuanian Republic, the Law of Cinema of Lithuanian Republic, and the Law on Copyright and Related Rights of Lithuanian Republic. There are four big collections of audiovisual documents in the Lithuanian Central State Archive--films, photo documents, sound recordings, and video recordings. The Archive's specialists have a large experience in the field of physical treatment and preservation of analogue audiovisual documents. Lithuanian Central State Archive digitizes audiovisual documents seeking the balance between long time preservation and nowadays access. Since May, 2010 till April 2013, Lithuanian Central State Archive implemented the project--Lithuanian documentaries on the Internet. During the project the Archives digitized and transferred to the Internet 1,000 titles of Lithuanian documentaries, created in the period 1919-1961. Lithuanian Central State Archive wants to popularize its collections, so various international projects are participated in.展开更多
Based on variable sized chunking, this paper proposes a content aware chunking scheme, called CAC, that does not assume fully random file contents, but tonsiders the characteristics of the file types. CAC uses a candi...Based on variable sized chunking, this paper proposes a content aware chunking scheme, called CAC, that does not assume fully random file contents, but tonsiders the characteristics of the file types. CAC uses a candidate anchor histogram and the file-type specific knowledge to refine how anchors are determined when performing de- duplication of file data and enforces the selected average chunk size. CAC yields more chunks being found which in turn produces smaller average chtmks and a better reduction in data. We present a detailed evaluation of CAC and the experimental results show that this scheme can improve the compression ratio chunking for file types whose bytes are not randomly distributed (from 11.3% to 16.7% according to different datasets), and improve the write throughput on average by 9.7%.展开更多
文摘Lithuanian Central State Archive is the biggest one within the state archival service and the only state archive where audiovisual documents are stored. There are more than 800,000 units of audiovisual documents in the archive. The main laws regulating the activity of Lithuanian Central State Archive and related to audiovisual archiving are the Law on Documents and Archives of Lithuanian Republic, the Law of Cinema of Lithuanian Republic, and the Law on Copyright and Related Rights of Lithuanian Republic. There are four big collections of audiovisual documents in the Lithuanian Central State Archive--films, photo documents, sound recordings, and video recordings. The Archive's specialists have a large experience in the field of physical treatment and preservation of analogue audiovisual documents. Lithuanian Central State Archive digitizes audiovisual documents seeking the balance between long time preservation and nowadays access. Since May, 2010 till April 2013, Lithuanian Central State Archive implemented the project--Lithuanian documentaries on the Internet. During the project the Archives digitized and transferred to the Internet 1,000 titles of Lithuanian documentaries, created in the period 1919-1961. Lithuanian Central State Archive wants to popularize its collections, so various international projects are participated in.
基金Supported by the National Natural Science Foundation of China (No.60673001) the State Key Development Program of Basic Research of China (No. 2004CB318203).
文摘Based on variable sized chunking, this paper proposes a content aware chunking scheme, called CAC, that does not assume fully random file contents, but tonsiders the characteristics of the file types. CAC uses a candidate anchor histogram and the file-type specific knowledge to refine how anchors are determined when performing de- duplication of file data and enforces the selected average chunk size. CAC yields more chunks being found which in turn produces smaller average chtmks and a better reduction in data. We present a detailed evaluation of CAC and the experimental results show that this scheme can improve the compression ratio chunking for file types whose bytes are not randomly distributed (from 11.3% to 16.7% according to different datasets), and improve the write throughput on average by 9.7%.