3 Ruemmler C, Wilkes J. An introduction to disk drive modeling [J]. IEEE Computer, 1994, 27(3): 17-28.
4 Patterson D A, Gibson G, Katz R H. A case for redundant arrays of inexpensive disks (RAID) [C]//Proc of the SIGMOD. New York: ACM, 1988: 109-116.
5 Salem K, Garcia-Molina H. Disk striping [C]//Proc of the 2nd Int Conf on Data Engineering. Piscataway, NJ: IEEE, 1986: 336-342.
6 Kim M Y. Synchronized disk interleaving [J]. IEEE Trans on Computers, 1986, 35(11): 978-988.
7 Ruemmler C, Wilkes J. An introduction to disk drive modeling [J]. IEEE Computer, 1994, 27(3): 17-28.
8 Shan H Z, Shalf J. Using IOR to analyze the I/O performance for HPC platforms [C/OL]//Proc of the 2007 Cray User Group Conf, 2007: 1-15.
9 Fan B, Tantisiriroj W, Xiao L, et al. DiskReduce: RAID for data-intensive scalable computing [C]//Proc of the Petascale Data Storage Workshop (PDSW 2009). New York: ACM, 2009: 6-10.
10 Buyya R, Cortes T, Jin H. Petal: Distributed virtual disks [C]//Proc of the High Performance Mass Storage and Parallel I/O: Technologies and Applications. Piscataway, NJ: IEEE, 2002: 420-430.
4 MANBER U. Finding similar files in a large file system [C]// Proceedings of the Winter 1994 USENIX Technical Conference. San Francisco, CA, USA: [s.n.], 1994: 1-10.
5 BRODER A Z. On the resemblance and containment of documents [C]// Proceedings of the International Conference on Compression and Complexity of Sequences. Salerno, Italy: [s.n.], 1997: 21-29.
6 RIVEST R. The MD5 message-digest algorithm [J]. RFC 1321, Internet Engineering Task Force, 1992, 22(1): 15-26.
7 Manyika J, Chui M, Brown B, et al. Big Data: The Next Frontier for Innovation, Competition, and Productivity. McKinsey Global Institute, 2011.
8 Gray J. What Next? A Dozen Information-Technology Research Goals [Technical Report]. Microsoft Research, 1999, MS-TR-99-50.
9 Bolosky WJ, Corbin S, Goebel D, Douceur JR. Single instance storage in Windows 2000. Proc. of the 4th USENIX Windows Systems Symposium, August 2000.
10 Quinlan S, Dorward S. Venti: a new approach to archival storage. Proc. of the First USENIX Conference on File and Storage Technologies. Monterey, CA, USA, 2002.