Hot data identification is crucial for many applications though few investigations have examined the subject. All existing studies focus almost exclusively on frequency. However, effectively identifying hot data requi...Hot data identification is crucial for many applications though few investigations have examined the subject. All existing studies focus almost exclusively on frequency. However, effectively identifying hot data requires equally considering recency and frequency. Moreover, previous studies make hot data decisions at the data block level. Such a fine-grained decision fits particularly well for flash-based storage because its random access achieves performance comparable with its sequential access. However, hard disk drives (HDDs) have a significant performance disparity between sequential and random access. Therefore, unlike flash-based storage, exploiting asymmetric HDD access performance requires making a coarse-grained decision. This paper proposes a novel hot data identification scheme adopting multiple bloom filters to efficiently characterize recency as well as frequency. Consequently, it not only consumes 50% less memory and up to 58% less computational overhead, but also lowers false identification rates up to 65% compared with a state-of-the-art scheme. Moreover, we apply the scheme to a next generation HDD technology, i.e., Shingled Magnetic Recording (SMR), to verify its effectiveness. For this, we design a new hot data identification based SMR drive with a coarse-grained decision. The experiments demonstrate the importance and benefits of accurate hot data identification, thereby improving the proposed SMR drive performance by up to 42%.展开更多
A method of data processing to determine the coefficients of linearization equations for 1050 anemometer (produced by Thermo-Systems Inc. -TSI, USA) with the sensors made of domestic hot wire using the program preferr...A method of data processing to determine the coefficients of linearization equations for 1050 anemometer (produced by Thermo-Systems Inc. -TSI, USA) with the sensors made of domestic hot wire using the program preferred in this Paper is described. By calculation and test, it is indicated that the error resulting from this method is about 0. 5% of the full scale and less than TSl's. By using this method we can set up the calibration curve according to the measurement range and the diameter of the hot wire at a certain accuracy.展开更多
为了在数据密集型工作流下有效降低缓存碎片整理开销并提高缓存命中率,提出一种持久性分布式文件系统客户端缓存DFS-Cache(Distributed File System Cache)。DFS-Cache基于非易失性内存(NVM)设计实现,能够保证数据的持久性和崩溃一致性...为了在数据密集型工作流下有效降低缓存碎片整理开销并提高缓存命中率,提出一种持久性分布式文件系统客户端缓存DFS-Cache(Distributed File System Cache)。DFS-Cache基于非易失性内存(NVM)设计实现,能够保证数据的持久性和崩溃一致性,并大幅减少冷启动时间。DFS-Cache包括基于虚拟内存重映射的缓存碎片整理机制和基于生存时间(TTL)的缓存空间管理策略。前者基于NVM可被内存控制器直接寻址的特性,动态修改虚拟地址和物理地址之间的映射关系,实现零拷贝的内存碎片整理;后者是一种冷热分离的分组管理策略,借助重映射的缓存碎片整理机制,提升缓存空间的管理效率。实验采用真实的Intel傲腾持久性内存设备,对比商用的分布式文件系统MooseFS和GlusterFS,采用Fio和Filebench等标准测试程序,DFS-Cache最高能提升5.73倍和1.89倍的系统吞吐量。展开更多
为科学有效地梳理BIM(Building Information Modeling)技术发展脉络并分析BIM技术发展方向,基于Web of Science(WoS)数据库,筛选出2006—2022年BIM相关文献3085篇,以可视化文献计量分析角度,对国内外BIM发展现状及热点趋势进行总结与分...为科学有效地梳理BIM(Building Information Modeling)技术发展脉络并分析BIM技术发展方向,基于Web of Science(WoS)数据库,筛选出2006—2022年BIM相关文献3085篇,以可视化文献计量分析角度,对国内外BIM发展现状及热点趋势进行总结与分析。分析结果表明,BIM当前处于快速上升期,中、美、英、澳、韩等国处于BIM理论研究的领先地位;BIM领域的理论研究、技术研发及实践应用已形成一套循环进化生态体系。该成果可对BIM技术研究及应用方向分析提供参考。展开更多
基金This work was supported by Hankuk University of Foreign Studies Research Fund of Korea, and also partially supported by the National Science Foundation (NSF) Awards of USA under Grant Nos. 1053533, 1439622, 1217569, 1305237, and 1421913. Acknowledgment We would like to thank David Schwaderer (Samsung Semiconductor Inc., USA) for his valuable comments and proofreading.
文摘Hot data identification is crucial for many applications though few investigations have examined the subject. All existing studies focus almost exclusively on frequency. However, effectively identifying hot data requires equally considering recency and frequency. Moreover, previous studies make hot data decisions at the data block level. Such a fine-grained decision fits particularly well for flash-based storage because its random access achieves performance comparable with its sequential access. However, hard disk drives (HDDs) have a significant performance disparity between sequential and random access. Therefore, unlike flash-based storage, exploiting asymmetric HDD access performance requires making a coarse-grained decision. This paper proposes a novel hot data identification scheme adopting multiple bloom filters to efficiently characterize recency as well as frequency. Consequently, it not only consumes 50% less memory and up to 58% less computational overhead, but also lowers false identification rates up to 65% compared with a state-of-the-art scheme. Moreover, we apply the scheme to a next generation HDD technology, i.e., Shingled Magnetic Recording (SMR), to verify its effectiveness. For this, we design a new hot data identification based SMR drive with a coarse-grained decision. The experiments demonstrate the importance and benefits of accurate hot data identification, thereby improving the proposed SMR drive performance by up to 42%.
文摘A method of data processing to determine the coefficients of linearization equations for 1050 anemometer (produced by Thermo-Systems Inc. -TSI, USA) with the sensors made of domestic hot wire using the program preferred in this Paper is described. By calculation and test, it is indicated that the error resulting from this method is about 0. 5% of the full scale and less than TSl's. By using this method we can set up the calibration curve according to the measurement range and the diameter of the hot wire at a certain accuracy.
文摘为了在数据密集型工作流下有效降低缓存碎片整理开销并提高缓存命中率,提出一种持久性分布式文件系统客户端缓存DFS-Cache(Distributed File System Cache)。DFS-Cache基于非易失性内存(NVM)设计实现,能够保证数据的持久性和崩溃一致性,并大幅减少冷启动时间。DFS-Cache包括基于虚拟内存重映射的缓存碎片整理机制和基于生存时间(TTL)的缓存空间管理策略。前者基于NVM可被内存控制器直接寻址的特性,动态修改虚拟地址和物理地址之间的映射关系,实现零拷贝的内存碎片整理;后者是一种冷热分离的分组管理策略,借助重映射的缓存碎片整理机制,提升缓存空间的管理效率。实验采用真实的Intel傲腾持久性内存设备,对比商用的分布式文件系统MooseFS和GlusterFS,采用Fio和Filebench等标准测试程序,DFS-Cache最高能提升5.73倍和1.89倍的系统吞吐量。
文摘为科学有效地梳理BIM(Building Information Modeling)技术发展脉络并分析BIM技术发展方向,基于Web of Science(WoS)数据库,筛选出2006—2022年BIM相关文献3085篇,以可视化文献计量分析角度,对国内外BIM发展现状及热点趋势进行总结与分析。分析结果表明,BIM当前处于快速上升期,中、美、英、澳、韩等国处于BIM理论研究的领先地位;BIM领域的理论研究、技术研发及实践应用已形成一套循环进化生态体系。该成果可对BIM技术研究及应用方向分析提供参考。