期刊文献+

采用BWT的多核并行的子串匹配算法

Multi-core Parallel Substring Matching Algorithm Using BWT
下载PDF
导出
摘要 针对P-BWT精确匹配算法存在只支持短串查询并且只能工作在单处理器上的问题,提出了一个多核并行的支持任意查询长度的精确查询算法.改进了P-BWT索引上的查询过程,当一个查询串跨越了多个数据分片时,首先在其匹配的最后一个分片上查询,然后依次在前面分片上进行验证.进一步提出了一个多核并行查询算法来减少搜索和验证过程的迭代次数.实验结果表明,所述算法可以高效并行地完成子串匹配任务. In order to solve the problem that P-BWT (Burrows-Wheeler transform) could only support short queries, and work on a uniprocessor, a multi-core parallel exact matching algorithm was proposed which any query length could be supposed. Firstly, the search process on P-BWT index was modified. When a query spans multiple data fragments, it first searches on the last segment, then verifies on the other segments. Further, a parallel algorithm was proposed to reduce the iterations in the search and verify process. Finally, the experimental study show that using the proposed algorithm, the substring matching task could be accomplished efficiently in parallel manner.
出处 《东北大学学报(自然科学版)》 EI CAS CSCD 北大核心 2016年第5期624-628,共5页 Journal of Northeastern University(Natural Science)
基金 国家自然科学基金资助项目(61322208 61272178 61129002 61572122 61532021) 教育部高等学校博士学科点专项科研基金资助项目(20110042110028)
关键词 BWT 全文索引 精确匹配 并行 多核 BWT ( Burrows-Wheeler transform) full text index exact matching parallel multi-core
  • 相关文献

参考文献11

  • 1Navarro G, Makinen V. Compressed full-text indexes [ J ]. ACM Computing Surveys (CSUR) ,2007,39( 1 ) :2 -50.
  • 2Yang X, Wang B, Li C, et al. Efficient direct search on compressed genomic data [ C ]//IEEE 29th International Conference on Data Engineering (ICDE). Brisbane, 2013 : 961 - 972.
  • 3Li H,Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform [ J ]. Bioinformatics, 2009, 25 (14) :1754 - 1760.
  • 4Li R, Yu C, Li Y, et al. SOAP2 : an improved ultrafast tool for short read alignment [ J ]. Bioinformatics, 2009,25 ( 15 ) : 1966 - 1967.
  • 5Wandelt S, Deng D, Gerdjikov S, et al. State-of-the-art in string similarity search and join[ J]. ACM SIGMOD Record, 2014,43 ( 1 ) :64 - 76.
  • 6Jiang Y, Li G, Fang J, et al. String similarity joins: an experimental evaluation [ J ]. Proceedings of the VLDB Endowment,2014,7 ( 8 ) :625 - 636.
  • 7Ferragina P, Manzini G. Opportunistic data structures with applications [ C]//Proceedings 41st Annual Symposium on Foundations of Computer Science. Redondo, 2000:390 - 398.
  • 8Manzini G, Ferragina P. Engineering a lightweight suffix array construction algorithm[ J]. Algorithmica,2004,40( 1 ) : 33 - 50.
  • 9Crauser A, Ferragina P. A theoretical and experimental study on the construction of suffix arrays in external memory [ J ]. Algorithmica ,2002,32( 1 ) : 1 - 35.
  • 10Dementiev R,Karkkainen J, Mehnert J, et al. Better external memory suffix array construction [ J ]. Journal of Experimental Algorithmics,2008,12 ( 1 ) :3 - 27.

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部