Abstract
Aligning billions of reads generated by next-generation sequencing (NGS) to reference sequences, termed "mapping", is the time-consuming and computationally intensive step in most NGS applications. A fast, accurate and robust mapping algorithm is therefore highly needed. We developed the FANSe3 mapping algorithm, which can map a 30× human whole-genome sequencing (WGS) dataset within 30 min, a 50× human whole-exome sequencing (WES) dataset within 30 s, and a typical mRNA-seq dataset within seconds on a single server node, without the need for any hardware acceleration feature. As with its predecessor FANSe2, the error rate of FANSe3 can be kept as low as 10⁻⁹ in most cases, which makes it more robust than the Burrows-Wheeler transform-based algorithms. Error allowance hardly affected the identification of a driver somatic mutation in clinically relevant WGS data, and FANSe3 provided robust gene expression profiles regardless of the parameter settings and sequencer used. The novel algorithm, designed for high-performance cloud-computing infrastructures, will break the bottleneck of speed and accuracy in NGS data analysis and promote NGS applications in various fields. The FANSe3 algorithm can be downloaded from the website: http://www.chi-biotech.com/fanse3/.
Funding
This work was collectively supported by the Ministry of Science and Technology of China, National Key Research and Development Program [2018YFC0910200/2017YFA0505001].