摘要
With the development of genome sequencing for many organisms, more and moreraw sequences need to be annotated. Gene prediction by computational methods for finding thelocation of protein coding regions is one of the essential issues in bioinformatics. Two classes ofmethods are generally adopted: similarity based searches and ab initio prediction. Here, we reviewthe development of gene prediction methods, summarize the measures for evaluating predictor quality,highlight open problems in this area, and discuss future research directions.
With the development of genome sequencing for many organisms, more and moreraw sequences need to be annotated. Gene prediction by computational methods for finding thelocation of protein coding regions is one of the essential issues in bioinformatics. Two classes ofmethods are generally adopted: similarity based searches and ab initio prediction. Here, we reviewthe development of gene prediction methods, summarize the measures for evaluating predictor quality,highlight open problems in this area, and discuss future research directions.