AGOUTI: improving genome assembly and annotation using transcriptome data
Abstract
Summary
Many current genome assemblies consist of thousands of contigs. These incomplete and fragmented assemblies lead to errors in gene identification, such that single genes spread across multiple contigs are annotated as separate gene models. We present AGOUTI (Annotated Genome Optimization Using Transcriptome Information), a tool that uses RNA-seq data to simultaneously combine contigs into scaffolds and fragmented gene models into single models. We show that AGOUTI improves both the contiguity of genome assemblies and the accuracy of gene annotation, providing updated versions of each as output.
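The idea of using RNA-seq data to connect contigs can be illustrated with a minimal sketch. It assumes, as a simplification not stated in this summary, that a read pair whose two mates map to different contigs is evidence that those contigs belong together. The function name `count_contig_links`, the `min_support` threshold, and the input layout are hypothetical and are not taken from the AGOUTI code.

```python
from collections import Counter


def count_contig_links(read_pairs, min_support=2):
    """Count read pairs whose two mates map to different contigs.

    read_pairs: iterable of (contig_of_mate1, contig_of_mate2) tuples.
    min_support: hypothetical threshold on the number of supporting pairs.
    Returns a dict mapping a sorted contig pair to its supporting-pair count.
    """
    links = Counter()
    for contig_a, contig_b in read_pairs:
        if contig_a != contig_b:
            # Sort the names so (c1, c2) and (c2, c1) are counted together.
            links[tuple(sorted((contig_a, contig_b)))] += 1
    # Keep only contig pairs supported by enough read pairs.
    return {pair: n for pair, n in links.items() if n >= min_support}


# Toy input: two pairs link contig1 and contig2, one pair links contig3 and contig4.
pairs = [
    ("contig1", "contig2"),
    ("contig2", "contig1"),
    ("contig1", "contig1"),
    ("contig3", "contig4"),
]
supported = count_contig_links(pairs)
```

With the toy input, only the contig1/contig2 link passes the threshold. A real scaffolder would also need mate orientation, mapping quality, and the positions of gene models on each contig, none of which are modelled here.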
Availability
The software is implemented in Python and is available from github.com/svm-zhang/AGOUTI.
Contact
<email>simozhan@indiana.edu</email>
Supplementary information
Supplementary data are available at Bioinformatics online.