The 1001G+ project: A curated collection ofArabidopsis thalianalong-read genome assemblies to advance plant research

This article has 0 evaluations Published on
Read the full article Related papers
This article on Sciety

Abstract

Arabidopsis thalianawas the first plant for which a high-quality genome sequence became available. The publication of the first reference genome sequence almost 25 years ago was already accompanied by genome-wide data on sequence polymorphisms in another accession, or naturally occurring strain. Since then, inventories of genome-wide diversity have been generated at increasingly precise levels. High-density genotype data forA. thaliana, including those from the 1001 Genomes Project, were key to demonstrating the enormous power of GWAS in inbred populations of wild plants, and the comparison of intraspecific polymorphism with interspecific divergence has illuminated many aspects of plant genome evolution. Over the past decade, an increasing number of nearly complete genome sequences have been published for many more accessions. Here, we highlight the diversity of a curated collection of previously published and so far unpublished genome sequences assembled using different types of long reads, including PacBio Continuous Long Reads (CLR), PacBio High Fidelity (HiFi) reads, and Oxford Nanopore Technologies (ONT) reads. This 1001 Genomes Plus (1001G+) resource is being made available at<ext-link xmlns:xlink="http://www.w3.org/1999/xlink" ext-link-type="uri" xlink:href="http://1001genomes.org">http://1001genomes.org</ext-link>. We invite colleagues with yet unpublished genome assemblies fromA. thalianaaccessions to contribute to this effort.

Related articles

Related articles are currently not available for this article.