Genomics of Genebanks
The traditional assumption in germplasm conservation is that every accession is equally important, which is primarily due to the technical challenges and costs of determining the extent of genetic variation and uniqueness of accessions that used to be higher than the costs of cold rooms for storage. The availability of low-cost, high-density and high throughput genotyping by sequencing (GBS) and DArTseq technologies are providing genetic resource scientists tremendous opportunities for improving germplasm curation; allele mining and gene discovery; generation of high-density molecular passport data; understanding the genetic profiles of the entire collection; identifying genetically unique accessions (subsets or minicore set) for promoting their use in crop improvement and genetic studies; identifying gaps in the existing collection, and correcting mislabeled and taxonomically misclassified collections.
Recently, RBCA implemented a pilot molecular characterization study by genotyping about 4,000 rice accessions with 31,739 DArTseq-based single nucleotide polymorphisms (SNPs) . DArTseq generated a smaller number of markers as compared with the initial GBS technology developed at the University of Cornell that produced up to a million SNPs. Nevertheless, the DArTseq genotype data were highly useful for a wide range of purposes, which includes (i) understanding the extent of genetic diversity, population structure and genetic differentiation of the African rice collections as well as for creating a subset for future use in genetics and breeding studies19; (ii) developing species- and subspecies- diagnostic SNP markers to minimize misclassification, misidentification and mislabeling errors during germplasm acquisition and routine genebank operations28; (iii) comparing the extent of molecular diversity indices among three rice species conserved at the AfricaRice genebank and identifying candidate genes in African rice 25; and (iv) comparing the extent of genetic variation and relatedness among various improved intraspecific and interspecific rice varieties developed by AfricaRice breeders with varieties developed by other institutions and widely used in Africa 27. Overall, the DArTseq-based molecular characterization of the rice collection was much cheaper and more accurate than the phenotypic characterization.
Based on the pilot DArTseq data, we have genotyped a total of 9,120 accessions of which 8,381 accessions are publicly available in Genesys. The genotype data along with the phenotype characterization and passport data would be highly useful for creating subsets of the most genetically diverse accessions for each species and subspecies for future evaluation, which ultimately promotes the use of our collection in rice improvement.