Analytic Tool Identifies Genetic Variation in Crop Species
A new computational tool swiftly and efficiently exposes genetic diversity within DNA databases of various plant species.

Complete the form below to unlock access to ALL audio articles.
In a major advance for agricultural science, researchers have developed a new computational tool designed to swiftly and efficiently expose genetic diversity within DNA databases of various plant species.[1]
The open-source platform is poised to accelerate the discovery of genetic variations that are key to developing crops with improved resilience, yield and nutritional value.
Harnessing advanced algorithms and the capabilities of high-performance computing (HPC), the KAUST team, led by plant genomicist Rod Wing, demonstrated the tool’s ability to detect small DNA differences — so-called single nucleotide variants (SNPs) — across various strains of rice, maize, soybean and sorghum.
In the case of the rice investigation, for instance, the team employed the tool on a complex genetic dataset of DNA sequences from thousands of distinct accessions — a comprehensive “pan-genome” that the researchers had previously helped to assemble for Asian rice (Oryza sativa). Using this dataset along with the group’s novel analytical method, the KAUST researchers uncovered more than 2 million genetic variants previously overlooked by conventional interrogations of a single reference rice genome.
This marks an initial step towards unlocking new avenues in crop enhancement and sustainable agriculture, notes plant geneticist and study co-author Yong Zhou. “These hidden SNPs could now be utilized for breeding programs immediately and also to identify novel functional genes for agricultural traits,” he says.
Want more breaking news?
Subscribe to Technology Networks’ daily newsletter, delivering breaking science news straight to your inbox every day.
Subscribe for FREEKey to the performance of the tool — named the high-performance computing genome variant calling workflow, or HPC-GVCW — is the ability to divide large chunks of the genome into discrete bits and then to rely on parallel processing technologies to solve complex computing problems on large-scale multidimensional genomics data.
“This reduces the execution time massively,” says study co-author Nagarajan Kathiresan, a computational scientist, “making it able to process 3,000 genomes within 24 hours.”
With more genomes now getting sequenced than ever before, Zhou adds, the new tool should prove invaluable for streamlining their analysis to empower next-generation crop breeding.
References: Zhou Y, Kathiresan N, Yu Z, et al. A high-performance computational workflow to accelerate GATK SNP detection across a 25-genome dataset. BMC Biol. 2024;22(1):13. doi: 10.1186/s12915-024-01820-5
Sedeek K, Mohammed N, Zhou Y, et al. Multitrait engineering of Hassawi red rice for sustainable cultivation. Plant Sci. 2024;341:112018. doi: 10.1016/j.plantsci.2024.112018
This article has been republished from the following materials. Note: material may have been edited for length and content. For further information, please contact the cited source.