The discovery of shared biological properties among independent variants of DNA sequences offers the opportunity to broaden understanding of the biological basis of disease and identify new therapeutic targets, according to a collaboration between the Perelman School of Medicine at the University of Pennsylvania, the University of Arizona Health Sciences, and Vanderbilt University. The group published their findings this month in npj Genomic Medicine.
Drugs can have variable effects on people depending on small natural differences in DNA sequence between individuals. These genetic differences are called SNPs, or single nucleotide polymorphisms: single-letter variants in the DNA alphabet of A, T, C, and G that occur naturally among individuals. Many such SNPs have been associated with disease risk; for instance, a person with an A at a given location in the DNA sequence might have a higher risk of diabetes than someone with a G. However, these disease-related SNPs often reside in the so-called “dark matter” of the genome, which does not directly code for proteins but does include switches that control gene expression.
Over the last ten years, researchers have conducted genome-wide association studies (GWAS), which map DNA variants across thousands of individuals’ genomes to find which variants are more frequent in people with a certain disease. For common, complex diseases such as diabetes or cancer, GWAS have identified hundreds of such variants. However, GWAS have also found that many disease-associated variants do not alter the function of genes in an obvious way, which makes them difficult to interpret clinically.
Senior author Jason H. Moore, PhD, the Edward Rose Professor of Informatics and director of the Institute for Biomedical Informatics, and colleagues Yves A. Lussier, Haiquan Li, Ikbel Achour, and Joshua C. Denny have developed a computational method that explores the downstream effects of variants associated with disease risk to reveal possible mechanisms of disease.
“Our results provide a ‘roadmap’ of disease mechanisms emerging from GWAS to identify candidate therapeutic targets,” Moore said.
In the current paper, the team demonstrated that variants associated with disease risk can affect biological activities such as gene expression and the function of proteins in the cell’s housekeeping machinery.
“Taking this all together, a more comprehensive picture of disease biology is emerging,” Moore said. “This picture – up to now – has been blurry, especially when variants occurred between genes.”
The team used computational modelling of two million pairs of disease-associated SNPs drawn from three GWAS projects, as well as information from other genome databases that match a patient’s individual genetic makeup to their outward symptoms. From this, they predicted 3,870 SNP pairs with a similar biological mechanism. These prioritized SNP pairs, which had overlapping messenger RNA targets or similar functions, were more likely to be associated with the same disease than with unrelated pathologies.
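The article does not describe how the team scored SNP pairs, so the following is only an illustrative sketch of the general idea of prioritizing pairs by overlapping targets. It assumes each SNP has already been mapped to a set of target genes and uses a simple Jaccard overlap with an arbitrary cutoff; the function names, the threshold, and the identifiers are all hypothetical and are not taken from the paper.

```python
from itertools import combinations


def jaccard(a, b):
    """Fraction of targets shared by two sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def prioritize_pairs(snp_targets, threshold=0.5):
    """Return (snp_a, snp_b, score) for pairs whose target overlap meets the threshold.

    snp_targets maps a SNP identifier to the set of genes it is assumed to affect.
    The threshold is an arbitrary illustrative cutoff, not a published value.
    """
    pairs = []
    for snp_a, snp_b in combinations(sorted(snp_targets), 2):
        score = jaccard(snp_targets[snp_a], snp_targets[snp_b])
        if score >= threshold:
            pairs.append((snp_a, snp_b, score))
    return pairs


# Made-up identifiers, for illustration only.
example = {
    "snp_1": {"GENE_A", "GENE_B"},
    "snp_2": {"GENE_A", "GENE_B", "GENE_C"},
    "snp_3": {"GENE_D"},
}

for snp_a, snp_b, score in prioritize_pairs(example):
    print(f"{snp_a} / {snp_b}: overlap {score:.2f}")
```

In this toy input, only the first two SNPs share targets, so they form the single prioritized pair. A real analysis would also need a statistical test of whether such overlap exceeds what chance alone would produce.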