Individualizing Health Care One Byte at a Time
Professor Hoehndorf and his Ph.D. student Imane Boudellioua working together to solve the algorithm. Credit: © 2017 KAUST
Genetic diagnosis of disease and personalization of treatment have the potential to dramatically improve strategies for diagnosis and therapy. Around 80% or rare diseases are thought to have a genetic component, but currently, many patients experience long delays in diagnosis or receive none at all. The challenge is to identify which of the hundreds of thousands of genetic differences between a patient and an unaffected individual might be responsible for their disease; a problem which has been described as "looking for needles in stacks of needles." What's new with this approach is the application of data from model experimental organisms.
How can we make use of the similarities between human and non-human disease models? Non-human model organisms offer significantly more insight because it is possible to induce variation into their genomes and observe the outcomes. This type of data has been collected for close to a century and has culminated into a "big-data" set of genotype-phenotype association data.
A framework called PhenomeNET finds disease genes in a patient-specific manner by matching a patient's phenotype (symptoms) to a large database of gene-to-phenotype associations, including those from non-human model organisms, such as mouse or zebrafish. By combining PhenomeNET with methods that find harmful variants in a genomics sequence, the team developed the PhenomeNET Variant Predictor (PVP) system, an algorithm that prioritizes these variants with their likelihood of involvement in human disease.
"PVP makes use of clinical and experimental data that have been collected for years and uses them to identify the genetic variants underlying the conditions of patients with genetic disorders," said Professor Hoehndorf.
Working with Dr. Nadia Schoenmakers at the Wellcome Trust-MRC Institute of Metabolic Science in Cambridge, the team were able to show that PVP can identify genetic changes in patients with congenital thyroid disease, and can reveal candidate genetic changes in "Mendelian" diseases where only a single gene is involved. Moving forward, the team will seek to determine whether a similar approach can be applied to complex diseases, such as diabetes, where multiple genes are involved.
Scientists want access to data collected by others for their research, but such access could also compromise personal privacy, even after removal of so-called personally identifiable data. Now, statisticians are developing synthetic networks that may increase the availability of some data while still protecting individual or institutional privacy.