Making Big Data More Accessible
When a group of researchers in the Undiagnosed Disease Network at Baylor College of Medicine realized they were spending days combing through databases searching for information regarding gene variants, they decided to do something about it. By creating MARRVEL (Model organism Aggregated Resources for Rare Variant ExpLoration) they are now able to help not only their own lab but also researchers everywhere search databases all at once and in a matter of minutes.
This collaborative effort among Baylor, the Jan and Dan Duncan Neurological Research Institute at Texas Children’s Hospital (NRI) and Harvard Medical School is described in the latest online edition of the American Journal of Human Genetics.
Big data search engine
“One big problem we have is that tens of thousands of human genome variants and phenotypes are spread throughout a number of databases, each one with their own organization and nomenclature that aren’t easily accessible,” said Julia Wang, an M.D./Ph.D. candidate in the Medical Scientist Training Program at Baylor and a McNair Student Scholar in the Bellen lab, as well as first author on the publication. “MARRVEL is a way to assess the large volume of data, providing a concise summary of the most relevant information in a rapid user-friendly format.”
MARRVEL displays information from OMIM, ExAC, ClinVar, Geno2MP, DGV, and DECIPHER, all separate databases to which researchers across the globe have contributed, sharing tens of thousands of human genome variants and phenotypes. Since there is not a set standard for recording this type of information, each one has a different approach and searching each database can yield results organized in different ways. Similarly, decades of research in various model organisms, from mouse to yeast, are also stored in their own individual databases with different sets of standards.
Dr. Zhandong Liu, assistant professor in pediatrics – neurology at Baylor, a member of the NRI and co-corresponding author on the publication, explains that MARRVEL acts similar to an internet search engine.
“This program helps to collate the information in a common language, drawing parallels and putting it together on one single page. Our program curates model organism specific databases to concurrently display a concise summary of the data,” Liu said.
Existing 20-year-old Multiple Sclerosis Drug Effective Against Multi-resistant BacteriaNews
A widely-used and twenty-year-old medicine used to treat multiple sclerosis can also beat a type of multi-resistant bacteria for which there are currently only a few effective drugs.READ MORE
Pre-Diabetes Discovery Marks Step Towards Precision MedicineNews
Identification of three molecules that can be used to accurately assess pre-diabetes – a key predictor of conditions such as diabetes and high blood pressure – has brought precision medicine for humans a step closer.READ MORE
Revolutionary Imaging Technique Uses CRISPR to Map DNA MutationsNews
The new high-speed AFM method can map DNA to a resolution of tens of base pairs while creating images up to a million base pairs in size. And it does it using a fraction of the amount of specimen required for DNA sequencing.READ MORE
Comments | 0 ADD COMMENT
3rd Annual NGS Data Analysis and Informatics Conference
Feb 08 - Feb 09, 2018
3rd Annual Genome Editing & Engineering Conference
Feb 08 - Feb 09, 2018