Making Big Data More Accessible
When a group of researchers in the Undiagnosed Diseases Network at Baylor College of Medicine realized they were spending days combing through databases for information on gene variants, they decided to do something about it. They created MARRVEL (Model organism Aggregated Resources for Rare Variant ExpLoration), which lets not only their own lab but researchers everywhere search multiple databases at once, in a matter of minutes.
This collaborative effort among Baylor, the Jan and Dan Duncan Neurological Research Institute at Texas Children’s Hospital (NRI) and Harvard Medical School is described in the latest online edition of the American Journal of Human Genetics.
Big data search engine
“One big problem we have is that tens of thousands of human genome variants and phenotypes are spread throughout a number of databases, each one with their own organization and nomenclature that aren’t easily accessible,” said Julia Wang, an M.D./Ph.D. candidate in the Medical Scientist Training Program at Baylor and a McNair Student Scholar in the Bellen lab, as well as first author on the publication. “MARRVEL is a way to assess the large volume of data, providing a concise summary of the most relevant information in a rapid user-friendly format.”
MARRVEL displays information from OMIM, ExAC, ClinVar, Geno2MP, DGV, and DECIPHER, all separate databases to which researchers across the globe have contributed, sharing tens of thousands of human genome variants and phenotypes. Because there is no set standard for recording this type of information, each database takes a different approach, and a search of each one can return results organized in a different way. Similarly, decades of research in various model organisms, from mouse to yeast, are stored in their own individual databases, each with its own set of standards.
Dr. Zhandong Liu, assistant professor in pediatrics – neurology at Baylor, a member of the NRI and co-corresponding author on the publication, explains that MARRVEL acts much like an internet search engine.
“This program helps to collate the information in a common language, drawing parallels and putting it together on one single page. Our program curates model organism specific databases to concurrently display a concise summary of the data,” Liu said.
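To make the idea of collating records into "a common language" concrete, here is a minimal Python sketch. It is not MARRVEL's actual code. The two sources, their field names, the gene symbols and the variant strings are all invented placeholders; the sketch only assumes that each database labels the same kind of information differently, as described above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class VariantRecord:
    """Common shape used for every record, whatever its source."""
    source: str
    gene: str
    variant: str
    note: str


def from_source_a(raw: dict) -> VariantRecord:
    # Hypothetical source A: lowercase keys, gene symbols in mixed case.
    return VariantRecord("source_a", raw["gene_symbol"].upper(), raw["hgvs"], raw["phenotype"])


def from_source_b(raw: dict) -> VariantRecord:
    # Hypothetical source B: the same information under different field names.
    return VariantRecord("source_b", raw["Gene"].upper(), raw["change"], raw["summary"])


def collate(records: list[VariantRecord], gene: str) -> dict[str, list[VariantRecord]]:
    """Group the records for one gene by source, ready to show on a single page."""
    wanted = gene.upper()
    page: dict[str, list[VariantRecord]] = {}
    for record in records:
        if record.gene == wanted:
            page.setdefault(record.source, []).append(record)
    return page


# Invented placeholder data, not real database content.
records = [
    from_source_a({"gene_symbol": "gene1", "hgvs": "c.100A>G", "phenotype": "placeholder note A"}),
    from_source_b({"Gene": "GENE1", "change": "c.100A>G", "summary": "placeholder note B"}),
    from_source_b({"Gene": "GENE2", "change": "c.5C>T", "summary": "unrelated gene"}),
]

for source, hits in collate(records, "Gene1").items():
    for hit in hits:
        print(f"{source}: {hit.gene} {hit.variant} ({hit.note})")
```

Each source gets its own small adapter function, so a new database can be added without touching the summary step. That is one plausible way to organize such a tool, not a description of how MARRVEL is built.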