GenomeStack Big Data Analytics Database
News May 29, 2015
The GenomeStack Big Data Analytics platform has been developed specifically for bioinformatics researchers, data scientists and analysts conducting genome research. The database replaces the traditional file-based, manual process for storing and analyzing genome sequenced data.Post-aligned sequencing data from multiple samples is stored in an ultra-fast and highly scalable SQL database, enabling querying of a large number of samples simultaneously. Thus, an analysis of 50 BAM files with 30 billion rows of aligned sequencing data, can be analyzed in under a minute, while providing a detailed drill-down from all participating samples to the single read level in order to review the nucleotides distribution on a specific chromosome position.