To continue reading this article, sign up for FREE to
Membership is FREE
and provides you with
instant access
to email newsletters, digital publications, our full content catalogue & more...
GenomeStack Big Data Analytics Database
The GenomeStack Big Data Analytics platform has been developed specifically for bioinformatics researchers, data scientists and analysts conducting genome research. The database replaces the traditional file-based, manual process for storing and analyzing genome sequenced data.Post-aligned sequencing data from multiple samples is stored in an ultra-fast and highly scalable SQL database, enabling querying of a large number of samples simultaneously. Thus, an analysis of 50 BAM files with 30 billion rows of aligned sequencing data, can be analyzed in under a minute, while providing a detailed drill-down from all participating samples to the single read level in order to review the nucleotides distribution on a specific chromosome position.