Lessons Learned from Implementing a National Infrastructure in Sweden for Storage and Analysis of Next-generation Sequencing Data
Conference Recording Aug 25, 2014
About the SpeakerOla Spjuth is Director of UPPNEX and Deputy Director at the UPPMAX high performance computing center at Uppsala University. He is also researcher at the Department of Pharmaceutical Biosciences with main research interests in the area of high-performance bioinformatics, data integration, and computational pharmacology. AbstractStoring and analyzing data from next-genration sequencing (NGS) requires a well developed e-infrastructure of large scale storage and high-performance computing. Here we describe UPPNEX, an implementation of such an e-infrastructure, tailored to the needs of data storage and analysis of NGS data in Sweden serving various labs and multiple instruments from the major sequencing technology platforms. UPPNEX comprises resources for high-performance computing, large-scale and high-availability storage, an extensive bioinformatics software suite, up-to-date reference genomes and annotations, a support function with system and application experts as well as a web portal and support ticket system. UPPNEX applications are numerous and diverse, and include whole genome-, de novo- and exome sequencing, targeted resequencing, SNP discovery, RNASeq, and methylation analysis. There are over 400 projects that utilize UPPNEX and include large undertakings such as the sequencing of the flycatcher and Norwegian spruce. The presentation covers strategic decisions made when investing in hardware, setting up maintenance and support, allocating resources, and illustrate major challenges such as managing data growth. We also present our experiences and observations with UPPNEX to date, providing insights into the successful and less successful decisions made.