Translational Genomics Accelerates Cancer Cure Research Using SGI Technology
News Dec 07, 2007
Purchased in April through a National Institute of Health (NIH) grant, the SGI® Altix® 4700 will assist TGen researchers in understanding the genomic variation - a process which requires comparison searches of enormous data sets - that can be used for diagnosis and treatment of disease in a manner tailored to individual patients.
TGen selected the SGI Altix 4700 system with over half a terabyte of shared memory so researchers in the Phoenix, Ariz, institute can search across multiple chromosomes, all in memory, without having to break the problems into smaller pieces, enabling researchers to look at the whole instead of the sum of the parts.
While custom in-house code will be written for these large data searches, TGen reports that benchmarks were run on the National Center for Biotechnology Information (NCBI) BLAST (Basic Local Alignment Search Tool), an algorithm for comparing primary biological sequence information, ClustalW, a general purpose multiple sequence alignment program for DNA or proteins, and NAMD, a parallel molecular dynamics code for large bimolecular systems. Testing resulted in performance improvements of up to 50 percent on the 64-bit SGI Altix system as compared to their existing 32-bit system architecture.
"Technology, and microarrays specifically, have allowed the size of bioinformatics data sets to become so large that it became vital to acquire a large memory, 64-bit computational infrastructure to be able to manipulate and analyze those files at the level our researchers required," said James Lowey, Director of High Performance Bio-Computing, The Translational Genomics Research Institute.
"Conventional 32-bit computer architectures cannot address memory above 4GB. This limitation poses sub-optimal analytical approaches due to the prohibitively protracted computer analysis time needed for optimal mathematical models and computational algorithms."
The molecular profile datasets being analyzed at TGen cover malignant myeloma, melanoma, Alzheimer's, autism and pancreatic, prostate, colon, and breast cancers. With microarrays, each individual file can be up to 150MB. To search and compare genetic patterns, researchers take hundreds and hundreds of copies of that individual file.
TGen's code will harness the compute power of the SGI Altix system's global shared memory to run microarrays, searching for variations as minute as a change on one protein, trying to determine what effect that has across the entire spectrum of what is being observed.
"The success of TGen scientists to date has come at the sacrifice of time," says Dr. Edward Suh, CIO of TGen. "However, individuals affected with the disease do not have the luxury of time. The 64-bit SGI computing system will optimize TGen researchers' ability to meet their data analysis needs efficiently, hopefully leading to timely and effective discovery for improved human health."
The system will be housed at Arizona State University (ASU) in Tempe, AZ, with operational support provided by ASU's Fulton High Performance Computing Initiative (HPCI). "The shared memory capabilities of the new SGI system provide a welcome addition to the HPC portfolio in Arizona, and will enable researchers at TGen and their collaborators at ASU to address problems we've been unable to tackle in the past," said Dan Stanzione, the Director of the Fulton HPCI.
Purchased through James River Technical, Inc., SGI's designated partner for higher education and research, and installed in late May 2007, the SGI Altix 4700 system at TGen is equipped with 576GB memory and 48 Intel® Itanium® 2 cores.
"We are very excited about the award of the NIH grant to TGen, and the subsequent installation of the SGI Altix system", said Tom Mountcastle, President of James River Technical. "Genomics is a strategic focus for JRT and SGI and the management and processing of large data sets and workflow are ideal fits for the Altix 4700. We are pleased that the system has met the expectations of TGen and we are proud to be a part of the support infrastructure to facilitate this very important research that affects all of our lives."