Applied Biosystems and BCM Summarize Contributions to First Data Release of 1000 Genomes Project
News Feb 18, 2009
Applied Biosystems and scientists from the Baylor College of Medicine Human Genome Sequencing Center (HGSC), have summarized their collective contribution to the first data release of the 1000 Genomes Project.
As a commercial participant, and as part of Applied Biosystems’ collaboration with the HGSC, more than 460 gigabases of mappable sequence data has been generated by the SOLiD™ System, representing 65 % more than the target for the two organizations when the collaborative project was conceived.
The HGSC and Applied Biosystems are key participants in the 1000 Genomes Project, an international research consortium that aims to sequence the genomes of approximately 1,000 people from around the world to create the most detailed and medically useful compendium of human genetic variation.
In April 2008, the HGSC established a collaboration agreement with Applied Biosystems to use six SOLiD Systems to expand its contribution to the pilot phase of the project, and help researchers to determine the best approach for accomplishing its goals. The HGSC’s target for contribution to the project using the SOLiD System was 200 gigabases of sequence data.
In June 2008, Applied Biosystems joined the 1000 Genomes Project as a commercial participant, and committed to contribute a minimum of 75 gigabases of sequence data using the SOLiD System.
As a result of the pilot phase of the project, use of the SOLiD System for the collaboration between the HGSC and Applied Biosystems produced the following results:
• The HGSC collected sequence data from 25 genomes, generating 256 gigabases of uniquely mappable sequence data;
• The HGSC sequenced 24 individuals at approximately 2.6-fold coverage, and sequenced one individual at 26-fold coverage;
• By the end of 2008, the HGSC was generating an average of 15 gigabases per mate pair sequencing run using the SOLiD 2.0 System.
“One of the reasons we chose the SOLiD System for this research was the ability to generate mate-pair reads with inserts from one to three kilobases, which provides very good placement of the reads and allows us to determine genetic structural variation such as inversions, translocations, insertions, and deletions in complex genomes,” said Donna Muzny, Director of Operations at the HGSC.
Applied Biosystems’ commercial participation in the 1000 Genomes Project has enabled scientists to assess technology performance on a diverse set of biological samples. According to Applied Biosystems, the SOLiD System has contributed the following to the project:
• More than 206 gigabases of mappable sequence data, representing two-fold more data than the original committed target of 75 gigabases.
• These 206 gigabases averaged 17 gigabases per sequencing run, which was generated on the SOLiD 2.0 System. The throughput per run represents approximately five-fold coverage of the human genome.
• In the first data release from the 1000 Genome Project, the SNP data from an anonymous African sample was supported by data from the SOLiD 2.0 System.
• In the first data release of the 1000 Genomes Project, the small insertions and deletions (indels) data from an anonymous African sample was generated by the SOLiD 2.0 System.
Closer Look at Immune Proteins Provides Insight into Potential Pathogen Protection StrategiesNews
Biologists have resolved the structure of a ring of proteins used by the immune system to summon support when under attack, providing new insight into potential strategies for protection from pathogens.READ MORE
Researchers Discover Mutation That Appears to Protect Against Multiple Aspects of Biological AgingNews
The first genetic mutation that appears to protect against multiple aspects of biological aging in humans has been discovered in an extended family of Old Order Amish living in the vicinity of Berne, Indiana, report Northwestern Medicine scientists.READ MORE
Computer Program Helps Find Ways to Repurpose Existing DrugsNews
Researchers have developed a computer program to find new indications for old drugs. The computer program, called DrugPredict, matches existing data about FDA-approved drugs to diseases, and predicts potential drug efficacy.READ MORE