What's the Big Deal about Big Data?

Published: Monday, September 17, 2012
Last Updated: Monday, September 17, 2012
The data-intensive nature of scientific research is currently driving the emergence of big data solutions that can gather, analyze, and transport extremely large volumes of data among multiple locations worldwide.

Wikipedia defines big data as "a collection of data sets so large and complex that it becomes difficult to process using on-hand database management tools. The challenges include capture, storage, search, sharing, analysis, and visualization".

Laboratories have been dealing with large amounts of data for decades, and the volume grows dramatically every year as data sets become larger.  The problem has long been how to manage and mine that data for relevant information.  In the current data-intensive environment, the difficulty of executing data management tasks has increased sharply.

What's interesting is how big data is changing the nature of data management in the lab.  The relational databases and desktop statistics and visualization packages that were so effective previously are not up to the task.  Instead, big data relies on massively parallel software running on a large number of servers, often more than a single organization can afford to run on its own.
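As a rough illustration of the parallel approach described above, the sketch below splits a set of sequencing reads into chunks, tallies the bases in each chunk separately, and merges the partial results. It is a minimal sketch, not a production design: the function names and sample reads are hypothetical, and a thread pool on one machine stands in for what would be many servers in a real deployment.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor


def count_bases(reads):
    """Map step: tally the bases in one chunk of reads."""
    counts = Counter()
    for read in reads:
        counts.update(read)
    return counts


def split(items, parts):
    """Divide a list into at most `parts` roughly equal chunks."""
    size = max(1, -(-len(items) // parts))  # ceiling division
    return [items[i:i + size] for i in range(0, len(items), size)]


def parallel_base_counts(reads, workers=4):
    """Fan the chunks out to workers, then merge the partial tallies."""
    chunks = split(reads, workers)
    total = Counter()
    # Threads stand in for servers here (an assumption for illustration).
    with ThreadPoolExecutor(max_workers=workers) as pool:
        for partial in pool.map(count_bases, chunks):
            total += partial  # reduce step: combine partial results
    return total


# Hypothetical sample data, not taken from any real instrument.
reads = ["ACGT", "GGCC", "ATAT", "CGCG", "TTAA"]
totals = parallel_base_counts(reads, workers=2)
print(totals["G"], totals["C"])
```

The point of the pattern is that each chunk can be processed without knowledge of the others, so adding workers (or servers) shortens the run without changing the result.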

One such solution is an open-source NoSQL database designed to deliver massive amounts of data over web and cloud applications. NoSQL databases generally do not store data in relational tables and thus generally do not use SQL as the query language. What they do use is a distributed, fault-tolerant architecture that manages the data redundantly on multiple servers.
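The idea of managing data redundantly on multiple servers can be sketched in a few lines. The toy store below is an assumption-laden illustration, not the design of any particular NoSQL product: the class names, node names, and the simple "hash the key, then take the next few nodes" placement rule are all hypothetical. It writes each record to several in-memory nodes so that a read still succeeds after one node is lost.

```python
import hashlib


class Node:
    """One hypothetical server holding a slice of the data in memory."""

    def __init__(self, name):
        self.name = name
        self.store = {}
        self.alive = True


class ReplicatedStore:
    """Toy key-value store that writes each record to several nodes."""

    def __init__(self, node_names, replicas=3):
        if replicas > len(node_names):
            raise ValueError("replicas cannot exceed the number of nodes")
        self.nodes = [Node(name) for name in node_names]
        self.replicas = replicas

    def _owners(self, key):
        # Hash the key to pick a starting node, then take the next
        # `replicas` nodes in order, wrapping around the list.
        digest = hashlib.sha256(key.encode("utf-8")).hexdigest()
        start = int(digest, 16) % len(self.nodes)
        return [self.nodes[(start + i) % len(self.nodes)]
                for i in range(self.replicas)]

    def put(self, key, value):
        # Write to every owner that is currently reachable.
        written = 0
        for node in self._owners(key):
            if node.alive:
                node.store[key] = value
                written += 1
        if written == 0:
            raise RuntimeError("no replica available for write")
        return written

    def get(self, key):
        # Read from the first live owner that has the key.
        for node in self._owners(key):
            if node.alive and key in node.store:
                return node.store[key]
        raise KeyError(key)


store = ReplicatedStore(["node-a", "node-b", "node-c", "node-d", "node-e"])
store.put("sample-001", {"reads": 1200000, "instrument": "seq-1"})

# Simulate losing the first server that holds the record.
store._owners("sample-001")[0].alive = False
record = store.get("sample-001")
print(record["reads"])  # prints 1200000
```

Real systems add much more, such as repairing stale replicas and rebalancing when nodes join or leave, but the core trade is the same: extra copies cost storage and buy availability.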

NoSQL databases don't replace databases such as Oracle RDBMS; instead, they provide an entirely new way to manage data because they allow applications to collect and analyze massive amounts of information from numerous sources.

Life sciences laboratories are particularly affected by the big data trend.  When it comes to genomics, for instance, petabyte-scale networks are emerging that better support genomic research and new clinical requirements.

There is also growing demand to manage big data using cloud computing platforms, and to move large volumes of next-gen DNA sequencing and research data at high speed over vast distances.  The challenges of moving this data into and out of the cloud are being addressed. This area has been led by Genentech, one of the early adopters of big data and cloud computing solutions to support its research.

Perhaps laboratories should have seen this coming, since it is the inevitable result of better instrumentation that generates more data faster, which in turn requires better analytical solutions. But hindsight is always 20/20.

