
How it Works: Advanced Data Analysis Using Visualisation

Published: Monday, June 24, 2013
Last Updated: Monday, June 24, 2013
Visualisation is a powerful tool for those working in molecular biology. Here, Qlucore offers a five-step method to ensure repeatable and significant results.

Problem:

A common problem affecting many scientists, especially those working in molecular biology, is the vast amount of data created by their experiments. With such a large volume of data to consider, it is often impossible to derive any real biological meaning from the findings with the naked eye alone or with standard statistical software packages. Sophisticated data algorithms therefore need to be developed in order for researchers to interpret their data effectively.

Until now, computer software designed for this purpose has focused on being able to handle increasingly vast amounts of data. As a result, the role of the scientist/researcher has partly been set aside, and a lot of data analysis is now performed by specialist bioinformaticians and biostatisticians. In most cases, however, this model has several drawbacks, since it is typically the scientist who knows the most about the specific area being studied.

Solution:

Even though the exploration and analysis of large data sets can be challenging, the active use of Visualisation techniques can provide a powerful way of identifying important structures and patterns very quickly. Visualisation gives the user instant feedback, with results that present themselves as they are being generated. Visualisation also stimulates innovation, because scientists are now able to analyze their data themselves.

We recommend a five-step method to ensure repeatable and significant results when using Visualisation. By applying this five-step method, it is possible to investigate large and complex data sets without being a statistics expert. The method is described below in more detail, but some basics need to be in place at the start.

First of all, the high-dimensional data needs to be reduced to lower dimensions so that it can be plotted in 3D. We recommend the use of Principal Component Analysis (PCA) for this purpose. Tools that color the data to bring out information are also required, as well as filters and tools to select and deselect parts of the data set.
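As an illustration of the dimension reduction described above, the following is a minimal sketch of PCA using NumPy. It is not Qlucore's implementation. The function name `pca_project`, the synthetic data, and the group labels are assumptions made for the example.

```python
import numpy as np

def pca_project(data, n_components=3):
    """Project samples (rows) onto the leading principal components.

    Returns the projected coordinates and the fraction of the total
    variance captured by those components.
    """
    data = np.asarray(data, dtype=float)
    centered = data - data.mean(axis=0)
    # SVD of the centered matrix; the rows of vt are the principal directions.
    _, s, vt = np.linalg.svd(centered, full_matrices=False)
    scores = centered @ vt[:n_components].T
    variances = s ** 2
    explained = variances[:n_components].sum() / variances.sum()
    return scores, explained

# Hypothetical data set: 40 samples x 200 variables, two groups of 20
# samples that differ in the first 10 variables.
rng = np.random.default_rng(0)
groups = np.repeat([0, 1], 20)
data = rng.normal(size=(40, 200))
data[groups == 1, :10] += 3.0

scores, explained = pca_project(data)
print(scores.shape)          # coordinates ready for a 3D scatter plot
print(round(explained, 3))   # fraction of variance captured in 3D
```

The three columns of `scores` are the coordinates that would be plotted in 3D, and `groups` could be used to color the points.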

At this stage, researchers can begin the five-step Visualisation process by detecting and removing the strongest signal present in the active dataset. Once this signal is identified, it can be removed in order to see whether there are any other obscured (but still detectable) signals present. Removing a strong signal will usually reduce the number of active samples, the number of active variables, or both.

Step two of the Visualisation process is to assess the signal-to-noise ratio in the data by using PCA and randomization. The strength of a visually detected signal or pattern is measured by examining the amount of variance captured in the 3D PCA-plot. This captured variance is compared with what the researcher would expect to capture if the real variables were all replaced by random variables, and will therefore give a clear indication of how reliable the identified pattern is.
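The comparison against random variables can be sketched as follows. This is one possible randomization scheme, assumed here for illustration: each variable is shuffled independently across samples, which keeps its distribution but destroys any shared structure. The function names and the synthetic data are hypothetical, and the article does not specify the exact procedure used in Qlucore's software.

```python
import numpy as np

def captured_variance(data, n_components=3):
    """Fraction of total variance captured by the leading components."""
    centered = data - data.mean(axis=0)
    s = np.linalg.svd(centered, compute_uv=False)
    variances = s ** 2
    return variances[:n_components].sum() / variances.sum()

def randomized_baseline(data, n_components=3, n_rounds=20, seed=0):
    """Average captured variance after shuffling each variable on its own."""
    rng = np.random.default_rng(seed)
    fractions = []
    for _ in range(n_rounds):
        # Permute every column independently of the others.
        shuffled = np.column_stack([rng.permutation(col) for col in data.T])
        fractions.append(captured_variance(shuffled, n_components))
    return float(np.mean(fractions))

# Hypothetical data: 40 samples x 200 variables with one group difference.
rng = np.random.default_rng(1)
data = rng.normal(size=(40, 200))
data[20:, :10] += 3.0

observed = captured_variance(data)
baseline = randomized_baseline(data)
print(observed > baseline)  # a real pattern should beat the random baseline
```

The larger the gap between `observed` and `baseline`, the less likely it is that the pattern seen in the 3D plot is an artifact of noise.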

Step three is to remove any "noise" by variance filtering. If researchers can see a significant signal-to-noise ratio in their active dataset, they should try to remove some of the active variables that are most likely contributing to the noise.
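A simple form of variance filtering might look like the sketch below. The rule used here, keeping variables whose standard deviation is at least a given fraction of the largest standard deviation, is an assumption chosen for illustration, as are the function name `variance_filter` and the parameter `min_ratio`.

```python
import numpy as np

def variance_filter(data, min_ratio=0.5):
    """Keep variables (columns) whose spread is large relative to the maximum.

    Returns the filtered matrix and the indices of the variables kept.
    """
    data = np.asarray(data, dtype=float)
    std = data.std(axis=0, ddof=1)
    keep = std >= min_ratio * std.max()
    return data[:, keep], np.flatnonzero(keep)

# Hypothetical data: one constant, one low-variance and one high-variance variable.
data = np.array([
    [0.0, 1.0, 10.0],
    [0.0, 2.0, 20.0],
    [0.0, 3.0, 30.0],
    [0.0, 4.0, 40.0],
])

filtered, kept = variance_filter(data, min_ratio=0.5)
print(kept)            # indices of the variables that remain active
print(filtered.shape)
```

Raising `min_ratio` removes more variables. In practice the threshold would be adjusted interactively while watching how the plot and the captured variance respond.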

Step four is the optional use of statistical tests, which can be applied at any or all of the other stages of the five-step process: during the initial analysis, when a step is repeated, at the end of a step, or not at all.

The final step uses graphs to refine the search for subgroups or clusters. Connecting samples in networks or graphs, for example, makes it possible to move into higher dimensions (i.e. more than three), since the graph created in a sample plot is based on the distances in the space of all active variables, and can therefore provide more insight into the structure of the data.
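One way to build such a graph is to connect each sample to its nearest neighbours, with distances computed over all active variables rather than over the three plotted dimensions. The sketch below assumes a k-nearest-neighbour graph with Euclidean distance. The article does not state which graph construction is used, and the function name `knn_graph` and the example points are hypothetical.

```python
import numpy as np

def knn_graph(data, k=2):
    """Return undirected edges linking each sample to its k nearest neighbours.

    Distances are Euclidean and use every variable (column) in the data.
    """
    data = np.asarray(data, dtype=float)
    diff = data[:, None, :] - data[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=2))
    np.fill_diagonal(dist, np.inf)  # a sample is not its own neighbour
    edges = set()
    for i, row in enumerate(dist):
        for j in np.argsort(row)[:k]:
            edges.add((min(i, int(j)), max(i, int(j))))
    return sorted(edges)

# Hypothetical data: six samples in four dimensions forming two tight clusters.
data = np.array([
    [0.0, 0.0, 0.0, 0.0],
    [0.1, 0.0, 0.0, 0.0],
    [0.0, 0.1, 0.0, 0.0],
    [5.0, 5.0, 5.0, 5.0],
    [5.1, 5.0, 5.0, 5.0],
    [5.0, 5.1, 5.0, 5.0],
])

edges = knn_graph(data, k=2)
print(edges)
```

Drawing these edges on top of the 3D sample plot shows which samples are close in the full variable space, even when the projection places them apart.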

These five steps are then repeated until there are no more structures to be found.

When used in this way, Visualisation is a powerful tool for researchers, since the human brain is very good at detecting structures and patterns. If data can be visualized clearly, scientists can identify interesting or significant results easily and by themselves, without having to rely on specialist bioinformaticians and biostatisticians. Instead, the scientist can co-operate with the bioinformaticians to achieve even more interesting results.

Other important aspects of visualisation are the stimulation of innovation and the organizational learning effect of getting all competence groups (scientists and bioinformaticians) deeply involved in the data analysis.

