We've updated our Privacy Policy to make it clearer how we use your personal data.

We use cookies to provide you with a better experience. You can read our Cookie Policy here.

Advertisement
ArrayExpress Database Doubles in Size to 100,000 Hybridizations
News

ArrayExpress Database Doubles in Size to 100,000 Hybridizations

ArrayExpress Database Doubles in Size to 100,000 Hybridizations
News

ArrayExpress Database Doubles in Size to 100,000 Hybridizations

Read time:
 

Want a FREE PDF version of This News Story?

Complete the form below and we will email you a PDF version of "ArrayExpress Database Doubles in Size to 100,000 Hybridizations"

First Name*
Last Name*
Email Address*
Country*
Company Type*
Job Function*
Would you like to receive further email communication from Technology Networks?

Technology Networks Ltd. needs the contact information you provide to us to contact you about our products and services. You may unsubscribe from these communications at any time. For information on how to unsubscribe, as well as our privacy practices and commitment to protecting your privacy, check out our Privacy Policy

ArrayExpress, the publicly available database of transcriptomics data at the European Molecular Biology Laboratory's European Bioinformatics Institute [EMBL-EBI], has doubled in size in 2007, reaching the 100,000-hybridisation milestone. The database now holds snapshots of gene expression [identifying which genes are specifically expressed in a particular tissue or in response to a drug, for example] for more than 180 species under thousands of experimental conditions.

The latest acceleration in growth reflects not only the increased numbers of direct submissions, but also the mass import of data from the Gene Expression Omnibus [GEO], which is produced by the US National Institute of Biotechnology Information.

The import of data from GEO is the first step towards the regular exchange of data among public repositories for transcriptomic data. Similar data exchange agreements among biological data providers are widely recognized as being the most effective way of maintaining and quality-controlling the public record of biological information.

GEO data entering ArrayExpress are curated and scored for compliance to MIAME, the microarray community's minimum information standard. Users can therefore search for experiments that have been submitted to either database, annotated with common terms, and can download them in MAGETAB – a user-friendly tab-delimited format that simplifies meta-analyses of experiments from different labs. MAGETAB was developed under the direction of the MGED society, which works to simplify data sharing for microarray researchers.

The imported GEO data, like all the data in ArrayExpress, have also been integrated with other EBI resources. For example, users can now hop from an ArrayExpress entry straight to the relevant genes in Ensembl or proteins in UniProt, simplifying the process of data analysis and download for biomedical researchers.

The rate of growth of ArrayExpress seems set to increase further in the future, as new high-throughput sequencing-based transcriptomics applications are already resulting in the generation of huge amounts of data. Dealing with this barrage of new data will be the next challenge for ArrayExpress and its collaborators.

Advertisement