Tool for the Identification of Targeted Sequences from Multidimensional High Throughput Sequencing Data
News Oct 07, 2013
The advent of next-generation high-throughput technologies has revolutionized whole genome sequencing, yet some experiments require sequencing only of targeted regions of the genome from a very large number of samples. These regions can be amplified by PCR and sequenced by next-generation methods using a multidimensional pooling strategy. However, there is at present no available generalized tool for the computational analysis of target-enriched NGS data from multidimensional pools.
Here we present InsertionMapper, a pipeline tool for the identification of targeted sequences from multidimensional high throughput sequencing data. InsertionMapper consists of four independently working modules: Data Preprocessing, Database Modeling, Dimension Deconvolution and Element Mapping. We illustrate InsertionMapper with an example from our project 'New reverse genetics resources for maize', which aims to sequence-index a collection of 15,000 independent insertion sites of the transposon Ds in maize. Identified sequences are validated by PCR assays. This pipeline tool is applicable to similar scenarios that require analysis of the very large volume of short reads produced when targeted genome sequences are sequenced by NGS.
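The idea behind deconvolving a multidimensional pool can be illustrated with a minimal sketch. It is not InsertionMapper's actual algorithm or code; it assumes a hypothetical three-dimensional layout in which each sample has one coordinate per dimension and is sequenced once in each of its pools. The function names (`build_pools`, `deconvolve`) and the sequence labels are invented for illustration. A sequence seen in exactly one pool per dimension can be assigned to a single sample; anything else is reported as ambiguous.

```python
def build_pools(samples):
    """Simulate pooling. `samples` maps a coordinate tuple, e.g. (x, y, z),
    to the set of sequence IDs present in that sample (hypothetical data)."""
    pools = {}
    for coord, seqs in samples.items():
        for dim, idx in enumerate(coord):
            # Each sample contributes its sequences to one pool per dimension.
            pools.setdefault((dim, idx), set()).update(seqs)
    return pools


def deconvolve(pools, n_dims):
    """Assign each sequence to a sample coordinate when it occurs in exactly
    one pool along every dimension; otherwise report it as ambiguous."""
    hits = {}
    for (dim, idx), seqs in pools.items():
        for seq in seqs:
            per_dim = hits.setdefault(seq, [set() for _ in range(n_dims)])
            per_dim[dim].add(idx)

    assigned, ambiguous = {}, {}
    for seq, per_dim in hits.items():
        if all(len(indices) == 1 for indices in per_dim):
            assigned[seq] = tuple(next(iter(indices)) for indices in per_dim)
        else:
            ambiguous[seq] = [sorted(indices) for indices in per_dim]
    return assigned, ambiguous


# Toy example: ds_C occurs in two different samples, so its pool
# pattern does not resolve to a single coordinate.
samples = {
    (0, 0, 0): {"ds_A"},
    (1, 2, 0): {"ds_B"},
    (0, 1, 1): {"ds_C"},
    (1, 0, 1): {"ds_C"},
}
assigned, ambiguous = deconvolve(build_pools(samples), n_dims=3)
```

In this toy run, ds_A and ds_B each resolve to one coordinate, while ds_C is flagged as ambiguous because it appears in two pools along the first two dimensions. Real data would also need read-count thresholds and tolerance for missing or contaminating reads, which this sketch leaves out.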
InsertionMapper proved effective for identifying target-enriched sequences from multidimensional high throughput sequencing data. With adjustable parameters and experiment configurations, the tool can save considerable computational effort for biologists who need to find their sequences of interest within the huge output of modern DNA sequencers. InsertionMapper is freely accessible at https://sourceforge.net/p/insertionmapper and http://bo.csam.montclair.edu/du/insertionmapper.
This article is published online in BMC Genomics and is free to access.