[HTML][HTML] SHRiMP: accurate mapping of short color-space reads

SM Rumble, P Lacroute, AV Dalca… - PLoS computational …, 2009 - journals.plos.org
SM Rumble, P Lacroute, AV Dalca, M Fiume, A Sidow, M Brudno
PLoS computational biology, 2009journals.plos.org
The development of Next Generation Sequencing technologies, capable of sequencing
hundreds of millions of short reads (25–70 bp each) in a single run, is opening the door to
population genomic studies of non-model species. In this paper we present SHRiMP-the
SHort Read Mapping Package: a set of algorithms and methods to map short reads to a
genome, even in the presence of a large amount of polymorphism. Our method is based
upon a fast read mapping technique, separate thorough alignment methods for regular letter …
The development of Next Generation Sequencing technologies, capable of sequencing hundreds of millions of short reads (25–70 bp each) in a single run, is opening the door to population genomic studies of non-model species. In this paper we present SHRiMP - the SHort Read Mapping Package: a set of algorithms and methods to map short reads to a genome, even in the presence of a large amount of polymorphism. Our method is based upon a fast read mapping technique, separate thorough alignment methods for regular letter-space as well as AB SOLiD (color-space) reads, and a statistical model for false positive hits. We use SHRiMP to map reads from a newly sequenced Ciona savignyi individual to the reference genome. We demonstrate that SHRiMP can accurately map reads to this highly polymorphic genome, while confirming high heterozygosity of C. savignyi in this second individual. SHRiMP is freely available at http://compbio.cs.toronto.edu/shrimp.
PLOS