Question

Align short reads to multiple reference sequences

1

10.8 years ago

Sandeep ▴ 260

I am trying to align miRNA sequencing reads (very short reads) to multiple FASTA reference sequences (~150). I expect each read to be aligned against every reference sequence separately, so that a hit is reported for each reference it matches. To achieve this I am using the SHRiMP aligner with the following command:

SHRiMP_2_2_3/bin/gmapper-ls -N 2 -o 1 -E input.fasta ../reference.fasta > output.sam

The output contains reads mapped across all of the reference sequences, but each read is reported only once. So if read1 maps to my first reference sequence, no hit is reported for it against any of the other reference sequences.

Is there a way to achieve what I am trying to do?

I have also tried building an index for my reference sequences and aligning with bowtie:

bowtie index input.fastq > output

But this also results in each read aligning only once to the reference.

Is there a parameter I can add to SHRiMP to obtain hits against each individual FASTA reference sequence?

bowtie short-reads shrimp next-gen mirna • 5.4k views

updated 3.5 years ago by Ram 45k • written 10.8 years ago by Sandeep ▴ 260

1

You mean you have one big reference FASTA file containing 150 sequences and you are aligning your reads against it? Because you are using the -o 1 parameter, the aligner reports only the top alignment (the one with the maximum alignment score) for each read. I guess that if a read aligns to several FASTA sequences with equal alignment scores, only one of those alignments is reported because of -o 1. The best thing to do would be to align the reads to every FASTA sequence individually: create 150 reference indices and align the reads to each of them. You can then sort each SAM file by query name and write a script that picks up each read across all the SAM files and checks whether it has been mapped to all the reference sequences with equal scores.
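The checking script described above could look roughly like the following Python sketch. It is only an illustration under stated assumptions: the function names `read_scores` and `reads_equal_on_all` are made up here, each SAM file is assumed to come from aligning the reads to a single reference, and the alignment score is assumed to be in the optional `AS:i:` tag (records without that tag are skipped). Because it keys on the read name in a dictionary, it does not actually require the SAM files to be sorted.

```python
from collections import defaultdict


def read_scores(sam_lines):
    """Return {read_name: best AS score} for the mapped reads of one SAM file.

    Assumption: the aligner writes the alignment score as an AS:i:<n> tag.
    """
    scores = {}
    for line in sam_lines:
        if line.startswith("@") or not line.strip():
            continue  # skip header and blank lines
        fields = line.rstrip("\n").split("\t")
        qname, flag = fields[0], int(fields[1])
        if flag & 4:
            continue  # FLAG bit 0x4 means the read is unmapped
        score = None
        for tag in fields[11:]:  # optional fields start at column 12
            if tag.startswith("AS:i:"):
                score = int(tag[5:])
        if score is None:
            continue  # no score available for this record
        if qname not in scores or score > scores[qname]:
            scores[qname] = score
    return scores


def reads_equal_on_all(per_reference_scores):
    """Return the reads that hit every reference with the same score.

    per_reference_scores maps a reference name to the dictionary
    produced by read_scores() for that reference's SAM file.
    """
    n_refs = len(per_reference_scores)
    by_read = defaultdict(list)
    for scores in per_reference_scores.values():
        for read, score in scores.items():
            by_read[read].append(score)
    return sorted(
        read
        for read, values in by_read.items()
        if len(values) == n_refs and len(set(values)) == 1
    )
```

In practice each SAM file would be opened and passed to `read_scores` (an open file handle iterates line by line), and the results collected into one dictionary keyed by reference name before calling `reads_equal_on_all`.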

updated 5.6 years ago by Ram 45k • written 10.8 years ago by Ashutosh Pandey 12k

0

I do have 150 reference sequences. I wanted to avoid creating 150 separate reference indices and aligning against each of them. I have tried bowtie with the -a option.

Will try to align separately also. Thank you.
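For aligning separately, the combined reference does not have to be split by hand. A minimal Python sketch, assuming each FASTA header begins with a unique name that is also safe to use as a file name; `split_fasta` and `write_split` are illustrative names, not part of SHRiMP or bowtie:

```python
from pathlib import Path


def split_fasta(lines):
    """Yield (name, record_text) for each record of a multi-FASTA input."""
    name, chunk = None, []
    for line in lines:
        if line.startswith(">"):
            if name is not None:
                yield name, "".join(chunk)
            # Assumption: the first word of the header is a unique name.
            name, chunk = line[1:].split()[0], [line]
        elif name is not None:
            chunk.append(line)
    if name is not None:
        yield name, "".join(chunk)


def write_split(fasta_path, out_dir):
    """Write one <name>.fasta file per record and return the paths."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    paths = []
    with open(fasta_path) as handle:
        for name, record in split_fasta(handle):
            path = out / f"{name}.fasta"
            path.write_text(record)
            paths.append(path)
    return paths
```

Each resulting file can then be indexed and aligned against on its own.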

updated 5.6 years ago by Ram 45k • written 10.8 years ago by Sandeep ▴ 260

Ram · Answer 1 · 2014-07-21

1

10.8 years ago

cts ★ 1.7k

As run here, these programs report only a single alignment per read. If you want alignments of all your query sequences against everything in your database, you could just use BLAST. Alternatively, bowtie can report all alignments with the -a option; check out the manual.
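Once an aligner has been told to report every alignment, a single SAM file will hold several records per read, and these can be grouped to see which references each read hit. The following Python sketch assumes SAM output (for bowtie, that means running with `-S` in addition to `-a`); `references_per_read` is an illustrative name:

```python
from collections import defaultdict


def references_per_read(sam_lines):
    """Return {read_name: sorted list of reference names the read aligned to}."""
    hits = defaultdict(set)
    for line in sam_lines:
        if line.startswith("@") or not line.strip():
            continue  # skip header and blank lines
        fields = line.rstrip("\n").split("\t")
        if int(fields[1]) & 4:
            continue  # FLAG bit 0x4 means the read is unmapped
        hits[fields[0]].add(fields[2])  # column 3 (RNAME) is the reference
    return {read: sorted(refs) for read, refs in hits.items()}
```

Reads that appear with more than one reference name are the ones that a single-best-hit run would have reported only once.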

updated 5.6 years ago by Ram 45k • written 10.8 years ago by cts ★ 1.7k