BWA mapping with mismatches
1
2
Entering edit mode
8.6 years ago
Sissi ▴ 60

Hi guys,

I've read a lot around but I still can't find a solution. I need to map (with BWA bwasw) contigs on a Reference genome and they share only 70% of identity. Using the default parameters I don't see any mapped contigs. So, I would like to allow more mismatches. Which parameters should I change?

Thanks a lot

Assembly sequencing mapping bwa RNA-Seq • 5.3k views
ADD COMMENT
1
Entering edit mode

Hi Agata!

Thanks for your reply! I tried to change -b in bwa bwasw and -B in bwa mem, but when I export the mapped contigs with samtools, the fasta file is empty. Something is still wrong :/

Sil

ADD REPLY
0
Entering edit mode

Could you write commands for bwa i samtools, please.

ADD REPLY
0
Entering edit mode

Agata, here the commands I used:

bwa bwasw -b 10 ../Ref.fasta oases-all.fasta | samtools view -bS - > G10.bam samtools view -F 4 G10.bam | awk '{print">"$1"\n"$10}' > G10.fasta

ADD REPLY
0
Entering edit mode

How big is your BAM file? Maybe wrong reference?

ADD REPLY
0
Entering edit mode

The bam file is 62kb and the reference quite short, just 2000bp more or less. By blastn I'm sure there are some contigs that match to this Reference, that's why I'm pretty sure that If I map them with less stringency I should get something :/

ADD REPLY
0
Entering edit mode

It looks like you are trying to align Oases output, which is a transcriptome assembler. Is that correct? If so, then you should be using a splice-aware aligner like TopHat or STAR.

ADD REPLY
0
Entering edit mode

Why BWA-SW specifically? Why not try BWA-MEM? Or a different aligner entirely?

ADD REPLY
0
Entering edit mode

There is no specific reason actually..

ADD REPLY
0
Entering edit mode

Why do you need only 70% of identity? Can you explain the application scene? Maybe I can write a simple aligner for your requirement.

ADD REPLY
0
Entering edit mode

Hi biomaster!

I'm doing de novo assembly of viral siRNAs. I blasted (blastn) the contigs and found that the closest virus has only 70% of identity. Now I want to map this contigs on this viral reference sequence to reconstruct my consensus, but with the default parameters of BWA I can't get any mapped contigs (too many mismatches I guess). Hope I explained my self :)

ADD REPLY
1
Entering edit mode
8.6 years ago
agata88 ▴ 870

Here is a manual, http://bio-bwa.sourceforge.net/bwa.shtml

and as far I can see you can use:

-b INT Mismatch penalty [3]

Try to use different options to create results you need. Alternatively bwa mem has more options to check.

Best,

Agata

ADD COMMENT

Login before adding your answer.

Traffic: 1657 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6