How to map by bwa with 50 mismatches?

Entering edit mode

8.2 years ago

agata88 ▴ 870

Hi all,

I am trying to map amplicon PR reads (2x150bp) to gene reference by bwa. I would like to allow bwa align reads with 50 mismatches in first 150bp read and the same for second read.

Could you please help me find an option or algorithm that will be able to do that?

Thanks in advance,

Best,

Agata

bwa • 2.5k views

ADD COMMENT • link 8.2 years ago by agata88 ▴ 870

Entering edit mode

I wonder where those mismatches come from, and I believe you are trying to achieve something that probably has a better solution. What is the aim of this project? What is sequenced?

ADD REPLY • link 8.2 years ago by WouterDeCoster 47k

Entering edit mode

If this is amplicon seq why do you need to allow 1/3rd mismatches as compared to the length of the reads?

ADD REPLY • link 8.2 years ago by GenoMax 148k

Entering edit mode

Are you trying to induce artificial mutations to these regions?

ADD REPLY • link 8.2 years ago by H.Hasani ▴ 990

Entering edit mode

This is for HLA typing. So I have very polymorphic regions of different genes in one sample. I don't know yet if this is a good solution, I would like to test that.

ADD REPLY • link 8.2 years ago by agata88 ▴ 870

Entering edit mode

In that case take a look at: https://omictools.com/hla-typing-category
Software For Hla Typing Using Ngs?

ADD REPLY • link 8.2 years ago by GenoMax 148k

Entering edit mode

Which one of the program listed here would you recommend for Amplicon-seq?

ADD REPLY • link 8.2 years ago by agata88 ▴ 870

Entering edit mode

Ohh, sorry I deleted posts...can you anyhow bring them back?

ADD REPLY • link 8.2 years ago by agata88 ▴ 870

Entering edit mode

Thanks, but I am not looking for a ready to go program :) Do you know which option of bwa can I use?

ADD REPLY • link 8.2 years ago by agata88 ▴ 870

Entering edit mode

Please use ADD COMMENT/ADD REPLY to answer to earlier answers or comments to keep this thread logically structured and easy to follow.

ADD REPLY • link 8.2 years ago by WouterDeCoster 47k

Entering edit mode

Sounds like you don't want to use "right tool for the job" :-)

If you want to use bwa then go for it. You could start with smaller seeds (-k) than default, drop penalty for mismatches (-B) etc. Could try the ready made -x settings for (PacBio/Ont2d) as a start.

ADD REPLY • link 8.2 years ago by GenoMax 148k

Entering edit mode

I am little confused... I need to analyse KIR and HLA genes ...most of the programs analyse only few HLA genes, and diving into whole pipelines is more time consuming that writing own pipeline (since the procedure is not much complicated). So, I wanted to repeat this methodology:

https://genomemedicine.biomedcentral.com/articles/10.1186/gm396

and use it for all genes that I need in MY analysis.

ADD REPLY • link 8.2 years ago by agata88 ▴ 870

Entering edit mode

Yet in that paper they don't use BWA either. Why not use their methodology and indeed their available software rather than re-invent the wheel?

ADD REPLY • link 8.2 years ago by Chris Cole ▴ 800

Entering edit mode

Ah, sorry, not this paper:

here is the proper one:

http://bmcgenomics.biomedcentral.com/articles/10.1186/1471-2164-14-355

ADD REPLY • link 8.2 years ago by agata88 ▴ 870

Entering edit mode

Another example of the incomplete information that accompanies bioinformatics sections in papers. You could use bwa with just default parameters to see what you get. Perhaps the authors just used defaults and that is the reason no special mention has been made of detailed settings.

ADD REPLY • link 8.2 years ago by GenoMax 148k