Hi
I'm trying to make GATK UnifiedGenotyper to detect indels inserted by wgsim short reads simulator.
Broad employees just informed me that indels are not called because wgsim creates reads of Q17 and everything below 20 is thrown away.
I checked the fastq files, and indeed, all qualities are "2" which is if I understood well the ASCII translation corresponds to 17.
Is there a way to make wgsim output reads with a configurable base quality?
I thought that a trick could be to replace these "2222...2" strings by somthing like "????...?" Or is there a more elegant way to proceed?
Thanks.
Double thanks! I will try both.
Double thanks Sir! I will try both solutions...
Wait a minute: I actually used the wgsim available at https://github.com/lh3/wgsim and this produces only "2" qualities ?!
my mistake - I have a file like that I thought I generated it with wgsim - you are correct it does put in 2s. I will correct the answer
No problem. Thanks for answering. I just awked my fastq files with your file and it is doing alignment now :->... I think I will move to MAQ as it looks it offers more control. Cheers.