Hello,
I have created a local blast database and blasted some genes against the reference using
blastn -query a.fasta -db BLASTDATABASE -max_target_seqs 5 -outfmt 6 -out output
I obtain a ~98% similarity score for the best hit. Now, I want to extract the fasta
sequence of that BLAST search together with the 200 bp flanking sequence from both start and stop sides for designing a primer.
Can you please give me a suggestion on how to do this for a single BLAST search?
Thank you.
One way to do this, given that you have start and stop coordinates of the best hit, is by using samtools.
here
a.fasta
- reference file using in blastn;Chr
- chromosome of the hit sequence; forstart
andstop
you can add 200bp to the coordinates of the best hit and extract it from the genomehere you're extracting context for the query, not the hit.
Ok. Thank you. I will try this and see if it works.
you want to extract context region for the hit or the query ?
and spoiler alert: you can't so this with a single BLAST search! (best will be two-step approach)
So, I want to extract the context region for the hit. Because the query gene is from another species and I am trying to find the homologous sequence in my reference species.