Entering edit mode
2.7 years ago
雨
▴
20
I'm working on a gene in all fish (multi-copy, pseudogene, non-existent) and blast can solve part of the problem, but you know, the blast results sometimes don't match very well at the ends, but for a large number of Genome, I want to extend the bases based on the results of the blast for manual judgment. Can you tell me how to fix it, or what software to use? I would appreciate it if you could tell me
Thank you very much for your suggestion. Now I use the blast output file in fmt6 format, and get the extended base positions at the beginning and end in bedops, but the sequence in the inbed file is not modified. Does bedops support modifying the sequence? Or do I need to use the output bed file to find my extended sequence in the fasta file? I would be very grateful if you could answer
Modify the BED intervals with
bedops --range
etc., and then follow this up with usingsamtools faidx
(search on those two keywords) and indexed FASTA to retrieve the sequence over the modified intervals. I have a convenience script here: that uses indexed FASTA andsamtools
to convert BED lines to FASTA records.Example usage:
Replace
--options ...
with command-line options, as needed.