Entering edit mode
4.8 years ago
genomes_and_MGEs
▴
10
Hey guys,
I know it's possible to use seqtk subseq to extract fasta files matching a list. Now, is there a way to extract those fasta files that don't match the exact fasta headers on the list? Thanks!
Looks like you already have a solution. In general, this boils down to list manipulation. If you have a list of all sequences and a list of those that you don't want (those to exclude), grepping the two files using a
-v
option will give you their difference that can be used in normal fashion.