Entering edit mode
4.8 years ago
vishalchanda364
▴
20
Hello, I have been working on a metagenomic data, and now trying to remove ribosomal protein using the blastp command in BLAST+, I want to retain the sequences which doesn't map to the reference. Can anyone tell me how can I do that so that all the mapped sequences Bacterial gets removed and I am left with unmapped sequences which are possible Viral sequences.
The command I used for blastp is;
blastp -db cog -query ../../../Prodigal/F_V_0_29_R.faa -out ../../../blast/output_1 -evalue 1e-10
Thanks in advance, Vishal
Parse entries that don't have hits. You can then fish those sequences out of your file. Here are some options for that: Extract Sequence From Fasta File Using Ids From a separate txt File in linux
Thank You so much for the help.