BLAST pipeline operations
1
0
Entering edit mode
8.2 years ago
skbrimer ▴ 740

So, I have been reading a lot about host removal with viral data. More than one group is just using BLAST to their specific host to remove possible contamination.

How does that work? I know how to download a genome from NCBI. I have ncbi tools on my machines and can create a custom db using only the my host(s) of choice. However I'm fuzzy on how that actually removes them from the read pool.

Can BLAST take all of your reads and only output the reads that have no match or is it more of piping the results to a file and removing everything that has a good match via a script?

I understand BLAST is a slower way of doing this so what would be the advantage of this say over BBsplit (from the BBMap) that can map to multiple references at once? Or just concatenating all the host/viral dna/rna into one file and mapping to it?

Assembly QC • 1.4k views
ADD COMMENT
2
Entering edit mode
8.2 years ago
Prasad ★ 1.6k

if you are looking for contaminated read removal from the data, try DeconSeq

ADD COMMENT
0
Entering edit mode

Thanks, that looks interesting :)

ADD REPLY

Login before adding your answer.

Traffic: 2499 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6