Entering edit mode
2.5 years ago
davidmaimoun
▴
50
Hello,
If I have a genome assemblies from isolates, and I want to know what it is. More accurately, I need to know the isolates are Neisseria or H.Influenza, or not. I would like to query them against Refseq database to see if there are matches. Blast doesn't works good if I query a whole genome right? What is the better way to do that?
Thank you!
As a first approach try TYGS
Thank you Andres
I tried it but it is very slow
Hi David,
how many genomes do you have?
I have 8, We need to use tool that give an answer rapidly to know what to do with the patients The TYGS gave me a pretty good answer though, thank you for the tip. It take time but identified the right specie
May I ask why you are running assemblies first, if speed is paramount? You could e.g. run StrainPhlAn3 directly on the raw reads...you would need to run it with the
--add_viruses
parameter to include Influenza, though.If all that interests you, is if Neisseria and/or Influenza or neither is contained in an isolate, you can certainly speed up your analysis by building a custom reference set with just those and a few controls and then use a kmer-based alignment-free classification.
I thought it would world only on assemblies.
Thank you very much for your help Matthias, this tool is great!