Hello,
I have 12 De-novo assembled fungal genome assemblies.
Background:
- Assemblies are generated from QC-performed Nanopore data.
- Later polished via Racon and Pilon using illumina data.
- Contigs < 2000bp removed and assembly sorted accouding to size using
funannotate clean and sort
I want to perfrom contamination analysis. So i checked the NCBI Foreign Contamination Screen (FCS)
tool. But it required alot of computational resources.
My question:
- Is contamination analysis necessary? (kindly share your thoughts)
- Which tools or pipelines are good for contamination analysis. (if you can direct me toward tutorials)
Thank you. Happy New Year.
I would say no. There should be no contamination if the experimental component of study was rigorously done. There is little chance you added contamination during the analysis phase. If there is possibility of contamination in your data then that should have been addressed before you started assembling the data.
Thank you for clarification.
One other question a bit unrelated. Everytime i post a question i wait for your response. I personally believe that you have more knowledge and experience in genome assemblies. Is there any way i can follow and look at your work. may be os other platforms like Google scholar or anywhere else. If you feel ok in sharing.
I agree that generally you shouldn't need to do a contamination analysis. I added one into a pipeline for my last lab because the quality of samples we received from partners varied a lot. So, if you notice your assemblies are more fragmented than anticipated or find unexplained patterns in data then it can be useful to try and clean up that mess.
I liked using metagenomic tools like Kraken2 for contamination.