Hello. I need to get information on the genomes belonging to the Pangolin lineages of coronavirus. I have one reference strain as fasta file, and two strains as fastq file. I filtered them, aligned them to the reference and did variant calling. My advisor said that I MUST assemble the genomes (I don’t understand what this means to him). But the goal is to find information about two strains on https://cov-lineages.org/resources/pangolin.html. I guess I could do this by getting fasta files from VCF with accepting variant calling, but don’t know how to do it.
I would greatly appreciate any help.
First, I agree with Pierre you are not trying hard enough to follow up. I thought I had given extensive advice already. If your supervisor wants an assembly they likely mean de novo assembly. Anyway, why don't you just ask them what they want? Just using a tool because it can produce some output in a certain format without understanding what it does is the kind of black-box bioinformatics that leads to more problems than it solves.
Processing fastq files for genome assembly MCP counter's problems SSD or HHD for genome analysis