Question

How to transfer gff3 annotation from scaffolds to reference genome?

0

Entering edit mode

5.7 years ago

Bea • 0

Good afternoon everyone,

I am doing genome skimming to retrieve microsatellite loci from my genomic dataset.

To start, I filtered the raw Illumina reads (2x150bp) with Trimmomatic and then did an assembly de novo with SPAdes. Then, I mapped the SPAdes scaffolds to the reference nuclear genome for my study species. Using the scaffolds that mapped to the reference, I ran msatcommander with which I identified microsatellite loci.

From the msatcommander output, I know the start/end position within scaffolds of my microsatellite loci, so I created a gff3 file. I can visualize the microsat annotation in IGV when opening the the scaffolds file as genome and the gff3 file as track. The gff3 file is ok.

I now want to translate the microsatellite loci coordinates relative to my scaffolds to coordinates relative to the whole genome I am using as a reference. Remember that I mapped my scaffolds to the reference already, so I think it is quite an easy step, but by looking around I could not figure out a quick way out of it.

I checked this answer and I think it is really close to what I need, but not exactly.

If you have any input on this, that would be great! Thank you, Bea

Genome Skimming Microsatellite Annotation • 1.4k views

ADD COMMENT • link 5.7 years ago by Bea • 0

0

Entering edit mode

Maybe flo, which helps running UCSC liftover, might be helpful? https://github.com/wurmlab/flo

ADD REPLY • link 5.7 years ago by jean.elbers ★ 1.7k