Dear BioStars,
I am using a piece of software (PARalyzer for PAR-clip data). The pipeline I am using requires a GENCODE format .gtf annotation. Since the official GENCODE rat annotation is not available, maybe somebody tried re-structuring the UCSC annotation to fit the GENCODE format ?
I already tried appending a second "filler" column to match the column order, but there are way more differences in the last multi-entry column. The home-made annotation with filler column didn't work (all works perfectly for GENCODE human and mouse annotation).
I would be very thankful for any tips or maybe files ;)
Have a nice weekend!
Piotr
You want Ensembl. We make the GENCODE GTFs (for human and mouse) and we make the GTFs for all our other species in the same style.
They are exactly in the same style ?
I used the newest rat Ensembl GTF, but had to make some minor changes - for example in human GENCODE gtf there is a "gene_type" not "gene_biotype" in the key:value column, exon numbers have no quotes in GENCODE GTF and the chromosome names are different (chr1 in GENCODE vs 1 in ENSEMBL).
I ended up writing a Python script in the end for making those minor changes. All seems to work now.
Best regards,
Piotr
Is this script available on GitHub? I am running into the same problem, and I also have the same problem of not wanting to spend time re-inventing the wheel. My pipeline works perfectly if I use GENCODE GTFs, but does not work if I use the Ensembl GTF for rat.