i need a gene annotation file for the mm39 mouse genome in the genePred format. I found that there is a utility which can convert the information from the gtf format. However, where I would download the gtf file it says that it was created by a conversion from the genePred format.
Of course I could still reconvert it to genePred, however this seems wrong. Is there an official source for the genePred format where I can directly download it from?
Assuming that you are interested in RefSeq annotations, you have the following options:
Use UCSC Table Browser and download the table. You can use the settings shown in the screenshot below. More information about the data included in this track can be found here. In particular, note that there is an additional bin column that you may have to remove before downstream processing.
Download RefSeq annotation in GFF3 format from NCBI and use the UCSC utilitygff3ToGenePred to generate the GenePred file:
thanks a lot. The bin column was actually something I needed so the conversion with gff3ToGenePred or gtfToGenePred did not create the exact format required downstream.
thanks a lot. The bin column was actually something I needed so the conversion with gff3ToGenePred or gtfToGenePred did not create the exact format required downstream.