I downloaded ensGene.txt
file from UCSC here and wanted to create custom GFF annotation for MISO. I can generate the GFF annotation using the entire ensGene.txt file but I am only interested to generate for a small subset. Then I create a subset by grep such like:
$grep ENSG00000141510 ensGene.txt > ensGens_sub.txt
But I got error when I rerun the rnaseqlib
program:
Making GFF alternative events annotation...
- UCSC tables read from: /data/projects/JingChen-TCGA/Alternative_splicing/Data/UCSC/local
- Output dir: /data/projects/JingChen-TCGA/Alternative_splicing/Data/miso_annotation/tp53
Traceback (most recent call last):
File "/usr/local/lib/python2.7/dist-packages/rnaseqlib/gff/gff_make_annotation.py", line 65, in <module>
main()
File "/usr/local/lib/python2.7/dist-packages/rnaseqlib/gff/gff_make_annotation.py", line 61, in main
make_annotation(args)
File "/usr/local/lib/python2.7/dist-packages/rnaseqlib/gff/gff_make_annotation.py", line 29, in make_annotation
raise Exception, "No UCSC tables found in %s." %(tables_dir)
Exception: No UCSC tables found in /data/projects/JingChen-TCGA/Alternative_splicing/Data/UCSC/local.
It says "No UCSC tables found in /data/projects/JingChen-TCGA/Alternative_splicing/Data/UCSC/local.".
Does anybody know the reasons and how to correctly subset the UCSC table like ensGene.txt?
Did you make sure the table you are using as input for your program is present at
/data/projects/JingChen-TCGA/Alternative_splicing/Data/UCSC/local
?do following and check