I am trying to obtain stats on how many 3'UTRs are annotated in Ensembl. I would like to download as many annotated 3'UTRs as possible from as many species as possible and compute some basic stats on them (length and so on). For these purposes, it's not even really necessary to download FASTA sequences if I can avoid it.
I cannot figure out how to do this in a non-manual way. So far I have been going into BioMart and querying each species database individually, which took me forever for just a handful of species.
I am aware of the Perl API, but I don't code in Perl, so I'm really lost on how to formulate my query in that language (I usually use Python).
I currently work on an HPC cluster, and my laptop doesn't have a lot of memory, so I would really like to do this from the command line if that's possible.
Also, I'm not sure whether this might be possible using the UCSC Genome Browser. I tried, but again I didn't know how to apply my query to multiple species.
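To make the "basic stats" part concrete, this is roughly the per-species summary I have in mind, as a minimal Python sketch. The function name and the column names (`transcript_id`, `utr3_start`, `utr3_end`) are placeholders I made up, not actual Ensembl or BioMart field names, and I'm assuming 1-based inclusive coordinates with one row per UTR segment. What I can't work out is how to get a table like this for every species without going through BioMart by hand.

```python
import csv
import io
import statistics
from collections import defaultdict


def utr3_length_stats(tsv_text):
    """Summarise total 3'UTR length per transcript from tab-separated text.

    Assumed (hypothetical) header: transcript_id, utr3_start, utr3_end.
    Each row is one UTR segment; coordinates are 1-based and inclusive.
    """
    lengths = defaultdict(int)
    reader = csv.DictReader(io.StringIO(tsv_text), delimiter="\t")
    for row in reader:
        # Rows with empty coordinates are transcripts without an annotated 3'UTR.
        if not row["utr3_start"] or not row["utr3_end"]:
            continue
        start, end = int(row["utr3_start"]), int(row["utr3_end"])
        # Sum segment lengths so a UTR split across exons is counted once.
        lengths[row["transcript_id"]] += end - start + 1

    values = list(lengths.values())
    if not values:
        return {"n": 0}
    return {
        "n": len(values),
        "min": min(values),
        "max": max(values),
        "mean": statistics.mean(values),
        "median": statistics.median(values),
    }


if __name__ == "__main__":
    example = (
        "transcript_id\tutr3_start\tutr3_end\n"
        "T1\t100\t199\n"
        "T1\t300\t349\n"
        "T2\t\t\n"
        "T3\t10\t59\n"
    )
    print(utr3_length_stats(example))
```

Ideally I would run something like this once per species on the cluster and collect the results into a single table.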