Hi Biostars,
I have a RNASeq count table at gene level from HTSeq, and I want to get the gene length table from the used gtf file (from Ensembl) only for the genes in the HTSeq table. Is there anyway or some scripts to do this ?
ENSG00000000003 0 0 1 4 0 0 0 0 0 15 2 11 0
ENSG00000000419 2 2 3 3 14 20 9 13 37 81 13
ENSG00000000460 2 5 7 5 43 11 15 11 14 71 38
....
....
this is my HTSeq table, and I want to get the length only for ENSG00000000003, ENSG00000000419, ENSG00000000460, ...
Hello,
I used the command which is in the
README.md
file to get thetranscript_length
from a gtf file like the following:For Eg: Let's say I have the transcript
transcript_002
with start and end positions (42939 49164) with the above command, I got the following output:But when I used another command which is from here
https://gist.github.com/sp00nman/10372555
I got another output:
Which is the right one?
Hi,
The first command considers the transcript length without taking into account first and the last base. Second one included that by adding +1 to it. In my opinion second one makes sense. Though the first command won't affect the results significantly.