Hello, I am new to NGS analysis. I am working on rice and interested in Gene level analysis.
I have counted the reads using HTseq and need to estimate the gene lengths.
I have a locus.gff file with the format:
chr01 irgsp1_locus gene 2983 10815 . + . ID=Os01g0100100;Name=Os01g0100100;Note=RabGAP/TBC domain containing protein. (Os01t0100100-01);Transcript variants=Os01t0100100-01
chr01 irgsp1_locus gene 11218 12435 . + . ID=Os01g0100200;Name=Os01g0100200;Note=Conserved hypothetical protein. (Os01t0100200-01);Transcript variants=Os01t0100200-01
chr01 irgsp1_locus gene 11372 12284 . - . ID=Os01g0100300;Name=Os01g0100300;Note=Cytochrome P450 domain containing protein. (Os01t0100300-00);Transcript variants=Os01t0100300-00
I also have the usual transcript.gff file. How can I extract the gene length from this file? Kindly suggest.
Hello Istvan, Thanks for your quick reply. I have a transcript.gff file and transcript_exon.gff file. I will try what you said.