Hi,
Is there a tool to create a fasta file using the long gene length sequence (i.e., TSE to TSS) that includes all the genomic elements, including 5' UTR, exons, introns 3'UTR from hg38? And then create a new gtf file using the fasta file subset and original gtf file information?
Thank you!
This is awesome. Thank you and it worked like a charm!
Juke34 I have another question. Is there a way to extract the longest gene sequence without the introns? For example:
agat_sp_extract_sequences.pl -g infile.gff -f infile.fasta -t exon
, but I want the sequence in the gene level and not in the transcript level. Thanks a lot!Transcript level and gene level is the same when you have only one isoform. If you have several isoforms you may filter them to keep the longest with agat, or merge locations with bedtools intersect