Hi fellow humans,
I'm rather new in this field and I am wondering how I can get the desired information from my gtf file in an elegant way.
I want only the columns 1,3,4,5 and 9(the gene_ID)! 1,3,4,5 is easy, but I'm lacking an efficient coding approach to add also only the gene_id"xy" for each row...
The aim is to write it into a simple .txt file with 5 columns, you know. I'd appreciate any help. The gtf Input looks the following
NC_004593.1 RefSeq exon 1 68 . + . gene_id ""; transcript_id "unknown_transcript_1"; anticodon "(pos:31..33)"; gbkey "tRNA"; product "tRNA-Phe"; exon_number "1";
May want to try this tool: Extracting genomic feature sequences from GTF/GFF files with AGAT