Hello, I am new to bioinformatics and I am working with a GTF file. Its lines look like the following:
1 StringTie transcript 463572 478996 1000 - . gene_id "STRG.1"; transcript_id "STRG.1.1"; reference_id "ENSBTAT00000015319"; ref_gene_id "ENSBTAG00000011528"; ref_gene_name "SMIM11A"; cov "7.464981"; FPKM "8.923761"; TPM "15.880815";
1 StringTie exon 463572 463746 1000 - . gene_id "STRG.1"; transcript_id "STRG.1.1"; exon_number "1"; reference_id "ENSBTAT00000015319"; ref_gene_id "ENSBTAG00000011528"; ref_gene_name "SMIM11A"; cov "2.531429";
I want to keep only the transcripts with an FPKM value greater than 2. Please help me.
C: filtered tab delimited file with awk
You can do that in Bash or Python, for example. Here are the steps:
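In Python, for instance, a minimal sketch could look like the one below. It assumes the file is tab-delimited with the attributes in the ninth column, as in standard GTF, and that FPKM appears as `FPKM "value";` as in the lines shown in the question. The function name `filter_transcripts_by_fpkm` and the file name in the usage comment are hypothetical, chosen only for illustration.

```python
import re

# Matches the FPKM attribute as it appears in the question, e.g. FPKM "8.923761";
FPKM_RE = re.compile(r'FPKM "([^"]+)"')


def filter_transcripts_by_fpkm(lines, min_fpkm=2.0):
    """Yield transcript lines whose FPKM is strictly greater than min_fpkm.

    Assumes tab-delimited GTF lines with the attributes in column 9.
    """
    for line in lines:
        if line.startswith("#"):
            continue  # skip header/comment lines
        fields = line.rstrip("\n").split("\t")
        # Only "transcript" features carry FPKM in the example; exon lines do not.
        if len(fields) < 9 or fields[2] != "transcript":
            continue
        match = FPKM_RE.search(fields[8])
        if match and float(match.group(1)) > min_fpkm:
            yield line


# Example usage (hypothetical file name):
# with open("stringtie.gtf") as handle:
#     for kept in filter_transcripts_by_fpkm(handle, min_fpkm=2.0):
#         print(kept, end="")
```

Note that this keeps only the transcript lines. If you also need the exon lines of the retained transcripts, you would have to collect the passing `transcript_id` values first and then select the matching exon lines in a second pass.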