I have a list of gene id's and I want to match those with gene names and source (lincRNA
, antisense
, protein_coding
, ...) from Ensembl gtf file. For example this is a small part of the list as shown:
Gene id strand
ENSG00000242959 1
ENSG00000160396 -1
ENSG00000229494 1
ENSG00000230262 -1
ENSG00000229240 -1
ENSG00000223569 1
I got help before by using awk command to match gene id with gene name, so how can we include the source with them
Hi Geek,
Komal helped me by using the following awk command
and it works very well and I merged the result file with my file using R, So I wounder if we can add the source column in this command.
M K I have replied to you on the previous question. Also, do not duplicate your posts.