3.6 years ago
asumani
Hi,
I have converted human Ensembl gene IDs to gene symbols using the AnnotationDbi package, so I now have a list of gene names like "TSPAN6", "PDE11A". Is there any tool to group genes into protein-coding and non-coding? A more detailed grouping would be fine too, but this broad grouping would already work for me.
Thanks in advance.
Get the Ensembl GTF file for human here. Uncompress it using gunzip.
Protein coding genes:
You can look at the gene_biotype field and create other groups.
Thanks for the awk pipes! But I am not looking for something like extracting information from a GTF. I assumed there is already a tool that returns the gene_biotype for a given gene symbol.
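For what it's worth, the GTF route can be wrapped into a small symbol lookup. The sketch below is in Python rather than awk and is not the code from the answer above. It assumes the GTF carries gene_name and gene_biotype attributes on its "gene" records, as Ensembl GTFs do. The function names, the toy records (made-up IDs and coordinates) and the TOYLNC1 gene are hypothetical and only for illustration. Lumping every biotype other than protein_coding into "non_coding" is a simplification, since pseudogenes and other categories end up there as well.

```python
import re

# Matches key "value" pairs in the GTF attribute column, e.g. gene_name "TSPAN6";
ATTR_RE = re.compile(r'(\w+) "([^"]*)"')


def symbol_to_biotype(gtf_lines):
    """Build a gene symbol -> gene_biotype dict from the 'gene' records of a GTF."""
    mapping = {}
    for line in gtf_lines:
        if line.startswith("#"):  # skip header/comment lines
            continue
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 9 or fields[2] != "gene":
            continue
        attrs = dict(ATTR_RE.findall(fields[8]))
        if "gene_name" in attrs and "gene_biotype" in attrs:
            mapping[attrs["gene_name"]] = attrs["gene_biotype"]
    return mapping


def group_by_coding(symbols, mapping):
    """Split symbols into protein_coding / non_coding / unknown (not in the GTF)."""
    groups = {"protein_coding": [], "non_coding": [], "unknown": []}
    for symbol in symbols:
        biotype = mapping.get(symbol)
        if biotype is None:
            groups["unknown"].append(symbol)
        elif biotype == "protein_coding":
            groups["protein_coding"].append(symbol)
        else:
            groups["non_coding"].append(symbol)
    return groups


def toy_record(gene_id, name, biotype):
    """Hypothetical helper: build one tab-separated toy 'gene' line."""
    attrs = f'gene_id "{gene_id}"; gene_name "{name}"; gene_biotype "{biotype}";'
    return "\t".join(["1", "toy", "gene", "1", "100", ".", "+", ".", attrs])


# Toy input only: IDs and coordinates are made up, TOYLNC1 is not a real gene.
TOY_GTF = [
    "#!genome-build toy",
    toy_record("TOY0001", "TSPAN6", "protein_coding"),
    toy_record("TOY0002", "PDE11A", "protein_coding"),
    toy_record("TOY0003", "TOYLNC1", "lncRNA"),
]

if __name__ == "__main__":
    mapping = symbol_to_biotype(TOY_GTF)
    print(group_by_coding(["TSPAN6", "PDE11A", "TOYLNC1", "NOT_IN_GTF"], mapping))
```

For a real file, any iterable of lines works, so a compressed GTF could be passed as gzip.open(path, "rt") without running gunzip first. Symbols that are missing from the GTF land in "unknown" instead of being silently dropped.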
You could use Ensembl BioMart to get that information then.
Please post example input, expected output, and the code you have tried (if any).