Where to find all genes and their biotypes?
1
0
Entering edit mode
4 weeks ago
DN99 ▴ 20

Hi all, I'm trying to figure out where I can get/make a file that contains gene names, their chromosome, position, their source, and their biotype. Like this:

chr    start    end    gene    biotype    source
1    11869    14412    DDX11L1    pseudogene    ensembl_havana

I'd like to get this for GRCh38 and GRCh37 - is there any resource that offers this or something similar that I can filter this information from?

I've tried searching ensembl and their FTP but I haven't found data that has this information in one place.

biotype genes ensembl • 263 views
ADD COMMENT
1
Entering edit mode
4 weeks ago
ATpoint 86k

This is all summarized in a GTF file, for example at https://www.ensembl.org/Homo_sapiens/Info/Index

Press "Download GTF", and then download the file. It contains exactly what you're asking for.

ADD COMMENT
0
Entering edit mode

Once you download the GTF you can use https://agat.readthedocs.io/en/latest/tools/agat_sp_extract_attributes.html from AGAT toolkit to get the info.

ADD REPLY

Login before adding your answer.

Traffic: 1878 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6