4 weeks ago
DN99
Hi all, I'm trying to figure out where I can get/make a file that contains gene names, their chromosome, position, their source, and their biotype. Like this:
chr start end gene biotype source
1 11869 14412 DDX11L1 pseudogene ensembl_havana
I'd like to get this for GRCh38 and GRCh37 - is there any resource that offers this or something similar that I can filter this information from?
I've tried searching ensembl and their FTP but I haven't found data that has this information in one place.
Once you download the Ensembl GTF for the assembly you want, you can use agat_sp_extract_attributes (https://agat.readthedocs.io/en/latest/tools/agat_sp_extract_attributes.html) from the AGAT toolkit to pull out the attributes you need.
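If you'd rather not install AGAT, the same table can be pulled from the GTF with a few lines of Python. This is only a rough sketch, not AGAT's method: it assumes an Ensembl-style GTF where "gene" rows carry gene_name, gene_biotype and gene_source attributes (other providers, e.g. GENCODE, name these differently), and the function names parse_gene_line and gtf_to_table are made up for illustration.

```python
import csv
import gzip
import re

# Matches GTF attribute pairs of the form: key "value"
ATTR_RE = re.compile(r'(\S+) "([^"]*)"')

def parse_gene_line(line):
    """Return (chr, start, end, gene, biotype, source) for a 'gene' row, else None."""
    if line.startswith("#"):          # header/comment lines
        return None
    fields = line.rstrip("\n").split("\t")
    if len(fields) != 9 or fields[2] != "gene":
        return None                   # skip transcripts, exons, malformed rows
    attrs = dict(ATTR_RE.findall(fields[8]))
    return (
        fields[0],                                        # chromosome
        fields[3],                                        # start (1-based)
        fields[4],                                        # end
        attrs.get("gene_name", attrs.get("gene_id", "")), # fall back to the ID
        attrs.get("gene_biotype", ""),                    # assumed attribute name
        attrs.get("gene_source", fields[1]),              # fall back to column 2
    )

def gtf_to_table(gtf_path, out):
    """Write one tab-separated row per gene in gtf_path to the open file 'out'."""
    opener = gzip.open if gtf_path.endswith(".gz") else open
    writer = csv.writer(out, delimiter="\t", lineterminator="\n")
    writer.writerow(["chr", "start", "end", "gene", "biotype", "source"])
    with opener(gtf_path, "rt") as handle:
        for line in handle:
            row = parse_gene_line(line)
            if row is not None:
                writer.writerow(row)
```

You would call it as gtf_to_table("your_annotation.gtf.gz", sys.stdout) (after import sys), once on the GRCh38 GTF and once on the GRCh37 one. Genes without a gene_name attribute come out with their gene_id instead.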