Entering edit mode
10.3 years ago
Joey
▴
430
I downloaded the NCBI GRCh37 gene list from https://bitbucket.org/baderlab/fast/downloads.
What is the best way to find out how many are pseudogenes in this list? I tried Biomart but I wasn't able to do a full merge.
Thanks,
-Joey
Let me slightly update this answer with recent information from GENCODE V31 (July 2019, http://genome.ucsc.edu/goldenpath/newsarch.html ) which contains 18.536 annotated pseudogenes (not including polymorphic pseudogenes, according to documentation)
Specify output file name and check chrom, cdsStart, cdsEnd and name2 in the required fields to get the desired .bed file