Hello,
I want to extract sequences of all members of particular gene family (including pseudogenes), together with the information about their strand. I like GeneCards (http://www.genecards.org/), but I cannot find the information about which release and coordinate system they are using. I also like biomart interface, but their latest release is GRCh37. GeneCards and biomart are definitely not using same coordinates and when I look at one particular gene, coordinates extracted from biomart look more reasonable. But even if I decide to use biomart, I will miss updates from the latest release and indeed, one of the genes I am interested in has 60bp longer sequence in RefSeq than in biomart.
Any advice how would you do this exercise? :)
BioMart is a data mining tool; it is not tied to any specific genome release. There are other implementations online which use GRCh38.