Hi all,
Can someone tell me why the TCGA patient numbers are different when comparing GDC data portal to firebrowse? The overall patient numbers are more or less identical. However, when looking at mRNA-seq data the patient numbers differ.
E.g. HNSC, Firebrowse shows 520 patients with mRNA-seq data whereas GDC shows 501 patients. Why is that? Sometimes it's even the other way around as with OV data.
Thanks for your input!
Thanks for your comment. I speculated that it would be something like that. This would explain that in some instances GDC has less patients than Firebrowse, which is also mainly the case. However, OV and GBM have indeed more patients with mRNA-seq in GDC. Does firebrowse also filter data? Do you know that?
From the link link, I think firebrowse will also do some filter. But almost will be retained.