Gene symbol for original TCGA data ...help!
2
0
Entering edit mode
7.2 years ago
elizabethR ▴ 70

Hi all

I have been on maternity leave for a year prior to which I was analysing TCGA RNASeq cancer data, which I downloaded, collated and analysed. I have been trying to run some new analyses now IM back at work but the gene symbol names have changed that Im looking at! I tried to run GSEA in WebGestalt and found a lot that couldnt be mapped and I assume this is because some gene symbols have changed. Now I am in a conundrum. I cant merely access a file of different gene ID types because these contain the new gene symbol IDs so I cannot programme it to look them up.

Has any one had this problem? And if so how did you overcome it? I was thinking Id need an archive gene ID conversion file so I could change the gene ID from gene symbol to something like ensembl and run that instead in my GSEA. Id like to avoid redownloading and reprocessing the data if possible!

Many thanks in advance

TCGA genesymbol • 3.5k views
ADD COMMENT
0
Entering edit mode

Can you post examples of what you are referring to? TCGA data moved to the new portal while you were gone and perhaps that may have something to do with this.

ADD REPLY
0
Entering edit mode

Yes, FAM38A is now Piezo1 for instance. When I ran the data through WebGestalt it couldnt map 1338 of the genes and eyeballing them it looks like their gene symbols have changed.

ADD REPLY
2
Entering edit mode
7.2 years ago
GenoMax 147k

I think your best bet is to use the multi-symbol gene name checker tool provided by HGNC. That should help map all old gene names.

ADD COMMENT
0
Entering edit mode
7.2 years ago
elizabethR ▴ 70

Thank you that's incredibly helpful!

ADD COMMENT

Login before adding your answer.

Traffic: 2748 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6