Hi,
I've got RNA-seq data from TCGA. I have gene expression level and also isoform expression level. I want to know how can I map the isoform ID of transcripts to Entreize gene ID.
my isoform IDs looks like as follow:
isoform_id normalized_count
uc011lsn.1 0.0000
uc010unu.1 20.1848
uc010uoa.1 7.1561
uc002bgz.2 36.1698
uc002bic.2 0.0000
uc010zzl.1 188.5822
uc001jiu.2 1085.9445
uc010qhg.1
I would normally recommend either BioMart or the UCSC Table Browser for this task. But before we go any further: none of those isoform IDs appear to be valid? I found some corresponding Entrez IDs from this mailing list and those IDs are not valid either, having been replaced.