I have isoform expression file of human cancer from TCGA. I want to map all this isoform to gene, in other word, I want to group the isoform which belong to one gene together. the ID of isoforms belong to UCSC. Does anybody knows how can I do this ?
isoform_id normalized_count
uc011lsn.1 0.0000
uc010unu.1 20.1848
uc010uoa.1 7.1561
uc002bgz.2 36.1698
uc002bic.2 0.0000
uc010zzl.1 188.5822
uc001jiu.2 1085.9445
uc010qhg.1 0.0000
uc011krn.1
In UCSC genome browser itself u can output the ucsc_ids as well as gene names . try to see the tablebrowser and the output options :)
Normally one could use biomart (or the biomaRt Bioconductor package). However, none of these IDs seem to be included in that :(