TCGA sample names for .idat files
1
0
Entering edit mode
8.6 years ago
igor 13k

How can I map sample names (TCGA-XX-XXXX*) to .idat filenames for TCGA data? There must be a key somewhere, but I cannot seem to be able to find it.

For example, the .idat files are here: https://tcga-data.nci.nih.gov/tcgafiles/ftp_auth/distro_ftpusers/anonymous/tumor/laml/cgcc/jhu-usc.edu/humanmethylation450/methylation/jhu-usc.edu_LAML.HumanMethylation450.Level_1.2.5.0/

The corresponding processed files have sample names: https://tcga-data.nci.nih.gov/tcgafiles/ftp_auth/distro_ftpusers/anonymous/tumor/laml/cgcc/jhu-usc.edu/humanmethylation450/methylation/jhu-usc.edu_LAML.HumanMethylation450.Level_3.2.5.0/

tcga TCGA • 3.0k views
ADD COMMENT
1
Entering edit mode
8.6 years ago
jananiharan ▴ 10

When you download TCGA data, you should get a file named file_manifest.txt. This contains the sample barcode and the corresponding .idat file name.

For eg.:

Disease Platform Type Center Platform Level Barcode File Name BRCA DNA Methylation JHU_USC HumanMethylation450 Level_1 TCGA-BH-A1FE-11B-14D-A13T-05 6026818132_R02C01_Grn.idat BRCA DNA Methylation JHU_USC HumanMethylation450 Level_1 TCGA-E9-A1NA-11A-33D-A145-05 6026818122_R05C02_Red.idat

ADD COMMENT
0
Entering edit mode

Yes, I realize there exists a file like that. Where do I find it?

ADD REPLY
0
Entering edit mode

It should be automatically generated when you download TCGA data. When you unzip the tarball, you should see this file.

ADD REPLY
0
Entering edit mode

I provided links specifically to illustrate that is not the case.

ADD REPLY

Login before adding your answer.

Traffic: 1672 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6