Hello,
I want to analyze multiple files from the TCGA-BRCA project downloaded from the GDC portal. However, I have some difficulty using different data from the same samples.
For example, a case ID TCGA-E2-A1IU has proteome profiling and DNA methylation data. The problem is that the methylation file has its unique name (e2ae1cb4-0422-4c7d-9878-ba8449afaacd.methylation_array.sesame.level3betas.txt), and other IDs or barcodes do not exist in the file.
I searched other files for the case, such as biospecimen or clinical data files, but I could not find any information that maps between case IDs and file IDs. I may write all of the case ID and file name pairs one by one, but it may be inefficient.
I would appreciate it if you could teach me any method to solve the problem.
Thank you.
Thank you so much! I think I can try both the sample sheet and the R package.