Grouping TCGA samples by tumor_tissue_site: Can some groups be merged to decrease the number of groups?
1
0
Entering edit mode
18 months ago

I am working with TCGA data and I want to group the samples based on the tumor_tissue_site field. However, some of the groups are quite specific and merging them together may be reasonable to decrease the number of groups. For example, the following groups could be merged: "'Chest - Breast', 'Chest - Chest wall', 'Chest - Lung/pleura', 'Chest - Mediastinum', 'Chest - Other (please specify)' and "'Head and Neck', 'Head and Neck - Head', 'Head and Neck - Head|Chest - Chest wall', 'Head and Neck - Neck|Head and Neck - Other (please specify)', 'Head and Neck - Other (please specify)'".

I would appreciate it if someone could explain the difference between these sites and whether it would be reasonable to merge them together. Specifically, I am interested in knowing whether merging these groups would result in a loss of information, and whether it would be appropriate for my analysis?

Grouping TCGA • 1.2k views
ADD COMMENT
1
Entering edit mode

Look at the human body plot at the right side of the GDC data portal https://portal.gdc.cancer.gov/, is this grouping what you are looking for?

ADD REPLY
0
Entering edit mode

Yes, exactly. However, when grouping the data by the 'tumor_tissue_site' column, I've encountered numerous additional groups that I'm unsure how to merge. Are there any other columns in the TCGA clinical data metadata that could be considered as tumor source sites?

ADD REPLY
1
Entering edit mode
18 months ago
Zhenyu Zhang ★ 1.2k

primary site?

You can check properties in https://docs.gdc.cancer.gov/Data_Dictionary/viewer/

ADD COMMENT

Login before adding your answer.

Traffic: 2481 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6