Question

RNAseq normal matches using publicly available databases

3.8 years ago

geneart$$ ▴ 50

Hi, I was wondering if anyone has advice on using publicly available normal samples as matches for our tumor samples in an RNA-seq analysis. We have liver cancer tumors but lack normal tissue for some of the cases. Can we run the RNA-seq analysis using normal liver RNA-seq data from TCGA together with our tumor data? That would be 25 tumors vs. 25 normals (the normals being liver samples from different patients in TCGA).

Does this pose a problem of any kind in a pairwise analysis? The normals will obviously not be from the same patients as our tumors, but they are still normal tissue, and the molecular signatures of normal liver should be pretty much the same, shouldn't they? Is there anything I need to pay close attention to?

I have not done this type of analysis before, so please pardon my ignorance here!

Thank you for your time!

RNA-Seq TCGA DGE normal tumor • 878 views

updated 3.8 years ago by ATpoint 85k • written 3.8 years ago by geneart$$ ▴ 50

score 2 · Answer 1 · 2021-01-29

No, you cannot. Please use the search function for previous threads on that matter. The crux here is that the condition (tumor/normal) is confounded with study, so you cannot distinguish biological from technical/batch effects, and there will be plenty of the latter. For illustration, you can go through "Basic normalization, batch correction and visualization of RNA-seq data", which contains data from the exact same specimen prepared with different library prep kits. That somewhat mimics different studies. You will see that if you perform a DEG analysis within the same sample, just comparing kits, there will be hundreds of DEGs. That means your analysis would be riddled with false calls that reflect purely technical rather than biological differences. For tumor-only data it will probably come down to defining subgroups in your data, based either on clinical metadata or on clustering-based approaches, and then comparing these groups with each other.
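To make the confounding point concrete, here is a minimal simulation sketch in plain Python. Everything in it is an assumption chosen for illustration (the gene count, the noise levels, the size of the per-gene study offset, the `|t| > 4` cutoff, and the helper names `welch_t` and `count_calls`); none of it comes from the linked tutorial, and a real analysis would use a dedicated count-based DE tool rather than a t statistic on simulated log-expression. The simulated genes have no biological tumor/normal difference at all. The only thing that differs between groups is that the "normal" samples come from another study and carry a per-gene technical offset.

```python
import random
import statistics


def welch_t(a, b):
    """Welch's t statistic for two independent samples."""
    var_a, var_b = statistics.variance(a), statistics.variance(b)
    se = (var_a / len(a) + var_b / len(b)) ** 0.5
    return (statistics.mean(a) - statistics.mean(b)) / se


def count_calls(n_genes, n_per_group, batch_sd, rng, t_cutoff=4.0):
    """Count genes called 'different' although no biological effect exists.

    Each gene gets a study-specific offset drawn from N(0, batch_sd).
    The offset is applied only to the 'normal' group, because in the
    confounded design all normals come from the other study.
    """
    calls = 0
    for _ in range(n_genes):
        base = rng.gauss(8.0, 2.0)         # hypothetical log-expression level
        offset = rng.gauss(0.0, batch_sd)  # purely technical study effect
        tumor = [base + rng.gauss(0.0, 0.5) for _ in range(n_per_group)]
        normal = [base + offset + rng.gauss(0.0, 0.5) for _ in range(n_per_group)]
        if abs(welch_t(tumor, normal)) > t_cutoff:
            calls += 1
    return calls


rng = random.Random(0)  # fixed seed so the run is repeatable
no_batch = count_calls(2000, 25, batch_sd=0.0, rng=rng)
with_batch = count_calls(2000, 25, batch_sd=1.0, rng=rng)

print("calls without a study effect:", no_batch)
print("calls with a study effect only:", with_batch)
```

Under these assumptions, the run without a study offset yields almost no calls, while the run with the offset flags a large share of genes, even though tumor and normal are biologically identical in the simulation. Because study and condition label exactly the same samples, no model term can separate the two effects afterwards.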