Entering edit mode
6.5 years ago
shivangi.agarwal800
▴
120
Hi
I have list of transcripts (~70,000) with their expression values in cancer and normal adjacent samples, I want to calculate p-value for each pair of transcripts. How to do that? Thanks in advance.
Regards
You should provide further information, such as:
Nobody can help you sufficiently with the information that you have currently provided.
OK, thanks for the information. The data is raw TCGA expression data and contains transcript id, expression value (TPM) in cancer and adjacent normal samples
Hello again . With this data, you cannot produce a p-value per gene. With your data, you just have a single value for each gene in the tumour and normal samples. Can you elaborate (describe further) the source of the data? Many third-party websites (i.e. outside of the National Cancer Institute of the USA) host TCGA data, which is in various stages of processing.
Hi The data is taken from TCGA by cancerrna nexus and we have got from there. Can I apply student's t test for the same?
No, with the data that you have, you cannot use the Student's t-test. For example, if you wanted to derive a p-value for the uc001aaa.3 gene, your comparison would just be 0.519993743 Vs. 2.07946602736613E-95. A p-value cannot be derived from just 2 values.
From Cancer RNA-seq Nexus, you should try to obtain the expression values for the genes across all tumours and all normal samples. Then, you could begin to think about conducting differential expression analysis.
If you do not have much experience with bioinformatics, then can I suggest that you reach out to (that is, contact) a local collaborator (in your university / college, or some other), and ask them for assistance.
Also, there are web-based GUIs that allow you to analyse TCGA data, such as cBioPortal
If you have count data, try following one of the RNA-seq expression tutorials online. Here's a good one to start with: https://f1000research.com/articles/4-1070/v1
I want to do it using t-test or any other statistical test in spss.
Those tests are not appropriate for expression data.