Hi,
I have paired-end sequencing data for 15 cancer and 15 normal samples. I am doing RNA-seq analysis for these samples.
1) While performing RNAseq analysis, is it mandatory to have approximately the same number of reads between the normal and cancer samples?
2) if yes, what is the significance of it?
3) if no, what is the significance of it?
Cancer samples basic stats:
Min : 21,900,652
Max : 161,154,015
Average : 105,993,656
Stdev : 26,799,534
Normal samples basic stats:
Min : 87,393,757
Max : 121,500,267
Average : 101,632,800
Stdev : 9,609,422
Thanks, Venkateshr89 +1 for your suggestions and the paper.
1) Can you suggest any specific tool for estimating the coverage for RNAseq analysis?
2) Also, I have been provided 30 samples (15 pairs). Each pair has one cancer and one normal sample. I read somewhere in biostar regarding the replicates is important for RNAseq analysis. What is biological replicate and technical replicate?
3) Is my pair (1 cancer and 1 normal) considered as biological replicates?
You can use fastqc and also IGV browser to check the coverage.
You can further use HTseq counts and dexseq_counts.py scripts to get the exon counts.
I don't know your hypothesis so I am afraid if I can answer your question.
Thanks