Entering edit mode
5.8 years ago
deeksha3696
▴
10
I am doing an RNAseq analysis. (using edgeR) I am loading my data for differential analysis, my data is of normalized gene expression(FPKM). I am now just trying to make DEGList object but, it is showing "Negative counts not allowed"
.
I have removed all negative values from the data, it is still showing the error(( Negative counts not allowed).
Please suggest what should i do. thank you!!
Command:
dgeFull <- DGEList(y, group = sampleInfo$condition)
Error: Negative counts not allowed
thanks from where can i get the raw counts
Download the data from NCBI or directly from ENA ( Fast download of FASTQ files and metadata from the European Nucleotide Archive (ENA) ).
From where did you get the fpkm counts...?
We are bad at reading your mind so you'll have to elaborate.
I downloaded the fpkm values from NCBI GEO DATA SET
FPKM values are not suitable for statistics, this is a recurrent discussion. See also edgeR user's guide, end of 2.3 on page 12. You need raw read counts, if they don't provide them at GEO, you can redo the analysis:
download fastq files -> align them to reference e.g. with STAR -> counts reads with e.g. featureCounts
While I don't completely agree that FPKM can't be used for any statistics (and I think it can be useful for visualization), it is true that you need raw counts for edgeR (and most RNA-Seq Bioconductor differential expression packages).
I also agree re-processing raw data can be frequently be helpful. Even if you end up with a similar result, you should understand the analysis process better and be more confident in your results :)
Please add a link to the dataset you are using. It may be necessary to download the reads and process those yourself.