Dear all
Hi
I downloaded a data set in class "RangedSummarizedExperiment" from "https://jhubiostatistics.shinyapps.io/recount/". I want to find differential expression genes. My code is :
load("~/Downloads/rse_gene.Rdata")
class(rse_gene)
**[1] "RangedSummarizedExperiment"
attr(,"package")
[1] "SummarizedExperiment"**
data=colData(rse_gene)
names= names(colData(rse_gene))
write.table(data,file="colData.csv", col.names=names,sep="\t",row.names=FALSE)
data1=fread("~/Downloads/colData.txt")
colData(rse_gene) =DataFrame(data1)
colData(rse_gene)$disease.status = as.factor(colData(rse_gene)$disease.status)
dds <- DESeqDataSet(rse_gene, design = ~ disease.status)
**converting counts to integer mode Error in validObject(.Object) :
invalid class “DESeqDataSet” object: NA values are not allowed in the count matrix In addition: Warning message: In mde(x) : NAs introduced by coercion to integer range**
I use two solution for this problem but both of them were useless:
1- Keep only rows with non-zero counts:
rse_gene <- rse_gene[rowSums(assay(rse_gene)) != 0, ]
2- Replace the NA value by -9 :
countdata <- assay(rse_gene)
replace(countdata,countdata==0,-9)
coldata <- colData(rse_gene)
ddsMat <- DESeqDataSetFromMatrix(countData = countdata, colData = coldata, design = ~ disease.status)
I will be appreciate if any one can help me!!
Try this:
Btw, what is the accession number of the data?