Issue with DESeq2: Unable to create a DESeqDataSet because names in colData don't match column names in countData

0

Entering edit mode

7.0 years ago

catglen012 ▴ 10

Hello! I loaded my gene(/transcript) count matrix and labels into DESeq2:

> countData <- as.matrix(read.csv(file.choose(), row.names = "gene_id"))
> colData <- read.table(text = readLines(file.choose(), warn = FALSE), header = TRUE, sep = "," )

But when I check that all sample IDs in colData are also in CountData and match their orders, I obtain the following:

> all(rownames(colData) %in% colnames(countData))
[1] FALSE

It then checked to see how my gene_count_matrix.csv looks like and the column has MSTRG as the gene ID and it does not match the ID names found in phenotype_data.

I was wondering how do I make them both match so that I am able to generate a DESeqDataSet? My phenotype_data looks just like the data found in the nature paper

https://www.nature.com/articles/nprot.2016.095

and I followed exactly the protocol for Using StringTie with DESeq2.

RNA-Seq R DESeq2 • 3.6k views

ADD COMMENT • link 7.0 years ago by catglen012 ▴ 10

0

Entering edit mode

Please post the first line of countData and the contents of colData.

ADD REPLY • link 7.0 years ago by Devon Ryan 105k

0

Entering edit mode

countData:

gene_id    Ceratodon_GG1    Ceratodon_R40
MSTRG.9134      3497                  998

colData:

ids                 sex
Ceratodon_GG1   female

ADD REPLY • link 7.0 years ago by catglen012 ▴ 10

0

Entering edit mode

You only have 2 samples?

ADD REPLY • link 7.0 years ago by Devon Ryan 105k

0

Entering edit mode

hello! I was able to resolve the issue by fixing how the colData is arranged. Thanks for the help nevertheless.

ADD REPLY • link 6.9 years ago by catglen012 ▴ 10

0

Entering edit mode

Hi catglen012, could you mention what exactly you did to the colData to fix it? I have the same issue

ADD REPLY • link 5.7 years ago by c_u ▴ 530

Login before adding your answer.