Multiple columns in RNAseq count matrix?
1
1
Entering edit mode
3.2 years ago
Jeff-Gui ▴ 10

Hi, in GSE131592, each file corresponds to the counting matrix of one sample. In each file there are three columns. Does anyone know how to get the count of this sample?

The picture shows GSM3790428 as an example. No reply from the dataset author yet.

example

RNAseq • 2.5k views
ADD COMMENT
2
Entering edit mode
3.2 years ago

That is htseq count data. The three columns represent the number of reads assigned to that gene assuming the library is 1) unstranded 2) stranded in the forward direction, 3) stranded in the reverse direction.

ADD COMMENT
0
Entering edit mode

Thanks. Without checking raw reads to see how exactly the library is generated, is it reasonable to simply take the second column because of the much fewer ambiguous count than the first?

ADD REPLY
0
Entering edit mode

I'd look over more of the columns, and if the general trend is that columns 1 and 2 are nearly the same, that means the prep is stranded forward, and you should use column 2.

ADD REPLY

Login before adding your answer.

Traffic: 1552 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6