Hi guys,
I feel like this should be super obvious, but I am struggling to find anywhere online the steps to figure out what machine a data set came from. The files have a .count
file extension on them, and are filled with ENSEMBL IDs followed by the count. an example:
ENSMUSG00000000001.4 3906
ENSMUSG00000000003.11 0
ENSMUSG00000000028.10 1005
ENSMUSG00000000031.11 4766
ENSMUSG00000000037.12 20
ENSMUSG00000000049.7 0
ENSMUSG00000000056.7 775
ENSMUSG00000000058.6 546
ENSMUSG00000000078.6 1075
Is there a way to determine the machine used based on that?
Thanks
What do you mean by machine?
If you are talking about sequencing machine, no it's not super obvious. After processing a lot your RNA-seq data looks like what your handling now. That's a raw counts file. I'm guessing
featureCounts
was applied to calculate raw counts (if you meantmachine=tool
).That's probably not possible. Looks like a file generated by htseq-count/featurecounts/... which will only contain counts per gene/transcript and no meta-data on the run. Those should be in the fastq and/or bam file.