3.7 years ago
pavelasquezv
Hi, I hope you are well
Please, I need to create a count matrix for RNA-seq analysis. My input is in .bam format. Do you have any suggestions for doing this in R or on Linux?
This is the example:
gene_id, sample1, sample2, sample3
gene1, 203, 204, 304
gene2, 12, 324, 434
Many thanks,
All the best Alex
You can also use salmon in alignment-based mode (LINK). The only requirement is that you have a transcriptome file available to quantify against. You will then need to create the matrix using tximport (LINK).
Hi!
You can use R packages such as tximport or tximeta. The latter also lets you store metadata about the reference transcriptome used for quantification (if you quantified with Salmon). After importing the counts, tximeta creates a SummarizedExperiment object in which you can summarize your counts at the transcript or gene level, store your sample metadata, and keep the transcript/gene coordinates.
Hope it works!
Tximport will handle getting the counts from separate files into a single object in R, but it won't generate the count files.
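To make that distinction concrete, here is a minimal sketch of the "combine per-sample counts into one matrix" step on its own. It is plain Python, not tximport, and it assumes each sample already has a two-column, tab-separated table of gene ID and integer count produced by some quantification step. The function names (`read_counts`, `build_count_matrix`) and the inline sample tables are hypothetical, used only for illustration with the numbers from the question.

```python
import csv
import io


def read_counts(handle):
    """Read a two-column, tab-separated table (gene_id, count) into a dict."""
    reader = csv.reader(handle, delimiter="\t")
    return {row[0]: int(row[1]) for row in reader if row}


def build_count_matrix(per_sample):
    """Combine {sample: {gene: count}} into a header row plus one row per gene."""
    samples = list(per_sample)
    genes = sorted({gene for counts in per_sample.values() for gene in counts})
    rows = [["gene_id"] + samples]
    for gene in genes:
        # Assumption: a gene missing from a sample's table means zero reads.
        rows.append([gene] + [per_sample[s].get(gene, 0) for s in samples])
    return rows


# Hypothetical per-sample tables; in practice these would be files on disk,
# opened with open(path, newline="") instead of io.StringIO.
sample_tables = {
    "sample1": "gene1\t203\ngene2\t12\n",
    "sample2": "gene1\t204\ngene2\t324\n",
    "sample3": "gene1\t304\ngene2\t434\n",
}

per_sample = {
    name: read_counts(io.StringIO(text)) for name, text in sample_tables.items()
}
matrix = build_count_matrix(per_sample)

# Print in the same layout as the example in the question.
for row in matrix:
    print(", ".join(str(value) for value in row))
```

This only covers merging; the per-sample counts still have to come from a quantification step first, and real quantifier output (for example Salmon's estimated, non-integer counts) needs its own parsing, which is the part tximport takes care of in R.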
Many thanks, my friend!
I will look into those packages.
Kind regards, Alex