Analyzing RNA-seq data with Salmon
1
0
Entering edit mode
21 days ago

Hello everyone, I want to use the SALMON tool to quantify some RNA-seq data and then perform a differential expression analysis with DESeq2 and edgeR. While reading the Salmon manual, a question arose regarding the input needed to generate the index for aligning my reads. It mentions that a reference transcriptome should be used. My question is whether I can use the files available on NCBI, which are in the formats "rna.fna.gz", "rna.gbff.gz", "rna_from_genomic.fna.gz". I ask this because some tutorials specify a file format like ".cdna.all.fa.gz", as mentioned in the following link https://combine-lab.github.io/salmon/getting_started/. I would appreciate it if someone could take the time to respond.

quantifying RNAseq data countmatrix Salmon • 246 views
ADD COMMENT
1
Entering edit mode
21 days ago
GenoMax 145k

Short answer is yes. You should double-check the file to make sure that it contains transcripts/CDS (for prokaryotes). There was a recent example (prokaryotic) where a file from NCBI labeled rna only contained rRNA/tRNA.

ADD COMMENT
0
Entering edit mode

thanks!, I will check if my files contain cdna

ADD REPLY

Login before adding your answer.

Traffic: 2376 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6