Hi,
I need to quantify gene expression by salmon so I need Ensembl gene annotation GRCh37.87 likely in fasta
I tried ftp://ftp.ensembl.org/pub/grch37/current/fasta/homo_sapiens/ but not working
Do you know know from where I can downlioad such file for salmon?
Thank you
Ensembl gene annotation for GRCh37.87 would be in GTF or GFF3 not FASTA.
edit: ftp://ftp.ensembl.org/pub/grch37/release-87/
Sorry but in Salmon manual says in fasta
https://salmon.readthedocs.io/en/latest/salmon.html
If Salmon needs the FASTA sequences for the transcripts, then you can do the follow these steps (http://ccb.jhu.edu/software/stringtie/gff.shtml#gffread_ex) using the GRCh37 reference from GENCODE and getting the GTF file from possibly from ftp://ftp.ensembl.org/pub/grch37/release-87/gtf/homo_sapiens/Homo_sapiens.GRCh37.87.chr.gtf.gz. This assumes your BAM files are alignments to the main assembly and not including the alternative haplotypes or patches.
I have bam files I have alignment come from
GRCh37_g1k
by STAR; Now I need to quantify raw counts; I have done that by featurecounts but I have a lot of strange features so I decided to used Salmon. Thank you anyway but I don't know why I can not open these linksSo not I though to convert my bam to fastq
When I used this command in Salmon I obtained this error
Finally I used gffread but I am getting this error
have you tried
wget ftp://ftp.ensembl.org/pub/grch37/release-87/gtf/homo_sapiens/Homo_sapiens.GRCh37.87.chr.gtf.gz
?Perhaps your chromosomes are in the format
>1
for chr1 instead of the gencode>chr1
format?I just tried ftp://ftp.ensembl.org/pub/grch37/current/fasta/homo_sapiens/cdna/Homo_sapiens.GRCh37.cdna.all.fa.gz, it worked for me. This is the file that I think you need for Salmon.