Fasta to gff with custom set of genes
1
0
Entering edit mode
7.6 years ago
tlorin ▴ 370

Dear all,

I have a custom list of 100 genes that I manually curated to obtain the full CDS and I would like to make differential expression (DE) analysis between samples for this very subset. I now I cannot simply map all the reads onto this subset and perform DE analysis because I would have normalization bias (using DESeq2 or edgeR), so I need to map all the reads on the whole genome.

Fortunately, I also have the raw sequence of a genome (multifasta file) as long as an automatic annotation – and the corresponding GFF file. The problem is that this annotation is not good enough of the 100 curated genes.

My plan was (1) run BLAT to get the exact genomic coordinates of my manually curated set of genes (2) merge the newly obtained GFF with the first (automatic and non-curated) one with Cufflinks gffcompare or and (3) run DESeq2 using this new annotation.

Would any of you have any suggestion regarding this protocol or any alternative tools to suggest?

Many thanks!

gff fasta RNA-Seq genome deseq2 • 1.7k views
ADD COMMENT
1
Entering edit mode
7.6 years ago

now I cannot simply map all the reads onto this subset and perform DE analysis because I would have normalization bias (using DESeq2 or edgeR), so I need to map all the reads on the whole genome.

I don't agree completely. If your reads are RNASeq reads and you map them against a transcriptome, it shouldn't be a problem.

Would any of you have any suggestion regarding this protocol or any alternative tools to suggest?

A suggestion would be to try using GMAP with GFF output, so you map the sequences to the genome and get a GFF as output automatically. It's really handy.

ADD COMMENT
1
Entering edit mode

I don't agree completely. If your reads are RNASeq reads and you map them against a transcriptome, it shouldn't be a problem. Definitely! Was just saying that you have to map your reads to a whole genome or transcriptome, not to "simply" 100 or 200 genes.

ADD REPLY

Login before adding your answer.

Traffic: 2767 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6