I'm searching about best practices for differential expression analysis, and I found that paper https://www.ncbi.nlm.nih.gov/pubmed/24300110 (the most closely paper related to my questions)
But, that paper talk about the methods of doing the differential expression analysis starting with an input with expression data, like the input for DESeq package, right?
But, I'm thinking about the whole project, for example:
If I have only a few sequences (e.g. genes) and not the whole assembled genome, Can I make a differential expression analysis?
Which is the best tool to make it? (considering the situation above)
It's necessary a kind of normalization before the gene expression analysis?
With my knowledge, I designed an experiment like this:
- Sequence reference: A fasta file with nucleotide information of 5 genes.
- RNA-seq libraries: fastq files from RNA-seq experiments with the following conditions: Control, Treatment 1, Treatment 2, Treatment 3.
- Mapping: Bowtie2
- Output conversion using bam files information to make a table with a count of alignments of each mapping analysis.
- DESeq analysis using as input the output created in the previous step.
It's is that? I have no idea if a simple mapping analysis with bowtie using just the sequence of genes can be used to infer gene expression difference.
Best,
With only 5 genes of interest, why aren't you using qPCR?
because I'm working with public data with a lot of different species....
So does that mean you're not actually going to perform the sequencing yourself, but you're going to download data that other people have sequenced and deposited in a public repo?
Your working with lots of public NGS data with only 5 genes?
A couple of points; 3 replicates is the bare minimum. DESeq uses information from all the genes to estimate dispersion, that step might be a little strange with only a handful of genes being measured.