Hello all,
This is probably a pretty simple question, but I've never done an analysis with transgenic expression involved before and I want to make sure I do it correctly. Basically, I have RNA-seq data, a genome and annotation files for the host, and the sequence of a transgene and its expression vector. The transgene is exogenous and therefore not in the host genome. Mapping the reads to the host transcriptome is straightforward enough, but what would be the best method by which to get counts data for the exogenous transgene (along with any portions of the vector itself which may be present)?
Thanks,
Erik
Alternatively, you can take the unmapped reads from the host genome and map it to a reference genome of your expression vector.
Great, thank you! I'm using STAR so this should work just fine. Again, thanks so much!!
Oh, and one more question about this. I have to add it to the GFF file if I want STAR to map it, don't I?
Not necessarily unless you are using STAR to do your gene/exon counts. Otherwise, you could just count the alignment to the specific chromosome.