Hello Everyone,
I have detected a duplication in a population using whole-genome sequencing. Now I want to see if the duplicated version of the gene is also being expressed. Fortunately we have some expression data from the same individuals generated using RNA-SEQ. I have never handled RNA-SEQ data before but somebody else in the lab has been generating and processing this data. Is there a straightforward way (like a tool) to extract all the transcripts that have been mapped to a specific gene? The goal is to see if transcripts exist with the variants that we see in the exons of the duplicated version of the gene to know if it is being expressed.
Thanks in advance for your help!
Best, Rubal
I don't think this will work without a bit of clever processing. If the gene is duplicated, a lot of reads will map equally well to both genes, and probably just be assigned to one of them randomly (depending on the aligner).
If there are SNVs specific to the duplicated copy of the gene, then looking at the proportion of that snp at each site will tell you how frequently it was expressed, relative to the other alleles.