Entering edit mode
12.2 years ago
GPR
▴
390
Hello, I am planing an experiment, where I will collect RNA-seq data on a 3'-polyA capture library. My question is, after alignment with Bowtie, what is the best way to retrieve the gene names of the loci to which the data aligned? These are short reads that map to the 3'-end of transcripts and that's it. Any idea, will be appreciated. G.
Do you have a gene model file for the organism? Then after mapping, its basically the same procedure. You could just count the number of reads mapped to each gene, albeit it maps to the 3' region of your gene. If necessary, to filter out certain mapping errors, you could also choose a window size from the 3' region of your gene and obtain counts over that region. Those with counts are the ones you are interested in (typically with a certain threshold over the number of read counts per million, cpm).