Hi all,
My colleague has a cell line model, of which it is mouse cell but a human gene is inserted and expressed. I was going to use STAR then RSEM to do the quantification. However, I am not sure if I need to play with the settings for multi-mapped reads since the human ortholog might be very similar to the mouse ortholog. Or I should say, in general, I am not sure when options like –outFilterMultimapNmax in STAR should be set manually (I have heard that it's necessary for pseudogenes and repeats, but not sure about orthologs).
I assume that I should first modify the fasta and gtf file to create an artificial chromosome for the gene, then try to align it with various setting if the two genes are very similar?
Many thanks
I might suggest seeing how similar the sequences are, the more similar they are... MAYBE you can avoid adding the exact human sequence in the alignment index.
However, I think it's probably best to add the human FASTA sequence to the sequences you use to generate the genome index. That will be most accurate.