Hi, I read the samtools documentation for the pileup format, but am not clear about the difference between lower and upper case letters.
For example: chr2 777777 T 31 ..........,,,,,,,,,,CCcccCCCCCC
So, the refrernce is T, and the c/C difference is which strand the letter is, right? does it mean that the F strand had a C and the R strand had a G that is here written as c to account for direction? Is it correct to sum up all the Cs together?
thanks for all the help!
Hi , Yes you understood rigth for reverse strand you read is reverse to align it , then it means c is g :) Yes you can sum all the c/C together because of complementarity of DNA. But for example if your gene is in reverse strand DNA and you want to know the translation of the protein consequences of your mutation you will need to use g. Best