Hello, I have mapped BAM files (from bwa) and I want to find out which base of a read overlaps a specific base pair in the genome (i.e. base 4 of 55). I also want to extract which base is at that position.
I've made some progress on this: I can obtain the reads that overlap the site, their lengths, and their starting positions. However, I'm having a hard time figuring out how to extract the specific base, and whether the result depends on the read being in the forward or reverse direction.
For example, I'd like to extract position 96244549 from the following two reads. Any help would be appreciated!
MYRUN 16 chr5 96244500 37 100M * 0 0 CAAGATTACTCAGCTAGTCAGTGGCAGAGCTAAGGTTTAAACTGAATGGTCCAGCTATAATGACCTACTTCTGTTATTGTTCTCTCTTTCATATTTGTTT DDDDDIIIIIIIIIJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJIIIIIIIII
MYRUN 16 chr5 96244531 37 62M * 0 0 AAGATTTAAACTGAATGGCCCAGCTATAATGACCTACTTCTGTTATTGTTCTCTATTTCATA JJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJ
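
To make the arithmetic concrete, here is a minimal sketch of the lookup in plain Python, without any BAM library. The function name `base_at_ref_pos` and the variable names are hypothetical, and it assumes POS is the 1-based leftmost mapped position and that SEQ is stored in reference (forward-strand) orientation, as the SAM format describes. It walks the CIGAR so that insertions, deletions and soft clips shift the index correctly; for the two reads above (100M and 62M) it reduces to `target - POS`.

```python
import re

# CIGAR operations as listed in the SAM format: length followed by one op letter.
CIGAR_RE = re.compile(r"(\d+)([MIDNSHP=X])")

def base_at_ref_pos(pos, cigar, seq, target):
    """Return (0-based index into SEQ, base) for 1-based reference position
    `target`, or None if the read has no base aligned there."""
    ref = pos      # reference coordinate of the next reference-consuming base
    query = 0      # 0-based index into SEQ of the next read-consuming base
    for length, op in CIGAR_RE.findall(cigar):
        length = int(length)
        if op in "M=X":                      # consumes read and reference
            if ref <= target < ref + length:
                idx = query + (target - ref)
                return idx, seq[idx]
            ref += length
            query += length
        elif op in "IS":                     # consumes read only
            query += length
        elif op in "DN":                     # consumes reference only
            if ref <= target < ref + length:
                return None                  # site is deleted/skipped in this read
            ref += length
        # H and P consume neither read nor reference
    return None

# SEQ fields copied from the two records above.
read1_seq = "CAAGATTACTCAGCTAGTCAGTGGCAGAGCTAAGGTTTAAACTGAATGGTCCAGCTATAATGACCTACTTCTGTTATTGTTCTCTCTTTCATATTTGTTT"
read2_seq = "AAGATTTAAACTGAATGGCCCAGCTATAATGACCTACTTCTGTTATTGTTCTCTATTTCATA"

print(base_at_ref_pos(96244500, "100M", read1_seq, 96244549))  # (49, 'T')
print(base_at_ref_pos(96244531, "62M", read2_seq, 96244549))   # (18, 'C')
```

On the strand question: because SEQ for a read with flag 16 is already reverse-complemented onto the forward reference strand, the base returned here is the forward-strand base and no extra flipping is needed for the lookup itself. If the position within the read as it came off the sequencer is what matters, then for a reverse-strand read (and ignoring hard clipping) it would be `len(seq) - idx` counted 1-based from the original 5' end, and the original base is the complement of the one returned.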