Entering edit mode
6.9 years ago
cjgunase
▴
50
I want to cluster the genomic bins based on pairwise correlation > 0.75 into by sequentially traversing the table. (sample is shown in the image). If correlated the bins are averaged and the next bin is considered. soon the correlation is calculated sequentially until the criteria (0.75) fails. and start from the failed bin and continues again.
I am working on a script but If any one knows existing algorithm/ tool which can do this please let me know.
Thank you
https://drive.google.com/open?id=1PmmoPYCSzFwCM2NueNRzboy2rr12uB2g
This looks like something that could easily be coded. I would encourage you to think about how you would do it, and then begin to actually do it. Getting the data into R would help.
You will help yourself in your future career by beginning to work on these highly customised approaches on your own merit.
Good luck
Kevin