I have a file with Transcription Factors (TF) and the predicted Target Genes (TG) they regulate, followed by the regulation score of the transcription factor on the target gene:
TF TG Score
SOX2 STMN2 0.34215
SOX2 SALL1 0.124431
SOX2 PDPN 0.0443292
SOX2 UGT8 0.0371112
PRDM1 APLNR 0.0197349
I have a file like this for two samples. Each file has different transcription factors, target genes, and regulation scores. I would like to find the % similarity between the files.
Ideally the % similarity would reflect if the same transcription factor / target gene pair is present in both files, and if the regulation score is close to the same value or not (ie if regulation score 1 - regulation score 2 is large then less similarity, if small then more similarity). How best to code this in R?
I use these files to make heatmaps, so if there is a tool for comparing heatmaps, that could also work.