Hello everyone I'm looking for a function in python or Biopython that can calculate the tetranucleotide frequency
of a given regions of scaffold.
The idea is that I have several regions and I want to identify possible changes in nucleotide composition
that correspond to the an endogenization
regions within my genome, for that I need to calculate theTNFs across regions for these contigs. I then need to calculate the Pearson correlation of these frequencies compared to the TNF of a set of the largest contigs in these genome assemblies (these contigs being probably really from the genome and were not endogenized).
Does someone know a such package in python?
Thanks you