Entering edit mode
7.9 years ago
pourbiostars
•
0
Hello,
I'm actually using OMA with this parameter:
- " # Type of input sequence data, has to be either 'DNA' or 'AA' InputDataType := 'DNA'; "
So OMA knows that I'm using DNA sequences. But I find this parameter inside the parameter file:
- " # alignments which have a score lower than MinScore will not be considered. # The scores are in Gonnet PAM matrices units. MinScore := 181; "
Gonnet PAM matrices being amino acid based, I would like to know if OMA convert DNA into AA before the allVSall step ?
Thank you.
Best regards.
Thank you for your answer,
When comparing species (from the same plant taxonomy family) does using CDS sequence instead of AA matter when the clustering is done ?
We recommend using protein sequences, because we have done much more benchmarking on this type of data. There are practical advantages too: public genomes exported from OMA, including precomputed all-against-all, are only available at for protein sequences. By contrast, the DNA-level comparison support is still experimental in OMA. We have started using it for non-coding regions.
So for coding regions, I strongly recommend that you use the translated sequences.