Question

Predict species from DNA sequences

0

Entering edit mode

23 months ago

p2016k • 0

Dear all,

I want to predict representative species from a set of DNA sequencing reads (say 100 bp each).

For this, I am looking for resources/help to perform this task. For instance, what set of models would be ideal to try?

My best accuracies (avg of 0.5) were achieved using binary classification (NB, RF, DT) using for training genomic sequences, trimmed to 100 bp (or other) of several species . Considering species specific Kmers didn't enhance the metrics.

I appreciate any comments/resources/help,

Best and thanks in advance.

Species Classification ML • 606 views

ADD COMMENT • link updated 23 months ago by natasha.sernova ★ 4.0k • written 23 months ago by p2016k • 0

score 0 · Answer 1 · 2023-05-09

0

Entering edit mode

23 months ago

natasha.sernova ★ 4.0k

There was a review in 2003 that might be helpful. Cross-Species Sequence Comparisons: A Review of Methods and Available Resources Kelly A. Frazer,1,6 Laura Elnitski,2,3 Deanna M. Church,4 Inna Dubchak,5 and Ross C. Hardison3

ADD COMMENT • link 23 months ago by natasha.sernova ★ 4.0k