Hi, I am looking to make a machine learning model for identifying antibiotic resistant in bacteria. I have 400 assembled fasta files of salmonella dublin and a metadata file (sample id, date, genes etc). However, I have a fundamental skills of machine learning. Any suggesting would be appreciated of proceeding the dataset for making a model.
Thank you,
Thank you for your reply. I am a little bit off route of my question. It has a correction, it is about predicting antimicrobial MICs.
I am trying to follow these two papers (link below). They are using k-mer/unitigs based approach to get MICs using machine learning. However, I am still trying to figure out how to get unitigs from my assembled fasta files. If I use AMRFinder or ResFinder, I will get the AMR genes with nodes and genomic information but the nodes sequence is larger than a unitig.
https://journals.asm.org/doi/pdf/10.1128/JCM.01260-18 https://www.sciencedirect.com/science/article/pii/S1319562X22001309
The less information you provide, the less useful suggestion you get. It may not matter to you, but that also means we are less productive in writing our answers.