Generating Hits files for BlobTools
1
0
Entering edit mode
15 months ago
Takuma ▴ 20

To decontaminate bacteria and viruses nucleotide, I am trying to perform BlobTools for assembled genome of one beetle.

In the step for generating hits files, I did diamond blastx:

diamond blastx \
--query MyAssembledGenome.fa \
--db uniprot_sprot.dmnd \
--max-target-seqs 1 \
--sensitive \
--threads 8 \
--evalue 1e-25 \
--outfmt 6 \
--out result.out

Consequently, only 4664 scaffolds were aligned against uniprot/swiss-prot database. But the MyAssembledGenome.fasta has about 55,000 scaffolds in total. Is the small number of hit scaffolds usually happened ?

And how should I treat non-hit scaffolds ? removing them for downstream analyses, such as identification of transposable element ?

BlobTools UniProt • 625 views
ADD COMMENT
0
Entering edit mode
5 months ago
jtorre28 • 0

You could try using a larger database such as ncbi's nr database in fasta format instead of the default format then from there diamond can add the nr database fasta files as a database.

The swissprot database can be used later for your annotations since it is more accurate.

ADD COMMENT

Login before adding your answer.

Traffic: 1923 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6