The BUSCO assessment of the MAKER annotation results shows a low completeness

0

Entering edit mode

29 days ago

mut • 0

I performed the first round of MAKER annotation for my genome using RNA-seq and homologous proteins. The BUSCO assessment of the resulting proteins showed a completeness of only 64%. I then selected genes with AED less than 0.1 to train Augustus and SNAP, and performed a second round of MAKER annotation. However, the BUSCO assessment of the resulting proteins is still very low, only 53%. Why is this happening?

enter image description here

BUSCO MAKER • 884 views

ADD COMMENT • link updated 5 days ago by Abieskawa • 0 • written 29 days ago by mut • 0

0

Entering edit mode

What did you do before MAKER annotation, did you do repeat masking?

ADD REPLY • link 29 days ago by Abieskawa • 0

0

Entering edit mode

I used RepeatMasker independently and generated a GFF3 format file, which was used as input for MAKER

#

enter image description here

"I would like to know if it is normal that the BUSCO evaluation of the proteins and transcripts output from the first round, where I only used EST and protein for annotation, is only around 60%. When I use the output from the first round to train de novo prediction tools such as Augustus and SNAP and run the second round of MAKER, the BUSCO evaluation of the proteins is still very poor. Why is that?"

ADD REPLY • link 29 days ago by mut • 0

0

Entering edit mode

Please do not paste screenshots of plain text content, it is counterproductive. You can copy paste the content directly here (using the code formatting option shown below), or use a GitHub Gist if the content volume exceeds allowed length here.

code_formatting

ADD REPLY • link 16 days ago by Ram 44k

0

Entering edit mode

I think 60% is way too low

ADD REPLY • link 5 days ago by Abieskawa • 0

0

Entering edit mode

What's the completeness of the genome alone?

ADD REPLY • link 29 days ago by Michael 55k

0

Entering edit mode

BUSCO assesses the completeness of the genome as 96%.

ADD REPLY • link 29 days ago by mut • 0

0

Entering edit mode

Then clearly something is wrong with the annotation by MAKER .. (as this number indicates that the built-in annotation process of BUSCO is out-performing MAKER)

ADD REPLY • link 29 days ago by lieven.sterck 15k

0

Entering edit mode

I did not try Maker before, but my colleagues did. I am not sure the BUSCO, but number of the predicted genes was around 50,000 in Tilapia. We cannot sure it is the problem of Maker or the repeat masking process (he did not apply repeatmodeler build custom repeat database first), and there are more pipeline tools can be applied, so we stop using it now. You can try:

give up Maker, use Braker or Ginger
Inspect the result of repeat masking, if it masks too much or other problems

ADD REPLY • link 28 days ago by Abieskawa • 0

0

Entering edit mode

Can you try without rnaseq data?

ADD REPLY • link 29 days ago by samuel.a.odonnell ▴ 580

0

Entering edit mode

How did you run BUSCO? and what database of BUSCO did you use ? What version of MAKER ?