retro-engineered annotation on genome assembly
8.2 years ago
guillaume.rbt ★ 1.0k

Hi everyone,

I'm working on a fungal species for which I have two genome assemblies, one for each of two different strains, and one annotation for each assembly.

By comparing the peptide sequences from the two annotations with a BDBH (bidirectional best hit) analysis, I get a set of common proteins and a set of proteins specific to each strain.

From BLAST searches against the genome assemblies, I know that most of the "strain-specific" genes are in fact also present in the other strain's genome (the difference is probably due to the different annotation software used).

What I would like to do is retrieve the sequences of one strain's "specific" genes from the other strain's genome, so that I can complete the annotation.

Would anybody have a clue how to do this?

Thanks

Assembly blast bdbh annotation peptides • 1.6k views
8.1 years ago
Bill Pearson ★ 1.0k

A possible strategy:

(1) Run blastp with all of the fungus1 proteins against fungus2, and vice versa. Find the proteins in fungus1 that have no significant hit in fungus2 (possibly applying percent identity and coverage thresholds), or whose hits cover only part of the protein, and vice versa.

(2) Take the proteins in fungus1 and fungus2 that do not have a match in the other fungus, and search them with tblastn (or tfastx) against the other fungal assembly. I would expect that many of the fungal proteins that do not match, or match only partially, will be found by the tblastn (tfastx) search. tfastx will be slower, but much less sensitive to frameshift errors in the assembly.
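The filtering in step (1) could be sketched roughly as below. This is only an illustration under assumptions that are not in the post: the blastp results are taken to be tab-separated with the columns qseqid, sseqid, pident, qstart, qend, evalue, qlen (for example from a custom `-outfmt 6` column list), the thresholds are arbitrary placeholders, and the function names and example protein IDs are made up. Coverage is computed per alignment, so several partial alignments to the same subject are not merged.

```python
import csv
import io

# Assumed column order of the tabular blastp output (an assumption, not from the post).
COLUMNS = ["qseqid", "sseqid", "pident", "qstart", "qend", "evalue", "qlen"]


def well_matched_queries(blast_tsv, min_identity=50.0, min_coverage=0.8, max_evalue=1e-5):
    """Return the IDs of queries with at least one hit passing all thresholds."""
    matched = set()
    reader = csv.DictReader(blast_tsv, fieldnames=COLUMNS, delimiter="\t")
    for row in reader:
        # Fraction of the query protein covered by this single alignment.
        aligned = int(row["qend"]) - int(row["qstart"]) + 1
        coverage = aligned / int(row["qlen"])
        if (float(row["pident"]) >= min_identity
                and coverage >= min_coverage
                and float(row["evalue"]) <= max_evalue):
            matched.add(row["qseqid"])
    return matched


def unmatched_queries(all_ids, blast_tsv, **thresholds):
    """Return query IDs with no hit, or only partial or weak hits, sorted by name."""
    return sorted(set(all_ids) - well_matched_queries(blast_tsv, **thresholds))


# Made-up example: protA is well covered, protB is only partially covered,
# and protC has no hit at all.
example_hits = io.StringIO(
    "protA\tf2_001\t92.5\t1\t300\t1e-150\t310\n"
    "protB\tf2_014\t88.0\t1\t120\t1e-40\t400\n"
)
candidates = unmatched_queries(["protA", "protB", "protC"], example_hits)
print(candidates)
```

The resulting IDs are the proteins whose sequences would then go into the tblastn (or tfastx) search of step (2) against the other assembly.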

Thank you Bill for your answer, I will try that.
