Entering edit mode
2.7 years ago
O.rka
▴
740
I'm trying to understand the logic of converting the complex MetaEuk
identifiers to the more simple identifiers adopted by BUSCO
so I can reproduce a manual step for gene calls prior to BUSCO. However, what I thought was the case isn't actually correct. I thought you just get the high and low values.
>Transcript_22646|m.42971|NODE_14273_length_2360_cov_2.257701|+|428|5.129e-119|1|98|742|98[98]:742[742]:645[645]
VPVFIMPLFNKYEPLDDGTLKTRIYELAASLKYPLTKLFVMDGSKR...
>Transcript_22646|m.42972|NODE_14273_length_2360_cov_2.257701|-|290|1.787e-77|2|98|742|742[742]:356[356]:387[387]|283[283]:98[98]:186[186]
HQGRMTVMIKRIILVRVHGAQVFQMNLAQTTLQLFGPTQINRKAIG...
Both of these (with my incorrect logic) would return the following:
NODE_14273_length_2360_cov_2.257701_98:742