I have 4500 protein sequences with very low similarity to one another. The shortest are at least 40 residues long, and the longest are roughly 300 to 500. Does this low similarity and variation in length affect the quality of a phylogenetic tree built from the output of a multiple sequence alignment (MSA)?
What should I do if I want to align all of the protein sequences without losing any useful information? Should I curate the data further first? If yes, how?