I have no experience in annotation RNA seq data (or genome assembly) but I've spent the last weeks ploughing through some papers and looking at manuals.
I am wondering why it seems so common that e.g. proprietary pipelines of sequencers are not used?
E.g. the de novo genome assembly pipeline devised by PacBio seems to be cited less than hifiasm or other open source programs. Is this mostly because they are charging a higher price, so people move over to other options, or is it becaue the field is moving at a fast pace and programs are written and benchmarked making the proprietary pipelines obsolete?
1) Is PacBio Iso-Seq annotation a good choice, or is it surpassed by some recently published/improved tool that I haven't read of yet?
2) Using Full-length RNA sequencing to annotate genomes; What are the best performing tools (in terms of speed and accuracy) in 2021 this type of genome annotation?
2) Time is money computing cost can vary significantly people have to budget their projects according to funding. I assume that the fastest one will be used most, thus cited most? E.g. Hifiasm replacing Falcon.... Also I assume that anyone publishing a new pipeline will only publish if it at least the same in accuracy, if not better than pipelines already on the market or open source. I guess that boils down to people not publishing before their tool is better than other tools.
The question 2 is still standing :)
Time is indeed a factor (though in my institute we're quite fortunate not having to care too much about it). Nonetheless, quality still prevails and should be the first criteria, if there are tools that perform similarly, then yes you can go for the fastest one. On the other hand, would you choose a fast tool over a slower one if the former performs much less than the latter?
Ah, there your assumption is wrong, there a likely plenty of reasons why people publish something: getting funding, project is dying anyway, ... I would not assume that since it's published more recently it will work better or faster. Of course in that paper their tool will perform better and faster but that is not always true when seeing the general picture.
For the reason I have put forward already, speed is rarely a criteria in genome annotation. Moreover the time it takes to perform an annotation is so dependent on several other factors it rarely is the tool itself that determines the runtime. I mean an annotation done with only intrinsic info (not even trained for a particular species) will run lighting fast. The one I used to use a lot (EuGene, INRA) will annotate a Gb genome in minutes in this mode. However, then I'm not taking the weeks/months of optimizing it for a new species into account. If I use that same tool but would like to throw in some more data (proteins, RNAseq, ... ) it will do the annotation in, let's say, few hours but then I'm not taking the time to perform protein alignments, RNAseq mapping, ... into account (if I add that it will likely be days to weeks) .... I hope you understand that the tool itself is much less of a factor and there are many other factors in play that determine (total) runtime.