I have identified the core ortholog set (core genome) of 30 bacterial strains using the NCBI BLAST package. However, finding the pan-genome (unique genes + accessory genes + core genes) of the same 30 organisms is proving very tedious, because it is not practical to align each genome against every other, around 900 alignments overall. I can't work out any other logical approach that would let me determine the accessory genes without repeats. Please help. Thanks in advance.
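To make the unique + accessory + core split concrete, here is a minimal sketch in Python. It assumes the genes have already been grouped into ortholog clusters by some earlier step, which is not shown here. The function name `classify_pan_genome`, the `(strain, gene_id)` input layout, and the toy strain names are all hypothetical. Because each cluster is counted exactly once, no gene family is repeated in the result.

```python
def classify_pan_genome(clusters, n_strains):
    """Split ortholog clusters into core, accessory and unique.

    clusters  : dict mapping cluster id -> list of (strain, gene_id) pairs
                (hypothetical layout, assumed to come from a prior clustering step)
    n_strains : total number of strains in the dataset (30 in the question)
    """
    result = {"core": [], "accessory": [], "unique": []}
    for cluster_id, members in clusters.items():
        # Count distinct strains, so paralogs within one strain count once.
        strains = {strain for strain, _gene in members}
        if len(strains) == n_strains:
            result["core"].append(cluster_id)        # present in every strain
        elif len(strains) == 1:
            result["unique"].append(cluster_id)      # present in a single strain
        else:
            result["accessory"].append(cluster_id)   # present in some, not all
    return result


# Toy data with 3 strains (A, B, C); real input would have 30.
toy_clusters = {
    "c1": [("A", "a1"), ("B", "b1"), ("C", "c1")],
    "c2": [("A", "a2"), ("B", "b2")],
    "c3": [("C", "c9")],
    "c4": [("A", "a3"), ("A", "a4")],  # two paralogs in one strain
}
toy_result = classify_pan_genome(toy_clusters, n_strains=3)

# The pan-genome is the union of the three categories.
pan_genome_size = sum(len(ids) for ids in toy_result.values())
print(toy_result, pan_genome_size)
```

The sketch counts distinct strains per cluster rather than genes, so a cluster containing only paralogs from one strain is still classed as unique.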
First, I would like to commend you for pursuing this over the years; that is very admirable.
Now, as far as scientific software goes, there are quite a few more essential steps.
Where is the source code? Where is the documentation? Where is the description of what the tool actually does? Where are the example inputs and outputs? All of that is required for proper scientific software.
I and many others would object to running an executable, especially a Windows-based one.
Put your code on GitHub instead of the awful SourceForge, open the source, and show what you can do. Most good companies hire off of GitHub directly; I have myself received many offers on the strength of my GitHub account alone.
Thanks for the appreciation. I am planning to compile Linux and Mac executables too. Of course, your suggestion about GitHub is better.
I am referring to the source code: let people compile your code themselves so that there is less danger of a compromised binary.
I would strongly urge everyone to NEVER download and run binaries.
I have found that the way to tell who is a novice programmer is whether they are willing to show their source code. Those who are starting out often seem to think their code is somehow precious and that everyone is out to steal it and sell it as their own. Nothing could be further from the truth.
Let people use and understand what the software does. Put whatever license you want on it; if you want to retain commercial rights, so be it. Do everything you can to demonstrate that it is worth other people's time.