What are the best methods to maintain up-to-date biological database?
I used a gnome from NCBI which was a large gz file. Now there is a newer release available in gz format. Every-time there is a change in database I have to download it completely and reanalysis the data. Same is the problem with some other databases. Is there a way to just only get the information which have been updated only?
Thanks. I am interested in gz files or any flat file data. Is there any tool/method of just only downloading changes!
No, it doesn't look like UCSC publishes diffs. It is probably easier for them to publish compressed files. You may need to download the entire file.
UCSC does publish a number of "diffs", or in this context chain files, between a number of different assemblies in different organisms here. i will update the answer to be more detailed.