I open this question because I didn't find much information about this topic so far. I am trying to upload data from a RNA-seq project to GEO (387,5GB) with UNIX command line and I am getting the error:
Lost data connection to remote host after 1xxxxxxxx bytes had been sent: Broken pipe.
The number of bytes being variable each time. After asking the IT service in my institution, they told me that the FTP protocol is very slow and the broken connection is expected for such big files.
I solved The issue using the scipt in this post
However, it will be helpful if anybody can share other answers to this problem, maybe also to improve the speed, as it is taking a lot of time to submit all files.
Thanks a lot!
That does not sound like a bioinformatics question to me!
Submit your data to ArrayExpress, it has a better interface for metadata management and file uploading. The direct FTP connection is also fast, accession ids are provided within a couple of hours and with a week they will provide the reviewer account details.
Thanks! I will definitely try ArrayExpress next time.